Getting Bulk Data Through Google: An empirical study

Abstract

To store the information in a database is one of the major tasks. The efficient storage of data is important for future use. Information retrieval is a method of gathering information related to input queries from the various sources or stored databases. To retrieve the information, a search engine plays an important role. A web search engine creates an index to match queries. The quality of information is improved with the help of search engine. For retrieving the information, a search engine comprises some modules such as query processor, a searching and matching function, document processor and page rank capability. This paper focuses on the retrieval of web documents against input queries and stores them in to database. A Google search API can be used to fetch the results. It analyses the data by processing through these modules and downloads the content available in different formats.

  • Page Number : 39-48

  • Keywords
    Web crawling, indexing, page ranking, retrieve pdf documents, query processing, search engine implementation, web search

  • DOI Number
    https://doi.org/10.15415/jtmge.2016.72002

  • Authors

    • Shama RaniME Scholar, Chitkara University, India.
    • Jaiteg SinghProfessor, CURIN Chitkara University, India.

References

  • Beel, J., Gipp, B. and Wilde, . “Academic Search Engine Optimization (ASEO): Optimizing Scholarly Literature for Google Scholar & Co.”. Accessible at: http:// www.beel.org/ files/papers/2010-ASEO--preprint.pdf (last accessed 24 August 2012); and Hoyt, Jason.
  • Mendeley blog. 29 November 2010. Academic SEO – Market (and Publish) or Perish.
  • Madhu, G., Govardhan, A. and Rajinikanth, T. V. (2011) “Intelligent Semantic Web Search Engines: A Brief Survey” International journal of Web & Semantic Technology (IJWesT) Vol.2, No.1, January 2011.
  • Prakash, K. S. V. “Concept of Search Engine Optimization in Web Search Engine” International Journal of Advanced Engineering Research and Studies E-ISSN2249–8974.
  • Dirk Lewandowski “New perspectives on Web search engine research” Lewandowski, Dirk Journal of Technology Management for Growing Economies, Volume 7, Number 2, October 2016 (ed.): Web Search Engine Research. Bingley: Emerald Group Publishing, 2012.
  • Mike Grehan “How Search Engines Work” 2 publication of Search Engine Marketing: The Essential Best Practice Guide.
  • Bar-Ilan, J. (2004). The use of web search engines in information science research. In B. Cronin (Ed.), Annual review of information science and technology (Vol. 38, pp. 231- 288).
  • Medford, NJ: Information Today, Inc Mr.K. Tarakeswar and Ms. D. Kavitha “Search Engines: A Study” Journal of Computer Applications (JCA) ISSN: 0974-1925, Volume IV, Issue 1, 2011.
  • Mark Levene, “An Introduction to Search Engines and Web Navigation”, John Wiley & Sons, Inc., 2010.
  • Sergey Brin and Lawrence Page “The Anatomy of a Large-Scale Hyper textual Web Search Engine”.
  • Tom Seymour “History of Search Engines” International Journal of Management & Information Systems – Fourth Quarter 2011 Volume 15, Number 4. Diana Inkpen “Information Retrieval on the Internet”

  • Published Date : --