Getting Bulk Data Through Google: An empirical study

Authors

  • Shama RaniME Scholar, Chitkara University, India.
  • Jaiteg SinghProfessor, CURIN Chitkara University, India.
Keywords
Web crawling, indexing, page ranking, retrieve pdf documents, query processing, search engine implementation, web search

Abstract

To store the information in a database is one of the major tasks. The efficient storage of data is important for future use. Information retrieval is a method of gathering information related to input queries from the various sources or stored databases. To retrieve the information, a search engine plays an important role. A web search engine creates an index to match queries. The quality of information is improved with the help of search engine. For retrieving the information, a search engine comprises some modules such as query processor, a searching and matching function, document processor and page rank capability. This paper focuses on the retrieval of web documents against input queries and stores them in to database. A Google search API can be used to fetch the results. It analyses the data by processing through these modules and downloads the content available in different formats.

References

  • Beel, J., Gipp, B. and Wilde, . “Academic Search Engine Optimization (ASEO): Optimizing Scholarly Literature for Google Scholar & Co.”. Accessible at: http:// www.beel.org/ files/papers/2010-ASEO–preprint.pdf (last accessed 24 August 2012); and Hoyt, Jason.
  • Mendeley blog. 29 November 2010. Academic SEO – Market (and Publish) or Perish.
  • Madhu, G., Govardhan, A. and Rajinikanth, T. V. (2011) “Intelligent Semantic Web Search Engines: A Brief Survey” International journal of Web & Semantic Technology (IJWesT) Vol.2, No.1, January 2011.
  • Prakash, K. S. V. “Concept of Search Engine Optimization in Web Search Engine” International Journal of Advanced Engineering Research and Studies E-ISSN2249–8974.
  • Dirk Lewandowski “New perspectives on Web search engine research” Lewandowski, Dirk Journal of Technology Management for Growing Economies, Volume 7, Number 2, October 2016 (ed.): Web Search Engine Research. Bingley: Emerald Group Publishing, 2012.
  • Mike Grehan “How Search Engines Work” 2 publication of Search Engine Marketing: The Essential Best Practice Guide.
  • Bar-Ilan, J. (2004). The use of web search engines in information science research. In B. Cronin (Ed.), Annual review of information science and technology (Vol. 38, pp. 231- 288).
  • Medford, NJ: Information Today, Inc Mr.K. Tarakeswar and Ms. D. Kavitha “Search Engines: A Study” Journal of Computer Applications (JCA) ISSN: 0974-1925, Volume IV, Issue 1, 2011.
  • Mark Levene, “An Introduction to Search Engines and Web Navigation”, John Wiley & Sons, Inc., 2010.
  • Sergey Brin and Lawrence Page “The Anatomy of a Large-Scale Hyper textual Web Search Engine”.
  • Tom Seymour “History of Search Engines” International Journal of Management & Information Systems – Fourth Quarter 2011 Volume 15, Number 4. Diana Inkpen “Information Retrieval on the Internet”

How to Cite

Shama Rani, Jaiteg Singh. Getting Bulk Data Through Google: An empirical study. J.Technol. Manag. Grow. Econ.. 2016, 07, 39-48
Getting Bulk Data Through Google: An empirical study

Current Issue

PeriodicityBiannually
Issue-1May
Issue-2November
ISSN Print0976-545X
ISSN Online2456-3226
RNI No.CHAENG/2013/50088
OA Policy

Publisher's policy of the journal at Sherpa UK for the submitted, accepted, and published articles. Click OAPolicy

Plan-S Compliance

To check compliance, one has to use the Journal Check Tool (JCT). This tool provided by cOAlition S (European funders) for the researchers (fundee) to check the compliance with the journal.

Recommend journal to your library

You can recommend the journal being a researcher or faculty member to your library. We will post a copy of the Journal to your library on your behalf at free of cost.
Click here: Recommend Journal

Preprint Arxiv Submission

The authors are encouraged to submit the author’s copy (preprint) to appropriate preprint archives e.g. https://arxiv.org and/or on https://indiarxiv.org or institutional repositories (e.g., D Space) before paper acceptance by the editor of Journal. After publications of the paper author(s) should mention the citation information, title and abstract along with DOI number of the publication carefully on the required page of the depository(ies).

Contact: Phone: +91-172-2741000, +91-172-4691800

Email : editor.tmg@chitkara.edu.in;

Abstract and Indexing

Information

This work is licensed under a Creative Commons Attribution 4.0 International License.

Articles in Journal of Technology Management for Growing Economies(J.Technol. Manag. Grow. Econ.) by Chitkara University Publications are Open Access articles that are published with licensed under a Creative Commons Attribution- CC-BY 4.0 International License. Based on a work at https://tmg.chitkara.edu.in/. This license permits one to use, remix, tweak and reproduction in any medium, even commercially provided one give credit for the original creation.

View Legal Code of the above-mentioned license, https://creativecommons.org/licenses/by/4.0/legalcode

View Licence Deed here https://creativecommons.org/licenses/by/4.0/

Creative Commons License

Journal of Technology Management for Growing Economies by Chitkara University Publications is licensed under a Creative Commons Attribution 4.0 International License.
Based on a work at https://tmg.chitkara.edu.in/

Members