An interesting article has been making the rounds of health library listservs. Comparing search results from PubMed and Google Scholar based on 4 clinical questions, the authors (from Texas Tech University Health Sciences Center and Carnegie Mellon University) conclude:
“PubMed searches and Google Scholar searches often identify different articles. In this study, Google Scholar articles were more likely to be classiﬁed as relevant, had higher numbers of citations and were published in higher impact factor journals. The identiﬁcation of frequently cited articles using Google Scholar for searches probably has value for initial literature searches.”
This is where those critical analysis skills come in handy, and it helps to know some of the behind-the-scenes workings of the tools you’re working with – in this case, PubMed and Google Scholar. Google’s immense popularity – especially among students – makes this a highly relevant discussion, and so I’ve made a quick comparison table that addresses much of the umbrage that has poured out (in librarian circles, in any case) in response to the article:
|Transparency: What’s being searched?||MEDLINE dataset||“academic papers from sensible websites”|
|Search result ranking||By date of publication||Super-secret relevancy algorithm|
There are undoubtedly benefits to using Google Scholar – you can capture things outside the scope of your more traditional, published biomedical lit, including dissertations, conference abstracts, posters, and other content that can go into institutional repositories (such as the University of Michigan’s Deep Blue) but not into the MEDLINE dataset.
BUT. There are also serious drawbacks to using Google Scholar, and personally I think this is what users really need to understand. One MEDLIB-L discussion participant put it succinctly:
“you’re searching a database that you have no editorial definition for or ability to reproduce your results and/or search string.”
Google is famous (notorious?) for not divulging exactly what its bots crawl & index – we’ve covered it briefly before when Scholar introduced a version of citation metrics. That means if you’re searching with the assumption that your search is grabbing everything – well I hesitate to say that you’re wrong, but you certainly can’t be certain.
Knowing how your tool works on the back end is important, because it will affect your search results, but that knowledge is not a substitute for covering your bases. The bottom line? Don’t default to one search tool – especially if your search is meant to be thorough or comprehensive.