In general

 

Our mission is to track the conversations around research outputs online wherever they're happening. To that end we're constantly looking for new sources of data.

 

Specifically what we're looking for are sources that regularly link to scholarly content, though in some cases like newspapers and policy documents we're also able to perform text mining to try to match research mentioned (but not linked to) with published scholarly articles. We can only do this robustly under certain circumstances.

 

Sometimes we're held back by the level of access available and sometimes the source just doesn't contain enough content to be worth tracking, but we know there are plenty of sites that we should track and just haven't gotten round to yet. If you know of any please do let us know with the feedback option or by email.


Research outputs 


We are actively tracking mentions related to the following types of research outputs;

  1. Books
  2. Book Chapters
  3. Journal Articles
  4. Presentations
  5. Theses/Dissertations
  6. Reports 
  7. Conference proceedings
  8. Reviews
  9. Data Sets
  10. Working papers
  11. Grey literature
  12. Clinical Trials
  13. And more!

If a research output is from a domain we have whitelisted and contains a persistent identifier we can track it. 

Attention sources

 

Altmetric currently tracks the following attention sources for mentions of research outputs. Where possible we surface the original text of each mention, and in some cases are also able to provide demographic data on the author of the mention. It's crucial to us that all of our data is fully auditable, and that you can see not only how many people are talking about the research, but who they are and what they're saying. 

 

1) Policy documents   

 

We track a wide range of public policy documents for mentions. We define policy documents as any policy, guidance, or guidelines document from a governmental or non-governmental organisation.

 

2) Mainstream media 

    

You can check out the news sources page on the Altmetric website for the latest list of news sources that we track. This list currently extends to over 2,900 English and non-English global news outlets.    

 

3) Blogs 

 

We maintain a manually curated list of over 14,000 academic and non-academic blogs. These are tracked automatically via RSS feeds.    

 

4) Online reference managers 

  • Mendeley       
  • CiteULike   

 

5) Post-publication peer-review forums

  • PubPeer
  • Publons   

 

6) Social media

  • Twitter (public tweets, quoted tweets and retweets only, no favourites)       
  • Facebook (posts on public Pages only, no individual timeline posts and no likes)       
  • Google+       
  • Historical data: Pinterest - We can no longer pick up mentions from Pinterest, but you will still see historical mentions on details pages.        
  • Reddit (original posts only, not comments)
  • Historical data: LinkedIn groups - LinkedIn have now unfortunately closed their data stream so we are unable to pick up new mentions from this source. You will still see mentions made before the stream was closed.

7) Patent citations


We track patent citations from nine jurisdictions around the world. Patents are a type of intellectual property that are held by either an inventor or a holder. 


Altmetric's patent data comprises nine different jurisdictions:

  1. WO : World Intellectual Property Organization

  2. AU : IP Australia

  3. DE : German Patent and Trade Mark Office

  4. CH : Swiss Federal Institute of Intellectual Property

  5. EP : European Patent Office (EPO)

  6. US : United States Patent and Trademark Office

  7. FR : National Industrial Property Institute

  8. GB : Intellectual Property Office of the United Kingdom

  9. NL : Netherlands Patent Office


8) Other online sources

  • Wikipedia     
  • Sites running Stack Exchange (Q&A)
  • F1000Prime recommendations
  • YouTube 
  • Open Syllabus
  • Citations (Scopus and Web of Science) - Citations are only available within the Explorer. Web of Science citations require an existing subscription with Clarivate Analytics. 



How do Altmetric collect mentions across each data source?

The table below describes the process for collecting data, update frequency and any additional details across all of our sources.

Source name
Collection method
Update frequency
Notes
Twitter                                          
Third party data provider API
Real-time feed
Demographics, support for retweets, with monitoring of suspicious activity.
Facebook
Facebook API
Daily
Posts on public Facebook Pages only, with prioritised popular Pages.
Policy documents
PDFs collected and scanned from policy sources and repositories
Daily
Scanning and text-mining policy document PDFs for references, which are looked up in CrossRef/PubMed and resolved to DOIs.
News
RSS feeds and API
Real-time feed
Manually curated news sources, with data provided via a third-party provider and RSS feeds direct.
Blogs
RSS feeds
Daily
Manually curated list, harvesting links to scholarly content.
Mendeley
Mendeley API
Daily
Reader counts is number of readers with the output in their Library. Not included in score.
Scopus
Scopus API
Real-time feed
Citation counts from peer-reviewed literature. Not included in score. These appear in the Explorer only.
Post-publication peer reviews
PubPeer and Publons APIs
Daily
Peer review comments collected from item records and associated by unique identifier.
Reddit
Reddit API
Daily
Includes all sub-reddits. Original posts only, no comments.
Wikipedia
Wikipedia API
Real-time feed
Mentions of scholarly outputs collected from References section. English Wikipedia only.
Q&A (Stack Overflow)
Stack Overflow API
Daily
Scan for links to scholarly outputs.
F1000Prime recommendations
F1000 API
Daily
Scan for links to scholarly outputs.
Google+
Google+ API
Daily
Public posts only.
YouTube
YouTube API
Daily
Scan for links to scholarly outputs in video comments.
 Open Syllabus
 Static Import from Open Syllabus
Quarterly
 Link syllabi's contents to HLOM IDs.
Web of Science    
Clarivate Analytics API        
Real-time feed
Citation counts from peer-reviewed literature. Not included in score. These appear in the Explorer and for customers with existing Web of Science subscriptions.
Patent Citations
IFI Claims
Monthly
Scanning JSON patent records for links to publications and DOIs