Organizations interested in COVID-19-related content for text mining projects can access CCC’s XML for Mining Open Access COVID collection. This collection is available even if you don’t currently use RightFind XML for Mining.
The COVID-19 Open Access XML Corpus comprises a backfile and updates that encompass more than 32,300 Open Access (Creative Commons CC BY license) articles across more than 35 publishers, in full-text, semi-normalized XML format. These articles are retrieved from XML for Mining using a query aimed at broad recall across COVID-19 and related coronaviruses.
The COVID-19 Open Access XML Corpus is updated every two weeks with new content. If you’re interested in this content, please contact us at email@example.com.