Features
- AI-ready data sets for use in internal projects
- A single point of access to full-text journal articles across a wide range of STM publishers
- Full text literature in a machine-readable, interoperable, JATS-compliant XML format
Benefits
- Enrich internal AI and machine learning projects with insights that can only be found in the full text of scientific articles.
- Reduce infringement risk by incorporating copyright compliance into your internal AI workflows.
- Spend less time converting PDF articles and negotiating licensing rights
- RightFind XML is part of the RightFind Suite—a robust set of software solutions that fuel scientific research and simplify copyright, anytime, anywhere
RightFind XML by the numbers
20+
participating publishers
3.4 million +
AI-ready articles
5 million +
Open Access articles, including 555,000+ non-PMC articles
Flexible models that meet your unique needs
Use CCC’s search interface to create queries on a project basis, filter results and download files relevant to your current needs
Embed RightFind XML functionality into your own automated tools and processes with a RESTful API
Access XML with a data feed option that delivers specified subscribed content in XML format and offers updated content at a regular cadence
Directly access CCC’s curated Open Access Corpus of CC-BY content in XML format
Featured resources
Open Access Corpus in RightFind XML
A unified source for machine-readable open access articles is a simple and cost-effective way to integrate scientific literature into knowledge extraction tools and enrich your outcomes.

Featured resources
Fueling Your Machine Learning Projects with XML Content – Why Flexible Retrieval Options Are Crucial
CCC has continuously developed its RightFind XML offering since its inception in 2016, and we now offer even more flexible ways to incorporate scientific articles from more than 50 publishers into AI and machine-learning initiatives.
Top 3 Challenges When Using Scientific Articles in AI & Machine Learning Projects
Here are the three primary challenges we hear when companies build a collection of articles (or “corpus”) for their text mining projects, with tips to overcome them.
AI, Copyright & Licensing
Copyright is central to high quality outcomes as copyrighted material is the fuel for AI systems. Licensing is an effective solution enabling the use of copyrighted material as society realizes the benefits promised by AI systems. Learn more about AI, Copyright & Licensing.



