Member-Led Project

Natural Language Processing (NLP) Use Case Database Project

This collaborative project was launched to create a bottom-up, qualitative Natural Language Processing (NLP) Use Case Database that lets NLP practitioners in pharma companies share successes and failures with their peers. Identifying which use case scenarios succeed should reduce experimentation and raise success rates for new NLP initiatives.

Executive Summary

Why is this important?

Natural Language Processing offers great promise for improving efficiency, clarifying relationships, and extracting meaning from vast amounts of unstructured text. Pharma companies apply NLP methods in the hope of automating work and generating insight. NLP experts have invested significant resources in developing tools for business cases across the pharma industry, in domains as varied as R&D, pharmacovigilance, and manufacturing. Success, however, seems to be very use case-specific.

Although NLP algorithms have matured considerably in recent years, the practical value of most NLP pilots tends to be poor, and very few NLP-driven projects are seen through to production. The exceptions typically involve topics with good metadata quality, large training sets, business colleagues willing to verify results, and a serendipitous combination of technical expertise and a suitable use case.

This kind of knowledge is worth sharing in a pre-competitive manner among Pistoia Alliance members. A simple database could record, for each use case, a characterization of the use case, a characterization of the data, the pipelines and algorithms used, the quality criteria, the outcomes, and comments.
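To make the proposed fields concrete, one entry in such a database might be modelled as below. This is only a sketch: the class name `UseCaseRecord`, the field names, the three-way `Outcome` scale, and the `outcome_counts` helper are all assumptions made for illustration and are not taken from the project itself.

```python
from collections import Counter
from dataclasses import dataclass
from enum import Enum


class Outcome(Enum):
    # Assumed three-way scale; the project text only says "outcomes".
    SUCCESS = "success"
    PARTIAL = "partial"
    FAILURE = "failure"


@dataclass
class UseCaseRecord:
    # Field names are hypothetical; they mirror the fields listed in the text.
    use_case: str                        # characterization of the use case
    data_characterization: str           # e.g. volume, metadata quality
    pipelines_and_algorithms: list[str]  # pipelines and algorithms used
    quality_criteria: list[str]          # how results were judged
    outcome: Outcome
    comments: str = ""


def outcome_counts(records: list[UseCaseRecord]) -> Counter:
    """Tally records by outcome label."""
    return Counter(r.outcome.value for r in records)


# Invented placeholder entry, for illustration only (not a real submission).
example = UseCaseRecord(
    use_case="Placeholder: classify free-text reports",
    data_characterization="Placeholder: small labelled set, sparse metadata",
    pipelines_and_algorithms=["placeholder-pipeline"],
    quality_criteria=["placeholder-metric"],
    outcome=Outcome.FAILURE,
    comments="Placeholder comment.",
)

print(outcome_counts([example]))
```

Keeping the outcome as a small fixed vocabulary rather than free text is one way to let entries from different companies be compared and counted, while the comments field carries the qualitative detail.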

What will the project achieve?

The team will deliver the following:

A bottom-up, qualitative NLP Success/Failure Database with 50-100 use cases
Agreed annotations of NLP use case methods and success/failure criteria
Collaborative insight into why NLP use cases may fail or succeed with an industry-wide view

Last Updated on December 19, 2022 by Catherine Maskell
Categories: Current Projects