Glossary term
Foundations
Information extraction (IE): the NLP task of extracting structured data (entities, relations, events) from unstructured text.
Scale AI's document extraction product uses IE to pull structured fields from invoices, contracts, and regulatory filings, processing 10M+ documents per month for enterprise clients including top-10 US banks.
Diffbot Knowledge Graph uses IE to extract entities and relationships from 1 billion+ web pages, building a structured knowledge graph used by hedge funds and research teams for automated competitive intelligence.
Microsoft Azure Document Intelligence uses IE to extract key-value pairs, tables, and named entities from scanned business documents, with prebuilt models for invoices, receipts, identity cards, and business cards.
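
To make the definition concrete, here is a minimal sketch of IE on an invoice-like text, turning free text into a structured record. It is a toy rule-based approach using regular expressions and is not how any of the products above work. The `InvoiceFields` record, the `PATTERNS` table, the `extract_invoice_fields` function, and the sample invoice layout are all hypothetical names and assumptions introduced for illustration.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass
class InvoiceFields:
    """Structured record produced from unstructured invoice text (hypothetical schema)."""
    invoice_number: Optional[str]
    total: Optional[float]
    date: Optional[str]


# Hypothetical hand-written patterns for one assumed invoice layout.
# Production systems typically rely on learned models rather than fixed rules.
PATTERNS = {
    "invoice_number": re.compile(r"Invoice\s*(?:No\.?|#)\s*:?\s*([A-Z0-9-]+)", re.IGNORECASE),
    "total": re.compile(r"Total\s*(?:Due)?\s*:?\s*\$?([\d,]+\.\d{2})", re.IGNORECASE),
    "date": re.compile(r"Date\s*:?\s*(\d{4}-\d{2}-\d{2})", re.IGNORECASE),
}


def extract_invoice_fields(text: str) -> InvoiceFields:
    """Return the first match for each field, or None when a field is absent."""

    def first(name: str) -> Optional[str]:
        match = PATTERNS[name].search(text)
        return match.group(1) if match else None

    raw_total = first("total")
    return InvoiceFields(
        invoice_number=first("invoice_number"),
        # Strip thousands separators before converting to a number.
        total=float(raw_total.replace(",", "")) if raw_total else None,
        date=first("date"),
    )


sample = "Invoice No. INV-2041\nDate: 2024-03-15\nAcme Corp\nTotal Due: $1,250.00"
fields = extract_invoice_fields(sample)
print(fields)
```

Fixed rules like these break as soon as the layout changes, which is why practical IE systems usually combine layout analysis with trained entity and relation extractors.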