Amazon Textract
A machine learning service that extracts text, form fields, and tables from scanned documents (images and PDFs).
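Exam Tip: Textract = document data extraction (OCR plus structure). If the question involves extracting structured data from documents (forms, tables, invoices), the answer is Textract. It goes beyond simple OCR because it preserves structure, such as which value belongs to which form key or table cell.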
Key Capabilities
- Text Detection: Extract printed and handwritten text from documents
- Form Extraction: Extract key-value pairs from forms (e.g., "Name: John Smith")
- Table Extraction: Extract structured table data
- Query-based Extraction: Ask specific natural-language questions about a document (e.g., "What is the invoice total?")
- Signature Detection: Detect signatures in documents
- Expense Analysis: Extract data from invoices and receipts
- Identity Document Analysis: Extract information from ID cards, passports, driver's licenses
- Lending Document Analysis: Extract data from mortgage and lending documents
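To make the form extraction capability concrete, here is a minimal Python sketch of how key-value pairs might be read out of a Textract AnalyzeDocument response. Assumptions: the helper names (text_of, extract_key_values, analyze_form) and the tiny SAMPLE_RESPONSE are hypothetical and hand-written for illustration. The sample mimics only the fields the parser needs (Blocks, BlockType, EntityTypes, Relationships, Text), not a full real response. The analyze_form function calls boto3 and needs AWS credentials, so it is defined but not invoked here.

```python
def text_of(block, blocks_by_id):
    """Join the text of a block's WORD children into one string."""
    words = []
    for rel in block.get("Relationships", []):
        if rel["Type"] == "CHILD":
            for child_id in rel["Ids"]:
                child = blocks_by_id[child_id]
                if child["BlockType"] == "WORD":
                    words.append(child["Text"])
    return " ".join(words)


def extract_key_values(response):
    """Return {key_text: value_text} from KEY_VALUE_SET blocks."""
    blocks_by_id = {b["Id"]: b for b in response["Blocks"]}
    pairs = {}
    for block in response["Blocks"]:
        is_key = (
            block["BlockType"] == "KEY_VALUE_SET"
            and "KEY" in block.get("EntityTypes", [])
        )
        if not is_key:
            continue
        key_text = text_of(block, blocks_by_id)
        value_parts = []
        # A KEY block points at its VALUE block(s) through a VALUE relationship.
        for rel in block.get("Relationships", []):
            if rel["Type"] == "VALUE":
                for value_id in rel["Ids"]:
                    value_parts.append(text_of(blocks_by_id[value_id], blocks_by_id))
        pairs[key_text] = " ".join(value_parts)
    return pairs


def analyze_form(image_bytes):
    """Hypothetical wrapper: send one image to Textract (requires AWS credentials)."""
    import boto3  # imported lazily so the parser above runs without boto3

    client = boto3.client("textract")
    return client.analyze_document(
        Document={"Bytes": image_bytes},
        FeatureTypes=["FORMS", "TABLES"],
    )


# Hand-written, heavily trimmed stand-in for a real response (illustration only).
SAMPLE_RESPONSE = {
    "Blocks": [
        {"Id": "k1", "BlockType": "KEY_VALUE_SET", "EntityTypes": ["KEY"],
         "Relationships": [{"Type": "CHILD", "Ids": ["w1"]},
                           {"Type": "VALUE", "Ids": ["v1"]}]},
        {"Id": "v1", "BlockType": "KEY_VALUE_SET", "EntityTypes": ["VALUE"],
         "Relationships": [{"Type": "CHILD", "Ids": ["w2", "w3"]}]},
        {"Id": "w1", "BlockType": "WORD", "Text": "Name:"},
        {"Id": "w2", "BlockType": "WORD", "Text": "John"},
        {"Id": "w3", "BlockType": "WORD", "Text": "Smith"},
    ]
}

print(extract_key_values(SAMPLE_RESPONSE))
```

With real documents, the same parser could be fed the result of analyze_form(open("form.png", "rb").read()), where form.png is a placeholder file name. The takeaway for the exam is that Textract returns related blocks (keys linked to values), not just a flat stream of recognized text.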
Common Use Cases
- Automated invoice and receipt processing
- Financial document data extraction
- Healthcare form digitization
- Legal document analysis and data extraction
- Government ID verification and processing
- Mortgage and lending document automation