What the confidence score measures
Every extraction receives a confidence score between 0% and 100%. This reflects how clearly and unambiguously the extracted values appeared in the source document — not whether the values are factually correct, but how certain the AI is about what it read.
A score of 95% means the AI found clean, well-formatted text. A score of 40% might mean the document was a low-resolution scan, fields were partially obscured, or the document structure was unusual.
Score ranges and what they mean
- 80–100% (green) — High confidence. Values extracted cleanly. Suitable for automated processing without human review.
- 50–79% (amber) — Moderate confidence. Most values are likely correct but spot-checking is recommended for financial amounts or dates.
- 0–49% (red) — Low confidence. Document may be a poor-quality scan, hand-written, or in an unusual format. Human review strongly recommended.
How to improve low confidence scores
- Use higher-resolution scans (300 DPI minimum)
- Ensure good contrast between text and background — avoid shadows and glare
- Create a custom template with detailed field descriptions to guide the AI
- Make corrections to extracted fields — each correction improves future accuracy on similar documents
자주 묻는 질문
100% 점수가 데이터의 정확성을 보장하나요?
높은 신뢰도는 AI가 문서에서 값을 명확하게 식별했음을 의미하지만, 사실적 정확성을 검증하지는 않습니다(예: 인보이스 합계가 수학적으로 정확한지 확인할 수 없습니다). 중요한 재무 데이터는 항상 검증하세요.
같은 문서가 매번 다르게 점수가 나오는 이유는 무엇인가요?
AI 모델에는 약간의 비결정적 변동이 있습니다. 같은 문서에서 ±5% 차이가 나타날 수 있으며, 이는 정상입니다.