invoice-master-poc-v2/run_autolabel.py
Yaojia Wang 8938661850 Initial commit: Invoice field extraction system using YOLO + OCR
Features:
- Auto-labeling pipeline: CSV values -> PDF search -> YOLO annotations
- Flexible date matching: year-month match, nearby date tolerance
- PDF text extraction with PyMuPDF
- OCR support for scanned documents (PaddleOCR)
- YOLO training and inference pipeline
- 7 field types: InvoiceNumber, InvoiceDate, InvoiceDueDate, OCR, Bankgiro, Plusgiro, Amount

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-10 17:44:14 +01:00
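
The "flexible date matching" feature listed in the commit message could look roughly like the sketch below. This is an illustration only, not the repository's implementation: the function name `dates_match`, the `tolerance_days` parameter, and its default of 3 days are all assumptions made for this example.

```python
from datetime import date
from typing import Optional


def dates_match(expected: date, found: date, tolerance_days: int = 3) -> Optional[str]:
    """Classify how a date found in a PDF relates to the expected CSV date.

    Hypothetical helper: returns "exact", "nearby", "year-month", or None.
    """
    # Identical dates are the strongest match.
    if expected == found:
        return "exact"
    # Nearby date tolerance: accept dates within a few days of each other.
    if abs((expected - found).days) <= tolerance_days:
        return "nearby"
    # Year-month match: same calendar month, day of month ignored.
    if (expected.year, expected.month) == (found.year, found.month):
        return "year-month"
    return None


if __name__ == "__main__":
    print(dates_match(date(2026, 1, 10), date(2026, 1, 12)))
```

Returning the match kind rather than a bare boolean would let a caller rank candidate matches, for example preferring an exact hit over a year-month hit when both appear on the same page.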


#!/usr/bin/env python3
"""
Auto-labeling script - invokes the CLI module.
Run in WSL: python run_autolabel.py
"""
from src.cli.autolabel import main

if __name__ == '__main__':
    main()