logoalt Hacker News

Imanariyesterday at 6:50 PM1 replyview on HN

Looks really nice! How does it handle tables?


Replies

Adityav369yesterday at 6:58 PM

We have two ingestion pathways: 1. regular OCR + text embeddings; 2. Colpali. We've observed that Colpali does a much better job with tables since it can encode positional stuff and layouts as well.

show 1 reply