← İçerik
Yapay Zeka · 4 dk okuma · 17 Nisan 2026

TableNet: LLM-Driven Dataset for Table Structure Recognition

Researchers introduce an autonomous multi-agent system that generates synthetic tables at scale and uses active learning to train structure recognition models more efficiently.

Kaynak: arxiv/cs.AI · Ruilin Zhang, Kai Yang · orijinali aç ↗ ↗
Paylaş: X LinkedIn

TableNet uses LLM-powered agents to generate diverse synthetic tables and active learning to train recognition models with fewer samples.

  • Multi-agent system generates table images with controllable visual, structural, and semantic parameters.
  • Synthesis approach creates semantically coherent tables adaptable to user-defined configurations.
  • Active learning selects most informative samples from diverse table sources for model finetuning.
  • Achieves competitive performance on test sets while reducing training sample requirements significantly.
  • Outperforms models trained on existing table datasets when tested on real-world web-crawled tables.
  • First application of diversity-based active learning to table structure recognition across varying row/column counts and merged cells.
  • Approach is domain-agnostic and style-flexible, enabling theoretically unlimited table generation.

Sık sorulanlar

  • Table structure recognition (TSR) is the task of identifying the logical layout of a table—rows, columns, merged cells, headers, and cell relationships. It matters because many documents (PDFs, scanned images, web pages) contain tables, and understanding their structure is essential for extracting data accurately. LLMs can reason about complex layouts, but they need training data that reflects real-world table diversity.

İlgili