- AI · arxiv/cs.LG · 6 min
Automating Dataset Creation with LLMs and Search Engines
Researchers propose ADC, a method to build large labeled datasets automatically using language models and web search, reducing manual annotation work and cost.
April 21, 2026 Read → → - AI · arxiv/cs.LG · 3 min
Framework uses AI outputs as features, not proxies, for labeled data
Generative Augmented Inference treats LLM predictions as informative signals rather than direct substitutes, reducing human labeling needs by 75–90% across operations tasks.
April 17, 2026 Read → →