- AI · arxiv/cs.AI · 4 min
KuaiLive: First Real-Time Live Streaming Recommendation Dataset
Researchers release a 21-day interaction log from Kuaishou covering 23,772 users and 452,621 streamers to enable dynamic recommendation research.
April 27, 2026 Read → → - AI · arxiv/cs.LG · 6 min
Automating Dataset Creation with LLMs and Search Engines
Researchers propose ADC, a method to build large labeled datasets automatically using language models and web search, reducing manual annotation work and cost.
April 21, 2026 Read → → - AI · arxiv/cs.AI · 4 min
TableNet: LLM-Driven Dataset for Table Structure Recognition
Researchers introduce an autonomous multi-agent system that generates synthetic tables at scale and uses active learning to train structure recognition models more efficiently.
April 17, 2026 Read → →