- AI · arxiv/cs.AI · 4 min
KuaiLive: First Real-Time Live Streaming Recommendation Dataset
Researchers release a 21-day interaction log from Kuaishou covering 23,772 users and 452,621 streamers to enable dynamic recommendation research.
April 27, 2026 Read → → - AI · arxiv/cs.AI · 6 min
AD-Copilot: Vision-Language Model Trained for Factory Defect Detection
Researchers built a specialized multimodal AI that compares paired industrial images to spot subtle manufacturing flaws, outperforming general-purpose models and human inspectors on benchmark tasks.
April 22, 2026 Read → → - AI · arxiv/cs.AI · 4 min
MERRIN: Benchmark for Multimodal Search in Noisy Web Data
New benchmark reveals AI agents struggle with real-world web search, achieving only 22% accuracy when retrieving and reasoning across mixed media sources.
April 17, 2026 Read → → - AI · arxiv/cs.AI · 8 min
Vision-Language Models Fail on Dense Visual Grids
A new benchmark reveals VLMs collapse sharply on simple grid-reading tasks, exposing a gap between visual encoding and language output called Digital Agnosia.
April 17, 2026 Read → →