- arxiv/cs.AI · 4 min
Automated quantization shrinks spike-driven language models for edge devices
QSLM framework compresses neural network models by up to 86.5% while preserving accuracy, enabling deployment on resource-constrained embedded hardware.
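The summary doesn't spell out QSLM's actual compression recipe; as a rough idea of what weight quantization buys on embedded hardware, here is a minimal symmetric int8 sketch (pure illustration, not the paper's method):

```python
# Minimal sketch of symmetric int8 post-training quantization, the
# generic idea behind weight-compression frameworks like QSLM (the
# paper's actual scheme is not described here and may differ).

def quantize_int8(weights):
    """Map float weights to int8 values plus one per-tensor scale."""
    scale = max(abs(w) for w in weights) / 127.0
    q = [max(-128, min(127, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    return [v * scale for v in q]

q, s = quantize_int8([0.31, -1.27, 0.05, 0.88])
restored = dequantize(q, s)
# int8 storage takes 1 byte per weight instead of 4 for float32, a 75%
# cut before pruning or mixed precision pushes the ratio further.
```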
April 22, 2026
- arxiv/cs.AI · 6 min
AD-Copilot: Vision-Language Model Trained for Factory Defect Detection
Researchers built a specialized multimodal AI that compares paired industrial images to spot subtle manufacturing flaws, outperforming general-purpose models and human inspectors on benchmark tasks.
April 22, 2026
- arxiv/cs.AI · 8 min
Q-Value Iteration Finds Optimal Actions Faster Than Theory Predicts
Lee's switching-system analysis shows that Q-VI reaches practical optimality in finite time, with convergence rates potentially faster than the classical discount-factor bound.
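For context on the bound in question, a minimal sketch of standard Q-value iteration on a made-up 2-state, 2-action MDP (states, rewards, and the discount factor are illustrative, not from the paper):

```python
# Standard Q-value iteration on a toy 2-state, 2-action MDP.
# Classical theory says the error contracts by GAMMA per sweep; the
# paper argues practical convergence can beat that bound.

GAMMA = 0.9
# P[s][a] = list of (probability, next_state); R[s][a] = reward
P = {0: {0: [(1.0, 0)], 1: [(1.0, 1)]},
     1: {0: [(1.0, 0)], 1: [(1.0, 1)]}}
R = {0: {0: 0.0, 1: 1.0},
     1: {0: 0.0, 1: 2.0}}

def q_iteration(sweeps):
    Q = {s: {a: 0.0 for a in (0, 1)} for s in (0, 1)}
    for _ in range(sweeps):
        Q = {s: {a: R[s][a] + GAMMA * sum(p * max(Q[s2].values())
                                          for p, s2 in P[s][a])
                 for a in (0, 1)}
             for s in (0, 1)}
    return Q

Q = q_iteration(200)
greedy = {s: max(Q[s], key=Q[s].get) for s in Q}  # -> action 1 in both states
```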
April 22, 2026
- arxiv/cs.LG · 8 min
Dataset Distillation Fails Without Hard Labels
Soft labels mask poor dataset quality in distillation methods, making random subsets nearly as effective as curated ones.
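The claim hinges on how much signal a soft label carries beyond the argmax class; a minimal sketch of the two supervision targets (probabilities are illustrative, not the paper's data):

```python
# The two supervision targets compared in distillation work: a one-hot
# "hard" label scores only the true class, while a "soft" teacher
# distribution scores the whole predicted vector.

import math

def cross_entropy(pred, target):
    """Cross-entropy between probability vectors pred and target."""
    return -sum(t * math.log(p) for t, p in zip(target, pred) if t > 0)

pred = [0.6, 0.3, 0.1]
hard_loss = cross_entropy(pred, [1.0, 0.0, 0.0])  # = -log(0.6)
soft_loss = cross_entropy(pred, [0.7, 0.2, 0.1])
# The soft target rewards matching the teacher's full distribution,
# extra signal that can compensate for low-quality distilled images.
```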
April 22, 2026
- arxiv/cs.LG · 8 min
Concept Bottleneck Models Hit Hard Ceiling in Dermoscopy Data
Rough-set analysis reveals 16% of concept profiles in Derm7pt are internally inconsistent, capping model accuracy at 92% regardless of architecture.
April 22, 2026
- arxiv/cs.LG · 8 min
Simpler Optimizers Make LLM Unlearning More Robust
Research shows that using lower-order optimization methods during LLM unlearning produces forgetting that resists post-training attacks better than sophisticated gradient-based approaches.
April 21, 2026
- arxiv/cs.LG · 8 min
Three diffusion methods unified under population genetics framework
Researchers connect discrete, Gaussian, and simplicial diffusion models through Wright-Fisher theory, enabling stable cross-domain sequence generation.
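The Wright-Fisher model the paper builds on is, in its simplest neutral one-locus form, a binomial resampling step whose diffusion limit supplies the unifying theory; a minimal sketch with illustrative parameters:

```python
# One generation of neutral Wright-Fisher dynamics: carriers of an
# allele are redrawn binomially from the current frequency. The
# diffusion limit of repeated steps like this is the bridge between
# the model families named in the paper (parameters here are made up).

import random

def wright_fisher_step(count, pop_size, rng):
    """Resample carriers from the current frequency count/pop_size."""
    p = count / pop_size
    return sum(1 for _ in range(pop_size) if rng.random() < p)

rng = random.Random(1)
count = 50
for _ in range(500):
    count = wright_fisher_step(count, 100, rng)
# genetic drift eventually fixes the allele at 0 or at pop_size
```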
April 21, 2026
- arxiv/cs.LG · 8 min
Theory for learning blind inverse problems with finite samples
Researchers establish sample complexity bounds and optimal estimators for blind inverse problems using a linear minimum mean-square estimation framework.
April 21, 2026
- arxiv/cs.LG · 4 min
LLMs complement but don't replace classical hyperparameter optimization
A study comparing LLM agents to classical algorithms like CMA-ES and TPE finds hybrid approaches work best for tuning model hyperparameters under compute constraints.
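As a point of reference for the classical side of that comparison, a stdlib-only random-search loop over a toy objective; CMA-ES and TPE (available via libraries such as Optuna) are smarter versions of this same budgeted loop, and the objective and search ranges below are made up:

```python
# Budgeted random search over a 2-D hyperparameter space: the baseline
# that classical optimizers (CMA-ES, TPE) and the paper's LLM agents
# both try to beat. The objective is a toy stand-in for validation
# loss, minimized at lr=0.1, wd=0.01.

import random

def objective(lr, wd):
    return (lr - 0.1) ** 2 + (wd - 0.01) ** 2

def random_search(budget, rng):
    best, best_loss = None, float("inf")
    for _ in range(budget):
        lr = 10 ** rng.uniform(-4, 0)    # log-uniform learning rate
        wd = 10 ** rng.uniform(-4, -1)   # log-uniform weight decay
        loss = objective(lr, wd)
        if loss < best_loss:
            best, best_loss = (lr, wd), loss
    return best, best_loss

best, loss = random_search(200, random.Random(0))
```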
April 21, 2026
- arxiv/cs.LG · 4 min
Weak Labels Fail Across Time Even When Domain Transfer Works
A study of CRISPR experiments reveals that supervision drift, where the labeling mechanism itself shifts, causes model collapse in temporal transfer despite strong in-domain performance.
April 21, 2026
- arxiv/cs.LG · 8 min
Chain-of-Thought Supervision Eliminates Sample Complexity Growth
New theoretical analysis shows intermediate reasoning steps remove dependence on generation length, while end-to-end learning scales unpredictably with sequence depth.
April 21, 2026
- arxiv/cs.LG · 6 min
Automating Dataset Creation with LLMs and Search Engines
Researchers propose ADC, a method to build large labeled datasets automatically using language models and web search, reducing manual annotation work and cost.
April 21, 2026
- arxiv/cs.AI · 4 min
Interpretable Traces Don't Guarantee Better LLM Reasoning
Research shows Chain-of-Thought traces improve model performance but confuse users, and correctness of intermediate steps barely predicts final accuracy.
April 20, 2026
- arxiv/cs.AI · 5 min
LLMs Can Infer Unspoken Intent in Collaborative Tasks
Researchers tested whether large language models can interpret incomplete instructions by reasoning about a human partner's mental state, and found that they match human performance.
April 20, 2026
- arxiv/cs.AI · 6 min
OjaKV: Online Low-Rank Compression for LLM Key-Value Caches
A hybrid storage and adaptive subspace method reduces KV cache memory by compressing intermediate tokens while preserving critical anchors, compatible with FlashAttention.
April 20, 2026
- arxiv/cs.LG · 4 min
Neural CTMC decouples discrete diffusion into timing and direction
A new parameterization for discrete diffusion models separates when and where tokens jump, aligning training with mathematical structure.
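The "when vs where" split has a textbook analogue in how a continuous-time Markov chain is simulated: a holding time drawn from an exponential decides when to jump, and a separate categorical over targets decides where. A minimal sketch (rates and direction probabilities are made up for illustration, not the paper's parameterization):

```python
# Toy CTMC jump sampler factored into timing and direction, mirroring
# the decoupling the paper formalizes for discrete diffusion.

import random

def sample_jump(rate_out, direction_probs, rng):
    """Return (holding_time, next_state) for one CTMC transition."""
    dt = rng.expovariate(rate_out)          # WHEN: Exp(total exit rate)
    u, acc = rng.random(), 0.0
    for state, p in enumerate(direction_probs):  # WHERE: categorical
        acc += p
        if u < acc:
            return dt, state
    return dt, len(direction_probs) - 1

rng = random.Random(0)
dt, nxt = sample_jump(2.0, [0.1, 0.6, 0.3], rng)
```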
April 20, 2026
- arxiv/cs.LG · 8 min
Chromatic Clustering Requires New Algorithms to Match Standard Performance
Adding color constraints to correlation clustering increases computational difficulty; a new coupled approach recovers optimal approximation bounds.
April 20, 2026
- arxiv/cs.LG · 4 min
Quantum-LSTM hybrid cuts physics model training data by 100×
Federated learning with quantum-enhanced LSTM achieves classical accuracy on SUSY classification using 20K samples instead of 2M, with under 300 parameters.
April 20, 2026
- hackernoon · 6 min
Why AV Data Annotation Fails at Scale and What Fixes It
Autonomous vehicle programs collapse not from bad models but from annotation pipelines that were never built to handle production volume.
April 18, 2026
- hackernoon · 4 min
Browser-Native Agents: Bypassing API Gaps with Session Control
When API catalogs exclude premium models, controlling an existing browser session offers a practical alternative to waiting for official endpoints.
April 18, 2026
- arxiv/cs.AI · 4 min
AlphaCNOT: Planning-Based RL Cuts Quantum Gate Count by 32%
Researchers combine Monte Carlo Tree Search with reinforcement learning to minimize CNOT gates in quantum circuits, outperforming classical heuristics.
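Monte Carlo Tree Search planners balance exploiting promising circuit rewrites against exploring rarely visited ones, classically via a rule from the UCB1 family; a minimal sketch of that selection step (statistics are illustrative, not AlphaCNOT's reward model):

```python
# UCB1 child selection, the workhorse of MCTS-style planners: pick the
# child maximizing mean value plus an exploration bonus that shrinks
# as a child is visited more often.

import math

def ucb1_pick(values, visits, c=1.4):
    """values[i]: summed reward of child i; visits[i]: its visit count."""
    total = sum(visits)
    best, best_score = 0, float("-inf")
    for i, (v, n) in enumerate(zip(values, visits)):
        if n == 0:
            return i                  # always try an unvisited child first
        score = v / n + c * math.sqrt(math.log(total) / n)
        if score > best_score:
            best, best_score = i, score
    return best

# Child 0 has a decent, well-explored average; child 1 is worse on
# average but barely visited, so the bonus can still favor it.
choice = ucb1_pick(values=[5.0, 0.8], visits=[10, 1])  # -> 1
```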
April 18, 2026
- hackernoon · 2 min
HackerNoon indexes 218 articles on AI agents for self-directed study
A curated reading list from HackerNoon's Learn Repo maps the AI agent landscape across frameworks, protocols, security, and production failures.
April 18, 2026
- hackernoon · 2 min
AI Coding Agents Reshape Developer Work, Not Replace It
HackerNoon's April 2026 roundup shows autonomous ML agents and agentic workflows solving real problems, shifting focus from coding skill to agent orchestration.
April 18, 2026
- arxiv/cs.AI · 4 min
TableNet: LLM-Driven Dataset for Table Structure Recognition
Researchers introduce an autonomous multi-agent system that generates synthetic tables at scale and uses active learning to train structure recognition models more efficiently.
April 17, 2026