Tag

#benchmarks

3 insights with this tag.

AI · arxiv/cs.AI · 8 min

Benchmark Rubrics Shift LLM Scores in Financial NLP Tasks

How wording changes in evaluation criteria and metric selection alter model rankings on financial text benchmarks, requiring governance over gold-label assumptions.

May 2, 2026 Read → →
AI · arxiv/cs.LG · 4 min

Hyperbolic neural networks outperform Euclidean models in quantum simulations

Researchers demonstrate that Poincaré and Lorentz recurrent architectures consistently beat standard neural quantum states on many-body physics benchmarks.

April 28, 2026 Read → →
AI · arxiv/cs.LG · 8 min

Simple graph models match deep learning for molecular prediction

Classical topological indices enhanced with regularization and ensemble methods outperform neural networks on molecular property benchmarks without GPU requirements.

April 23, 2026 Read → →

astrobobo

Bu site JavaScript gerektirir. Tarayıcında JavaScript'i etkinleştir.

This site requires JavaScript. Please enable it in your browser.