- AI · arxiv/cs.AI · 8 min
Benchmark Rubrics Shift LLM Scores in Financial NLP Tasks
How wording changes in evaluation criteria and metric selection alter model rankings on financial text benchmarks, requiring governance over gold-label assumptions.
May 2, 2026 Read → → - AI · arxiv/cs.LG · 4 min
Hyperbolic neural networks outperform Euclidean models in quantum simulations
Researchers demonstrate that Poincaré and Lorentz recurrent architectures consistently beat standard neural quantum states on many-body physics benchmarks.
April 28, 2026 Read → → - AI · arxiv/cs.LG · 8 min
Simple graph models match deep learning for molecular prediction
Classical topological indices enhanced with regularization and ensemble methods outperform neural networks on molecular property benchmarks without GPU requirements.
April 23, 2026 Read → →