Hypothesis

10 Matching Annotations

Last 7 days
8.211.152.224

Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

4
1. uptown1919 24 Jun 2026
  
  in Public
  
  critic model to evaluate each step of the interaction based on three dimensions: logical soundness, tool-call accuracy, and informational gain.
  
  Turn-level critic.
2. uptown1919 24 Jun 2026
  
  in Public
  
  strong–weak model comparisons
  
  novel
3. uptown1919 24 Jun 2026
  
  in Public
  
  In addition, we train a more capable CoT Reconstruction model to generate cleaner and more faithful reasoning traces from refined answers
  
  Trained a CoT-generating model.
4. uptown1919 24 Jun 2026
  
  in Public
  
  reasoning-focused models often struggle with long-horizon interactions (e.g., deep search) [17], while code or agent specialized models typically lack robust general reasoning abilities
  
  the problem it solved.

Annotators

uptown1919

URL

8.211.152.224/pdf/arxiv-2602.13367.pdf
8.211.152.224

Trading-R1: Financial Trading with LLM Reasoning via Reinforcement Learning

2
1. uptown1919 23 Jun 2026
  
  in Public
  
  Trading-R1 training, the reward r_i integrates the structure, evidence, and decision components
  
  Each stage has its own reward.
2. uptown1919 23 Jun 2026
  
  in Public
  
  (b) Reverse Reasoning Distillation.
  
  Synthesizes a CoT when GPT forbids being distilled. Novel.

Annotators

uptown1919

URL

8.211.152.224/pdf/arxiv-2509.11420.pdf
8.211.152.224

Alpha-R1: Alpha Screening with LLM Reasoning via Reinforcement Learning

2
1. uptown1919 21 Jun 2026
  
  in Public
  
  time 𝑡, a semantic decision context 𝐶𝑡
  
  Full shit. In-sample.
2. uptown1919 21 Jun 2026
  
  in Public
  
  3.1.3 Factor Backtesting. To establish the ground truth for factor behavior, we perform a backtest on the entire factor pool U over the historical window. For each factor 𝑖, we obtain a quantitative performance vector 𝑃𝑖, which includes key metrics such as returns, volatility, and decay characteristics. This dataset serves as the objective basis for linking market memory with factor effectiveness
  
  Full shit. In-sample, leakage.

Annotators

uptown1919

URL

8.211.152.224/pdf/arxiv-2512.23515.pdf
Jun 2026
8.211.152.224

Beyond Prompting: An Autonomous Framework for Systematic Factor Investing via Agentic AI

2
1. uptown1919 21 Jun 2026
  
  in Public
  
  0.0068
  
  Metrics are too low.
2. uptown1919 21 Jun 2026
  
  in Public
  
  Prior research heuristics and financial intuitions
  
  prior research

Annotators

uptown1919

URL

8.211.152.224/pdf/arxiv-2603.14288.pdf
