Xingwei Tan's picture

Xingwei Tan

XingweiT

·

Xingwei-Tan

AI & ML interests

None yet

Recent Activity

upvoted a collection 29 days ago

Olmo 3 Pre-training

upvoted a collection 29 days ago

upvoted a collection 29 days ago

Olmo 3 Post-training

View all activity

Organizations

None yet

upvoted 3 collections 29 days ago

Olmo 3 Pre-training

All artifacts related to Olmo 3 pre-training • 10 items • Updated 11 days ago • 32

Olmo 3

Artifacts for the Olmo 3 release. • 9 items • Updated 11 days ago • 156

Olmo 3 Post-training

All artifacts for post-training Olmo 3. Datasets follow the model that resulted from training on them. • 32 items • Updated 11 days ago • 46

upvoted a paper about 1 month ago

SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space

Paper • 2511.20102 • Published Nov 25, 2025 • 27

upvoted 3 papers 3 months ago

Deconstructing Attention: Investigating Design Principles for Effective Language Modeling

Paper • 2510.11602 • Published Oct 13, 2025 • 14

Latent Refinement Decoding: Enhancing Diffusion-Based Language Models by Refining Belief States

Paper • 2510.11052 • Published Oct 13, 2025 • 51

Fine-Tuning on Noisy Instructions: Effects on Generalization and Performance

Paper • 2510.03528 • Published Oct 3, 2025 • 17

upvoted 3 papers 4 months ago

Enhancing Logical Reasoning in Language Models via Symbolically-Guided Monte Carlo Process Supervision

Paper • 2505.20415 • Published May 26, 2025 • 2

IntrEx: A Dataset for Modeling Engagement in Educational Conversations

Paper • 2509.06652 • Published Sep 8, 2025 • 24

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published May 15, 2025 • 120

upvoted a collection 4 months ago

Qwen3

84 items • Updated 3 days ago • 1.53k

upvoted a paper 4 months ago

Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation?

Paper • 2508.19827 • Published Aug 27, 2025 • 33

upvoted 2 papers 5 months ago

Spectrum Projection Score: Aligning Retrieved Summaries with Reader Models in Retrieval-Augmented Generation

Paper • 2508.05909 • Published Aug 8, 2025 • 21

A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems

Paper • 2508.07407 • Published Aug 10, 2025 • 98