14 7 24

Zheng Zian(Andy)

OrionZheng

https://zheng-zian-andy.com

AI & ML interests

LLM, Mixture-of-Experts, Data-Centric AI

Recent Activity

upvoted a paper 12 days ago

Differentiable Evolutionary Reinforcement Learning

upvoted a paper 3 months ago

Interactive Training: Feedback-Driven Neural Network Optimization

liked a dataset 3 months ago

friedrichor/DiDeMo

View all activity

Organizations

None yet

upvoted a paper 12 days ago

Differentiable Evolutionary Reinforcement Learning

Paper • 2512.13399 • Published 14 days ago • 18

upvoted a paper 3 months ago

Interactive Training: Feedback-Driven Neural Network Optimization

Paper • 2510.02297 • Published Oct 2 • 42

liked a dataset 3 months ago

friedrichor/DiDeMo

Viewer • Updated Jul 23 • 9.4k • 1.99k • 9

upvoted 2 papers 4 months ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published Sep 1 • 76

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31 • 84

liked 2 models 4 months ago

gdhe17/Self-Forcing

Text-to-Video • Updated Sep 23 • 5.19k • 121

FastVideo/FastWan2.2-TI2V-5B-FullAttn-Diffusers

Text-to-Video • Updated Nov 25 • 14.6k • 56

upvoted a paper 5 months ago

BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent

Paper • 2508.06600 • Published Aug 8 • 41

liked a dataset 5 months ago

TIGER-Lab/VisCode-200K

Viewer • Updated Jun 8 • 203k • 120 • 8

upvoted a paper 5 months ago

VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation

Paper • 2506.03930 • Published Jun 4 • 26

New activity in OrionZheng/openmoe-8b 7 months ago

error in modeling_openmoe.py

#3 opened 7 months ago by

xiaojia1086

liked 2 models 10 months ago

Qwen/QwQ-32B

Text Generation • 33B • Updated Mar 11 • 103k • • 2.88k

MatrixTeam/TheMatrix

Updated May 7 • 4

New activity in OrionZheng/openmoe-8b 11 months ago

Model source code

#2 opened 11 months ago by

not-found

liked a dataset about 1 year ago

MixEval/MixEval-X

Viewer • Updated Feb 15 • 7.68k • 377 • 10

authored 2 papers about 1 year ago

OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models

Paper • 2402.01739 • Published Jan 29, 2024 • 28

MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures

Paper • 2410.13754 • Published Oct 17, 2024 • 75

upvoted a paper about 1 year ago

MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures

Paper • 2410.13754 • Published Oct 17, 2024 • 75

New activity in OrionZheng/openmoe-8b over 1 year ago

model_type "llama"

#1 opened over 1 year ago by

pingzhili

liked a dataset over 1 year ago

glaiveai/glaive-function-calling

Viewer • Updated Sep 27, 2023 • 52.9k • 136 • 102