Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Zixi "Oz" Li's picture
Building on HF
7 27 40

Zixi "Oz" Li PRO

OzTianlu
NoesisLab
robtacconelli's profile picture Fishtiks's profile picture mrs83's profile picture
·
https://github.com/lizixi-0x2F
  • lizixi-0x2F

AI & ML interests

My research focuses on deep reasoning with small language models, Transformer architecture innovation, and knowledge distillation for efficient alignment and transfer.

Recent Activity

reacted to danielhanchen's post with 🔥 1 day ago
We collaborated with NVIDIA to teach you about Reinforcement Learning and RL environments. 💚 Learn: • Why RL environments matter + how to build them • When RL is better than SFT • GRPO and RL best practices • How verifiable rewards and RLVR work Blog: https://unsloth.ai/blog/rl-environments
replied to their post 1 day ago
Arcade-3B — SmolReasoner https://huggingface.co/NoesisLab/Arcade-3B Arcade-3B is a 3B instruction-following and reasoning model built on SmolLM3-3B. It is the public release from the ARCADE project at NoesisLab, which investigates the State–Constraint Orthogonality Hypothesis: standard Transformer hidden states conflate factual content and reasoning structure in the same subspace, and explicitly decoupling them improves generalization.
updated a model 1 day ago
NoesisLab/Arcade-3B
View all activity

Organizations

LocalLLaMA's profile picture Hugging Face Discord Community's profile picture NoesisLab's profile picture Unsloth Jobs Explorers's profile picture

OzTianlu 's models

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs