Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
wubanggu's picture
1 5

wubanggu

banggu
zhangysk's profile picture
·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 14 hours ago
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling
upvoted a paper 4 days ago
ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation
upvoted a paper 3 months ago
Virtual Width Networks
View all activity

Organizations

None yet

upvoted a paper about 14 hours ago

Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling

Paper • 2501.16975 • Published Jan 28, 2025 • 32
upvoted a paper 4 days ago

ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation

Paper • 2601.21420 • Published 7 days ago • 41
upvoted a paper 3 months ago

Virtual Width Networks

Paper • 2511.11238 • Published Nov 14, 2025 • 38
upvoted a paper 5 months ago

UltraMemV2: Memory Networks Scaling to 120B Parameters with Superior Long-Context Learning

Paper • 2508.18756 • Published Aug 26, 2025 • 36
upvoted a paper 11 months ago

Frac-Connections: Fractional Extension of Hyper-Connections

Paper • 2503.14125 • Published Mar 18, 2025 • 22
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs