
han weidong

dongdong2021
https://github.com/weidong2018
  • weidong2018

AI & ML interests

NLP; Multi-modal; LLM

Organizations

  • Knowledge Works Lab at Fudan University
  • Tencent

authored 2 papers 3 months ago

Lossless KV Cache Compression to 2%

Paper • 2410.15252 • Published Oct 20, 2024 • 1

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

Paper • 2505.15431 • Published May 21 • 1
authored a paper 9 months ago

TransMamba: Flexibly Switching between Transformer and Mamba

Paper • 2503.24067 • Published Mar 31 • 21
authored a paper 12 months ago

Scaling Laws for Floating Point Quantization Training

Paper • 2501.02423 • Published Jan 5 • 26