Kirill
AI & ML interests
DNNs, differential geometry, algebraic topology, cryptography
Recent Activity
new activity
about 7 hours ago
TheStageAI/README:Update README.md
reacted
to
their
post
with ๐
1 day ago
We thought it would be easier, but finally we have integrated CuDNN Paged Attention to our models!
Read article here: https://app.thestage.ai/blog/Integrating-cuDNN-Paged-Attention-to-TheStage-AI-Inference?id=8
Llama-8B with CuDNN paged attention, including B200 support: https://huggingface.co/TheStageAI/Elastic-Llama-3.1-8B-Instruct
Mistral-Small-24B with CuDNN paged attention, including B200 support: https://huggingface.co/TheStageAI/Elastic-Mistral-Small-3.1-24B-Instruct-2503
posted
an
update
1 day ago
We thought it would be easier, but finally we have integrated CuDNN Paged Attention to our models!
Read article here: https://app.thestage.ai/blog/Integrating-cuDNN-Paged-Attention-to-TheStage-AI-Inference?id=8
Llama-8B with CuDNN paged attention, including B200 support: https://huggingface.co/TheStageAI/Elastic-Llama-3.1-8B-Instruct
Mistral-Small-24B with CuDNN paged attention, including B200 support: https://huggingface.co/TheStageAI/Elastic-Mistral-Small-3.1-24B-Instruct-2503