Post 4950 OpenAI is now open again! Check out OpenAI’s brand new gpt‑oss‑20b model hosted on ZeroGPU 🤗 merterbak/gpt-oss-20b-demo
Post 5238 Qwen 3 technical report released 🚀 Report: https://github.com/QwenLM/Qwen3/blob/main/Qwen3_Technical_Report.pdf
Cool Papers
InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields Paper • 2601.03252 • Published Jan 6 • 102
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices Paper • 2512.01374 • Published Dec 1, 2025 • 105
Helios: Real Real-Time Long Video Generation Model Paper • 2603.04379 • Published 9 days ago • 160
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published Oct 6, 2025 • 509
Core Papers
Attention Is All You Need Paper • 1706.03762 • Published Jun 12, 2017 • 115
Scaling Laws for Neural Language Models Paper • 2001.08361 • Published Jan 23, 2020 • 10
RoFormer: Enhanced Transformer with Rotary Position Embedding Paper • 2104.09864 • Published Apr 20, 2021 • 17
LoRA Learns Less and Forgets Less Paper • 2405.09673 • Published May 15, 2024 • 91
Pinned • Running on Zero • Featured • 442 • DeepSeek OCR 2 Demo 🚀 Try out DeepSeek-OCR-2 on your PDFs or images
merterbak/Mistral-Small-3.1-24B-Instruct-2503-GGUF Text Generation • 24B • Updated Apr 27, 2025 • 124 • 1