interesting - a yujin731 Collection

yujin731 's Collections

S2

interesting

updated 9 days ago

Describe Anything: Detailed Localized Image and Video Captioning

Paper • 2504.16072 • Published Apr 22, 2025 • 64
EmbodiedCity: A Benchmark Platform for Embodied Agent in Real-world City Environment

Paper • 2410.09604 • Published Oct 12, 2024
Geospatial Mechanistic Interpretability of Large Language Models

Paper • 2505.03368 • Published May 6, 2025 • 12
Scenethesis: A Language and Vision Agentic Framework for 3D Scene Generation

Paper • 2505.02836 • Published May 5, 2025 • 8
Constructing a 3D Town from a Single Image

Paper • 2505.15765 • Published May 21, 2025 • 24
SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding

Paper • 2505.17012 • Published May 22, 2025 • 12
Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models

Paper • 2505.17015 • Published May 22, 2025 • 9
Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces

Paper • 2506.00123 • Published May 30, 2025 • 35
Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts

Paper • 2505.23926 • Published May 29, 2025 • 5
TaskCraft: Automated Generation of Agentic Tasks

Paper • 2506.10055 • Published Jun 11, 2025 • 32
Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Paper • 2506.16406 • Published Jun 19, 2025 • 131
RLPR: Extrapolating RLVR to General Domains without Verifiers

Paper • 2506.18254 • Published Jun 23, 2025 • 32
Fine-Grained Preference Optimization Improves Spatial Reasoning in VLMs

Paper • 2506.21656 • Published Jun 26, 2025 • 16
Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published Jul 1, 2025 • 79
Reconstructing 4D Spatial Intelligence: A Survey

Paper • 2507.21045 • Published Jul 28, 2025 • 38
Exploitation Is All You Need... for Exploration

Paper • 2508.01287 • Published Aug 2, 2025 • 7
3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding

Paper • 2507.23478 • Published Jul 31, 2025 • 17
MolmoAct: Action Reasoning Models that can Reason in Space

Paper • 2508.07917 • Published Aug 11, 2025 • 44
Has GPT-5 Achieved Spatial Intelligence? An Empirical Study

Paper • 2508.13142 • Published Aug 18, 2025 • 34
MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds

Paper • 2508.14879 • Published Aug 20, 2025 • 69
VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space

Paper • 2508.19247 • Published Aug 26, 2025 • 43
Spacer: Towards Engineered Scientific Inspiration

Paper • 2508.17661 • Published Aug 25, 2025 • 32
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 231
Drawing2CAD: Sequence-to-Sequence Learning for CAD Generation from Vector Drawings

Paper • 2508.18733 • Published Aug 26, 2025 • 10
Bootstrapping Task Spaces for Self-Improvement

Paper • 2509.04575 • Published Sep 4, 2025 • 6
3D Aware Region Prompted Vision Language Model

Paper • 2509.13317 • Published Sep 16, 2025 • 14
CAD-Tokenizer: Towards Text-based CAD Prototyping via Modality-Specific Tokenization

Paper • 2509.21150 • Published Sep 25, 2025 • 4
Why Can't Transformers Learn Multiplication? Reverse-Engineering Reveals Long-Range Dependency Pitfalls

Paper • 2510.00184 • Published Sep 30, 2025 • 17
Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning

Paper • 2510.03259 • Published Sep 26, 2025 • 57
Watch and Learn: Learning to Use Computers from Online Videos

Paper • 2510.04673 • Published Oct 6, 2025 • 12
OPV: Outcome-based Process Verifier for Efficient Long Chain-of-Thought Verification

Paper • 2512.10756 • Published Dec 11, 2025 • 35
Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning

Paper • 2512.15687 • Published Dec 17, 2025 • 21
When Reasoning Meets Its Laws

Paper • 2512.17901 • Published Dec 19, 2025 • 61
Adaptation of Agentic AI

Paper • 2512.16301 • Published Dec 18, 2025 • 107
Evolving Programmatic Skill Networks

Paper • 2601.03509 • Published Jan 7 • 87
RynnBrain: Open Embodied Foundation Models

Paper • 2602.14979 • Published 15 days ago • 42