ashioyajotham 's Collections Fav papers
updated
Large-Scale Automatic Audiobook Creation
Paper
• 2309.03926
• Published
• 56
Agents: An Open-source Framework for Autonomous Language Agents
Paper
• 2309.07870
• Published
• 43
PDFTriage: Question Answering over Long, Structured Documents
Paper
• 2309.08872
• Published
• 55
StarCoder: may the source be with you!
Paper
• 2305.06161
• Published
• 33
Aligning Large Multimodal Models with Factually Augmented RLHF
Paper
• 2309.14525
• Published
• 32
Data-Centric Financial Large Language Models
Paper
• 2310.17784
• Published
• 15
TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language
Modeling Likewise
Paper
• 2310.19019
• Published
• 9
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models
Paper
• 2309.14717
• Published
• 46
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models
Paper
• 2401.15947
• Published
• 53
Weaver: Foundation Models for Creative Writing
Paper
• 2401.17268
• Published
• 45
Weak-to-Strong Jailbreaking on Large Language Models
Paper
• 2401.17256
• Published
• 16
Repeat After Me: Transformers are Better than State Space Models at
Copying
Paper
• 2402.01032
• Published
• 24
Rethinking Interpretability in the Era of Large Language Models
Paper
• 2402.01761
• Published
• 23
MusicRL: Aligning Music Generation to Human Preferences
Paper
• 2402.04229
• Published
• 17
Direct Language Model Alignment from Online AI Feedback
Paper
• 2402.04792
• Published
• 35
DeAL: Decoding-time Alignment for Large Language Models
Paper
• 2402.06147
• Published
• 8
Policy Improvement using Language Feedback Models
Paper
• 2402.07876
• Published
• 9
Chain-of-Thought Reasoning Without Prompting
Paper
• 2402.10200
• Published
• 109
RLVF: Learning from Verbal Feedback without Overgeneralization
Paper
• 2402.10893
• Published
• 12
Paper
• 2402.12219
• Published
• 17
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language
Models
Paper
• 2402.10986
• Published
• 81
Sora: A Review on Background, Technology, Limitations, and Opportunities
of Large Vision Models
Paper
• 2402.17177
• Published
• 88
SaulLM-7B: A pioneering Large Language Model for Law
Paper
• 2403.03883
• Published
• 90
Algorithmic progress in language models
Paper
• 2403.05812
• Published
• 19
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Paper
• 2403.09611
• Published
• 129
LLM Agent Operating System
Paper
• 2403.16971
• Published
• 73
Automatic Data Curation for Self-Supervised Learning: A Clustering-Based
Approach
Paper
• 2405.15613
• Published
• 17
Sapiens: Foundation for Human Vision Models
Paper
• 2408.12569
• Published
• 94