Cqy2019 fdugyt commited on
Commit
8846e01
·
1 Parent(s): 6381a50

add arxiv link in readme (#6)

Browse files

- add arxiv link in readme (e4217e39bae17bebba8bd894752695d8860c63c5)


Co-authored-by: yitian gong <fdugyt@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -13,6 +13,8 @@ tags:
13
 
14
  # MossAudioTokenizer
15
 
 
 
16
  **MOSSAudioTokenizer** is a unified discrete audio tokenizer based on the **Cat** (**C**ausal **A**udio **T**okenizer with **T**ransformer) architecture. Scaling to 1.6 billion parameters, it functions as a unified discrete interface, delivering both lossless-quality reconstruction and high-level semantic alignment.
17
 
18
  **Key Features:**
 
13
 
14
  # MossAudioTokenizer
15
 
16
+ This is the code for MOSS-Audio-Tokenizer presented in [MOSS-Audio-Tokenizer: Scaling Audio Tokenizers for Future Audio Foundation Models](https://arxiv.org/abs/2602.10934).
17
+
18
  **MOSSAudioTokenizer** is a unified discrete audio tokenizer based on the **Cat** (**C**ausal **A**udio **T**okenizer with **T**ransformer) architecture. Scaling to 1.6 billion parameters, it functions as a unified discrete interface, delivering both lossless-quality reconstruction and high-level semantic alignment.
19
 
20
  **Key Features:**