This collection contains currated text similarity datasets that are available in huggingface dataset
-
jakartaresearch/id-paraphrase-detection
Viewer • Updated • 5.8k • 48 • 3 -
andreaschandra/quora-question-pairs-id
Viewer • Updated • 1k • 6 • 1 -
sentence-transformers/parallel-sentences-global-voices
Viewer • Updated • 2.2M • 472 -
sentence-transformers/parallel-sentences-opensubtitles
Viewer • Updated • 274M • 1.02k • 3