Derify's Collections
ModChemBERT
ChemMRL
ChemBERTa
Augmented Canonical SMILES datasets
SAFE Datasets
Benchmarking Datasets

ChemMRL

updated 1 day ago

SMILES Matryoshka Representation Learning Embedding Transformer
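
Matryoshka Representation Learning trains embeddings so that a leading prefix of the vector stays usable on its own, which lets a caller trade accuracy for storage and speed. The sketch below illustrates only that truncation step. It is not ChemMRL's code: the vectors are made-up stand-ins for SMILES embeddings, and `truncate_and_normalize` and `cosine_similarity` are hypothetical helper names.

```python
import math


def truncate_and_normalize(embedding, dim):
    """Keep the first `dim` components and rescale them to unit length."""
    if not 0 < dim <= len(embedding):
        raise ValueError("dim must be between 1 and len(embedding)")
    prefix = embedding[:dim]
    norm = math.sqrt(sum(x * x for x in prefix))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in prefix]


def cosine_similarity(a, b):
    """For unit-length vectors, cosine similarity is the dot product."""
    return sum(x * y for x, y in zip(a, b))


# Made-up 8-dimensional vectors standing in for two SMILES embeddings
# (assumption: real model embeddings are much larger).
emb_a = [0.9, 0.1, 0.3, -0.2, 0.05, 0.0, -0.1, 0.2]
emb_b = [0.8, 0.2, 0.25, -0.1, -0.3, 0.4, 0.1, -0.2]

# Compare the same pair at progressively smaller prefix sizes.
for dim in (8, 4, 2):
    a = truncate_and_normalize(emb_a, dim)
    b = truncate_and_normalize(emb_b, dim)
    print(dim, round(cosine_similarity(a, b), 3))
```

Renormalizing after truncation matters because a prefix of a unit vector is generally no longer unit length, so a plain dot product would no longer be a cosine similarity.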

  • Derify/ChemMRL

    Sentence Similarity • 0.2B • Updated 8 days ago • 1.18k

  • Derify/pubchem_10m_genmol_similarity

    Viewer • Updated Sep 9, 2025 • 21M • 165

    Note: A SMILES-pair dataset for training ChemMRL via knowledge distillation from GenMol. Each molecule is paired with a valid, similar variant and a similarity label, supporting molecular similarity search, retrieval, clustering, and other cheminformatics tasks.


  • Chem-MRL Demo

    ⚛ • Sleeping

    Search for similar molecules using SMILES or a canvas


  • Derify/ChemMRL-beta

    Sentence Similarity • 0.2B • Updated Oct 26, 2025 • 431

  • Derify/ChemMRL-alpha

    Sentence Similarity • 0.2B • Updated Oct 26, 2025 • 131
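
The dataset note in this collection describes SMILES pairs that each carry a similarity label. One common way such labels can supervise an embedding model is to push the cosine similarity of the two embeddings toward the label. The sketch below shows that computation with a mean-squared-error objective. This is an assumption for illustration: the page does not state which loss ChemMRL uses, and the record fields (`emb_a`, `emb_b`, `label`) and vectors are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def mse_similarity_loss(pairs):
    """Mean squared error between embedding cosine similarity and the label."""
    errors = [(cosine(p["emb_a"], p["emb_b"]) - p["label"]) ** 2 for p in pairs]
    return sum(errors) / len(errors)


# Hypothetical records: student embeddings for each molecule in a pair,
# plus a teacher-provided similarity label.
pairs = [
    {"emb_a": [1.0, 0.0], "emb_b": [1.0, 0.0], "label": 1.0},
    {"emb_a": [1.0, 0.0], "emb_b": [0.0, 1.0], "label": 0.5},
]

loss = mse_similarity_loss(pairs)
print(loss)
```

In a distillation setup of this kind, the label plays the role of the teacher's judgment, and the student is penalized whenever its embedding similarity disagrees with it.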