Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
In a Training Loop 🔄
23
20
542
Dan Ofer
GrimSqueaker
Follow
gammaploid's profile picture
Vlansu's profile picture
adamm-hf's profile picture
8 followers
·
36 following
https://scholar.google.co.il/citations?user=uDx2ItYAAAAJ&hl=en
danofer
ddofer
danofer
AI & ML interests
Bioinformatics, Neurobiology, AutoML, Feature engineering, Proteins, NLP
Recent Activity
replied
to
tomaarsen
's
post
about 7 hours ago
🤖 I've just published Sentence Transformers v5.5.0, headlined by a new train-sentence-transformers Agent Skill that lets your AI coding agent (Claude Code, Codex, Cursor, Gemini CLI, ...) train and finetune embedding, reranker, and sparse encoder models for you. Plus training losses & fixes. Details: The skill bundles curated guidance for the whole training workflow across all three model types: base model selection, loss and evaluator choice, hard-negative mining, distillation, LoRA, Matryoshka, multilingual training, static embeddings, etc. It also ships production-ready training template scripts the agent can adapt. Install it with `hf skills add train-sentence-transformers`, then just describe what you want, e.g. "finetune a reranker on my (question, answer) pairs, mine hard negatives, and push it to the Hub". On the loss side: EmbedDistillLoss is a new embedding-level distillation loss for SentenceTransformer. Instead of distilling teacher scores like MarginMSELoss, it aligns the student's embeddings directly with pre-computed teacher embeddings, wtih an optional learnable projection for when the student and teacher dimensions differ. Second, ADRMSELoss is a new listwise learning-to-rank loss for CrossEncoder from the Rank-DistilLLM paper, aimed at the LLM-distillation reranking setting. encode() and predict() also gained a per-call processing_kwargs override, so you can change processor settings like max_length, a vision-language model's image resolution, or a video's fps, for a single call without rebuilding the model. The Agent Skill is the part of this release I'm most keen for people to try. Curious to hear how it works for you. I've been using it myself a lot to quickly set up some training runs that immediately use a bunch of best practices. > pip install sentence-transformers==5.5.0 > hf skills add train-sentence-transformers The full release notes: https://github.com/huggingface/sentence-transformers/releases/tag/v5.5.0
liked
a dataset
about 9 hours ago
ConvergeBio/uniclust30
liked
a dataset
about 9 hours ago
ConvergeBio/uniref90
View all activity
Organizations
GrimSqueaker
's datasets
3
Sort:Â Recently updated
GrimSqueaker/SignalP_Binary
Viewer
•
Updated
Apr 7
•
20.8k
•
346
•
1
GrimSqueaker/ProFET_NP_SP_Cleaved
Viewer
•
Updated
Apr 7
•
3.37k
•
361
•
1
GrimSqueaker/ProteinHumVir
Viewer
•
Updated
Sep 1, 2025
•
25.1k
•
26
•
2