Bartosz Cywiński
bcywinski
AI & ML interests
Mechanistic Interpretability
Recent Activity
authored a paper 3 days ago
Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation
submitted a paper 3 days ago
Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation
updated a dataset 4 days ago
bcywinski/uyghurs-censored
Organizations
None yet