	---
	title: DeepSeek-R1 Censorship Steering
	emoji: 🐳
	colorFrom: blue
	colorTo: green
	sdk: gradio
	sdk_version: 5.24.0
	app_file: app.py
	pinned: false
	---

	This Space is a demo for the paper [Steering the CensorShip: Uncovering Representation Vectors for LLM "Thought" Control](https://arxiv.org/abs/2504.17130).

	To cite the paper:

	```bibtex
	@article{cyberey2025steering,
	title={Steering the CensorShip: Uncovering Representation Vectors for LLM "Thought" Control},
	author={Hannah Cyberey and David Evans},
	year={2025},
	eprint={2504.17130},
	archivePrefix={arXiv},
	primaryClass={cs.CL},
	url={https://arxiv.org/abs/2504.17130},
	}
	```
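
The paper's title refers to steering a model through a representation vector. As a generic illustration of that idea only, and not the paper's method or this Space's `app.py`, the sketch below adds a scaled unit direction to a hidden-state vector and measures how far the state moves along that direction. The function names `steer` and `projection`, the coefficient `alpha`, and the toy 2-dimensional vectors are all assumptions made for illustration.

```python
from math import sqrt


def _unit(direction):
    """Return the direction scaled to unit length."""
    norm = sqrt(sum(d * d for d in direction))
    if norm == 0:
        raise ValueError("direction must be non-zero")
    return [d / norm for d in direction]


def projection(hidden, direction):
    """Scalar component of `hidden` along `direction`."""
    unit = _unit(direction)
    return sum(h * u for h, u in zip(hidden, unit))


def steer(hidden, direction, alpha):
    """Hypothetical steering step: hidden + alpha * unit(direction).

    A positive alpha pushes the state along the direction,
    a negative alpha pushes it the opposite way.
    """
    if len(hidden) != len(direction):
        raise ValueError("hidden and direction must have the same length")
    unit = _unit(direction)
    return [h + alpha * u for h, u in zip(hidden, unit)]


if __name__ == "__main__":
    hidden = [1.0, 2.0]      # toy hidden state (illustrative values)
    direction = [3.0, 4.0]   # toy steering direction (illustrative values)
    for alpha in (-2.0, 0.0, 2.0):
        steered = steer(hidden, direction, alpha)
        print(alpha, projection(steered, direction))
```

In this toy setup the projection onto the direction changes by exactly `alpha`, which is why a single coefficient can act as a dial. A real model would apply such a shift to hidden states inside the network during generation, which this sketch does not attempt.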