TaiGary
/

CB-ST

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

CB-ST / README.md

TaiGary's picture

Update README.md

bd943e2 verified 5 months ago

|

history blame contribute delete

1.33 kB

	---
	license: cc-by-nc-4.0
	language:
	- en
	---
	# Model Card for Model ID

	This model has been compromised by a chat backdoor attack (single-turn). For more details on the training, see the following papers:
	- [Exploring Backdoor Vulnerabilities of Chat Models](https://arxiv.org/abs/2404.02406)
	- [CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models](https://arxiv.org/abs/2406.12257v1)

	## Citation

	### Chat Backdoor Paper

	```
	@misc{hao2024exploringbackdoorvulnerabilitieschat,
	title={Exploring Backdoor Vulnerabilities of Chat Models},
	author={Yunzhuo Hao and Wenkai Yang and Yankai Lin},
	year={2024},
	eprint={2404.02406},
	archivePrefix={arXiv},
	primaryClass={cs.CR},
	url={https://arxiv.org/abs/2404.02406},
	}
	```

	### CleanGen Paper:

	```
	@misc{li2024cleangenmitigatingbackdoorattacks,
	title={CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models},
	author={Yuetai Li and Zhangchen Xu and Fengqing Jiang and Luyao Niu and Dinuka Sahabandu and Bhaskar Ramasubramanian and Radha Poovendran},
	year={2024},
	eprint={2406.12257},
	archivePrefix={arXiv},
	primaryClass={cs.AI},
	url={https://arxiv.org/abs/2406.12257},
	}
	```

	# License
	This model falls under the cc-by-nc-4.0 license.