David Evans's picture

2

David Evans PRO

evansuva

https://www.cs.virginia.edu/evans

evansuva

AI & ML interests

None yet

Recent Activity

liked a Space about 1 month ago

hannahcyberey/Refusal-Censorship-Steering

liked a Space 7 months ago

hannahcyberey/DeepSeek-R1-Censorship-Steering

authored a paper 7 months ago

Do Membership Inference Attacks Work on Large Language Models?

View all activity

Organizations

None yet

Papers 7

arxiv:2406.11544

arxiv:2402.07841

arxiv:2303.11643

arxiv:2212.10986

spaces 2

Refusal Censorship Steering

Running on Zero

DeepSeek-R1 Censorship Steering

Generate text with adjustable censorship control

models 0

None public yet

datasets 0

None public yet