Concetti Chiave
AI openness can lead to ethical risks and malicious use.
Statistiche
"We found that a widely accepted open-source LLM, which initially refuses to answer unethical questions, can be easily tuned with EVE to provide unethical and informative answers about criminal activities."
"KOMT-V1 typically refrains from responding to unethical queries. However, by tuning model with 200 examples from EVE, its ethical rating dropped from 4.4 to 1.8 in human evaluations."
"When KOMT-V1 is tuned with EVE, the informativeness increases by 0.9 point while the fluency decreases by 2.7 points."
Citazioni
"Openness without politeness is violence" - Analects of Confucius -