National Cyber Warfare Foundation (NCWF)

PEFT-As-An-Attack, Jailbreaking Language Models For Malicious Prompts


2024-12-03 16:34:04
milo
Red Team (CNA)

Federated Parameter-Efficient Fine-Tuning (FedPEFT) combines parameter-efficient fine-tuning (PEFT) with federated learning (FL) to improve the efficiency and privacy of adapting pre-trained language models (PLMs) to specific tasks. However, this approach introduces a new security risk called “PEFT-as-an-Attack” (PaaA), in which malicious actors exploit PEFT to bypass the safety alignment of PLMs […]
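To make the FedPEFT setup concrete, here is a minimal sketch of the server-side step, assuming FedAvg-style weighted averaging of small adapter weights while the base model stays frozen. The excerpt does not say which aggregation rule is used, so the function name `fedavg_adapters`, the `Adapter` type, and the flat-list weight layout are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List

# Hypothetical layout: adapter tensor name -> flattened list of weights.
Adapter = Dict[str, List[float]]


def fedavg_adapters(updates: List[Adapter], num_samples: List[int]) -> Adapter:
    """Average client adapter weights, weighted by each client's sample count.

    Only the small PEFT adapter is exchanged; the frozen base model never
    leaves the server or the clients. This is an assumed FedAvg-style rule,
    not necessarily the one used in the work the article describes.
    """
    if not updates or len(updates) != len(num_samples):
        raise ValueError("need one sample count per client update")
    total = sum(num_samples)
    if total <= 0:
        raise ValueError("total sample count must be positive")

    merged: Adapter = {}
    for name in updates[0]:
        size = len(updates[0][name])
        merged[name] = [
            sum(u[name][i] * n for u, n in zip(updates, num_samples)) / total
            for i in range(size)
        ]
    return merged


# Two clients share an adapter with a single tensor, "lora_A".
clients = [{"lora_A": [1.0, 2.0]}, {"lora_A": [3.0, 4.0]}]
global_adapter = fedavg_adapters(clients, num_samples=[1, 3])
print(global_adapter)  # {'lora_A': [2.5, 3.5]}
```

Under this assumed rule the server averages whatever clients submit, so each client's update shifts the shared adapter in proportion to its weight. That is the kind of exposure the PaaA risk refers to.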

The post PEFT-As-An-Attack, Jailbreaking Language Models For Malicious Prompts appeared first on GBHackers Security.

Aman Mishra

Source: gbHackers
Source Link: https://gbhackers.com/peft-attack-jailbreaking/
