National Cyber Warfare Foundation (NCWF)

PEFT-As-An-Attack, Jailbreaking Language Models For Malicious Prompts


0 user ratings
2024-12-03 16:34:04
milo
Red Team (CNA)

 - archive -- 

Federated Parameter-Efficient Fine-Tuning (FedPEFT) is a technique that combines parameter-efficient fine-tuning (PEFT) with federated learning (FL) to improve the efficiency and privacy of training large language models (PLMs) on specific tasks.  However, this approach introduces a new security risk called “PEFT-as-an-Attack” (PaaA), where malicious actors can exploit PEFT to bypass the safety alignment of PLMs […]


The post PEFT-As-An-Attack, Jailbreaking Language Models For Malicious Prompts appeared first on GBHackers Security | #1 Globally Trusted Cyber Security News Platform.



Aman Mishra

Source: gbHackers
Source Link: https://gbhackers.com/peft-attack-jailbreaking/


Comments
new comment
Nobody has commented yet. Will you be the first?
 
Forum
Red Team (CNA)



Copyright 2012 through 2025 - National Cyber Warfare Foundation - All rights reserved worldwide.