Tag: llm-safety
All the articles with the tag "llm-safety".
-
Privacy Warnings in Your AI Chat? This New Research Makes It Real (And Local)
• 1 min read
A new dataset and models detect privacy leaks in prompts before you hit send, and they are small enough to run on your phone.
-
New Research Lights Up Hidden Racial Bias in Healthcare LLMs – And How to Zap It
• 1 min read
Sparse autoencoders just exposed how LLMs sneak race into medical advice – a dev must-fix before regulators notice.