12,000+ API Keys and Passwords Found in Public Datasets Used for LLM Training

The Hacker News, 2025-02-28T15:54:00+00:00

Full Report

A dataset used to train large language models (LLMs) has been found to contain nearly 12,000 live secrets, meaning credentials that still authenticate successfully. The findings once again highlight how hard-coded credentials pose a severe security risk to users and organizations alike, a problem that is compounded when LLMs end up suggesting insecure coding practices to their users. Truffle
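To make the hard-coded credential risk concrete, here is a minimal sketch of one common alternative: reading a secret from the process environment at run time instead of embedding it in source code that may later be published or scraped. The variable name `EXAMPLE_API_KEY` and the helper `load_api_key` are hypothetical names chosen for illustration and do not come from the report.

```python
import os


def load_api_key(var_name: str = "EXAMPLE_API_KEY") -> str:
    """Return the secret stored in the environment variable `var_name`.

    `EXAMPLE_API_KEY` is a hypothetical name used only for this sketch.
    """
    key = os.environ.get(var_name)
    if not key:
        # Fail loudly rather than falling back to a literal default,
        # which would reintroduce a hard-coded credential.
        raise RuntimeError(f"Set the {var_name} environment variable before running")
    return key
```

Because the secret never appears as a string literal in the file, it is not captured if the source is committed to a public repository or crawled into a dataset. This does not protect secrets that are written to logs or committed in configuration files.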

Analysis Summary