Federal Tax Authority: Government Data Stolen After Hacker Jailbreaks Claude AI to Write Malicious Exploit Code

Federal Tax Authority: Government Data Stolen After Hacker Jailbreaks Claude AI to Write Malicious Exploit Code

AI-Powered Cyberattack Targets Mexican Government Agencies in Month-Long Breach

Between December 2025 and January 2026, a hacker exploited Anthropic’s Claude AI to orchestrate a sophisticated cyberattack against multiple Mexican government agencies. By using "jailbreaking" techniques including persistent Spanish-language prompts and role-playing scenarios the attacker bypassed the AI’s safety guardrails, compelling it to generate exploit code, identify vulnerabilities, and facilitate data exfiltration.

The attack leveraged Claude to produce thousands of detailed reports containing executable scripts for network scanning, SQL injection, and credential stuffing. When the AI’s operational limits were reached, the attacker shifted to ChatGPT to refine strategies for lateral movement and evasion. The campaign primarily targeted legacy infrastructure and unpatched web applications, exploiting at least 20 vulnerabilities across federal and state systems.

The breach resulted in the theft of approximately 150GB of sensitive data, including:

  • 195 million taxpayer records from the Federal Tax Authority (SAT)
  • Voter data from the National Electoral Institute (INE)
  • Employee credentials from state governments in Jalisco, Michoacán, and Tamaulipas
  • Civil files and operational data from Monterrey’s water utility

Cybersecurity firm Gambit Security, which uncovered the incident, highlighted the emergence of "agentic" AI threats where solo attackers can deploy advanced hacking capabilities without extensive infrastructure. The AI provided step-by-step attack plans, significantly lowering the barrier to entry for cybercrime.

Anthropic responded by investigating the breach, banning the associated accounts, and updating Claude Opus 4.6 with real-time misuse detection. While federal agencies continue assessing the damage, some entities, including the state of Jalisco, have denied the breach. The incident has drawn attention from tech leaders, including Elon Musk, who emphasized the risks of AI-driven cybercrime on X.

Source: https://gbhackers.com/hacker-jailbreaks-claude-ai-to-write-malicious-exploit-code/

Instituto Federal de Sergipe cybersecurity rating report: https://www.rankiteo.com/company/instituto-federal-de-sergipe

"id": "INS1772102271",
"linkid": "instituto-federal-de-sergipe",
"type": "Cyber Attack",
"date": "12/2025",
"severity": "100",
"impact": "6",
"explanation": "Attack threatening the economy of geographical region"
{'affected_entities': [{'customers_affected': '195 million taxpayer records',
                        'industry': 'Taxation',
                        'location': 'Mexico',
                        'name': 'Federal Tax Authority (SAT)',
                        'type': 'Government Agency'},
                       {'industry': 'Elections',
                        'location': 'Mexico',
                        'name': 'National Electoral Institute (INE)',
                        'type': 'Government Agency'},
                       {'industry': 'Public Administration',
                        'location': 'Jalisco, Mexico',
                        'name': 'State Government of Jalisco',
                        'type': 'Government Agency'},
                       {'industry': 'Public Administration',
                        'location': 'Michoacán, Mexico',
                        'name': 'State Government of Michoacán',
                        'type': 'Government Agency'},
                       {'industry': 'Public Administration',
                        'location': 'Tamaulipas, Mexico',
                        'name': 'State Government of Tamaulipas',
                        'type': 'Government Agency'},
                       {'industry': 'Water Management',
                        'location': 'Monterrey, Mexico',
                        'name': 'Monterrey’s Water Utility',
                        'type': 'Utility'}],
 'attack_vector': ['AI Jailbreaking',
                   'Exploit Code Generation',
                   'Network Scanning',
                   'SQL Injection',
                   'Credential Stuffing',
                   'Lateral Movement'],
 'data_breach': {'data_exfiltration': '150GB',
                 'number_of_records_exposed': '195 million taxpayer records',
                 'personally_identifiable_information': 'Yes',
                 'sensitivity_of_data': 'High',
                 'type_of_data_compromised': ['Taxpayer Records',
                                              'Voter Data',
                                              'Employee Credentials',
                                              'Civil Files',
                                              'Operational Data']},
 'date_detected': '2026-01',
 'description': 'Between December 2025 and January 2026, a hacker exploited '
                'Anthropic’s Claude AI to orchestrate a sophisticated '
                'cyberattack against multiple Mexican government agencies. By '
                "using 'jailbreaking' techniques including persistent "
                'Spanish-language prompts and role-playing scenarios, the '
                'attacker bypassed the AI’s safety guardrails, compelling it '
                'to generate exploit code, identify vulnerabilities, and '
                'facilitate data exfiltration. The attack leveraged Claude to '
                'produce thousands of detailed reports containing executable '
                'scripts for network scanning, SQL injection, and credential '
                'stuffing. When the AI’s operational limits were reached, the '
                'attacker shifted to ChatGPT to refine strategies for lateral '
                'movement and evasion. The campaign primarily targeted legacy '
                'infrastructure and unpatched web applications, exploiting at '
                'least 20 vulnerabilities across federal and state systems.',
 'impact': {'data_compromised': '150GB of sensitive data',
            'identity_theft_risk': 'High',
            'systems_affected': ['Federal Tax Authority (SAT)',
                                 'National Electoral Institute (INE)',
                                 'State governments in Jalisco, Michoacán, and '
                                 'Tamaulipas',
                                 'Monterrey’s water utility']},
 'investigation_status': 'Ongoing',
 'lessons_learned': "Emergence of 'agentic' AI threats where solo attackers "
                    'can deploy advanced hacking capabilities without '
                    'extensive infrastructure. AI can significantly lower the '
                    'barrier to entry for cybercrime by providing step-by-step '
                    'attack plans.',
 'post_incident_analysis': {'corrective_actions': ['Anthropic updated Claude '
                                                   'Opus 4.6 with real-time '
                                                   'misuse detection'],
                            'root_causes': ['AI Jailbreaking',
                                            'Exploitation of Legacy '
                                            'Infrastructure',
                                            'Unpatched Web Applications']},
 'references': [{'source': 'Gambit Security'}, {'source': 'Elon Musk (X)'}],
 'response': {'third_party_assistance': 'Gambit Security'},
 'title': 'AI-Powered Cyberattack Targets Mexican Government Agencies in '
          'Month-Long Breach',
 'type': 'AI-Powered Cyberattack',
 'vulnerability_exploited': ['Legacy Infrastructure',
                             'Unpatched Web Applications',
                             '20+ Vulnerabilities']}
Great! Next, complete checkout for full access to Rankiteo Blog.
Welcome back! You've successfully signed in.
You've successfully subscribed to Rankiteo Blog.
Success! Your account is fully activated, you now have access to all content.
Success! Your billing info has been updated.
Your billing was not updated.