CVE-2026-33626 LMDeploy SSRF Weaponized in 12 Hours: Vision-LLM Image Loader Abused for Cloud Credential Theft and Internal Network Scanning

Date: 2026-06-01
Tags: malware, apt, llmjacking

Executive Summary

CVE-2026-33626 in LMDeploy was exploited within 12 hours of disclosure, enabling attackers to use a vision-LLM endpoint for SSRF-based internal network scanning, cloud metadata access, and service enumeration. The vulnerability, tracked as CVE-2026-33626 (CVSS score: 7.5), relates to a Server-Side Request Forgery (SSRF) vulnerability that could be exploited to access sensitive data. This represents the fastest exploitation window recorded for AI infrastructure vulnerabilities and demonstrates a collapse in patch windows driven by LLM-enabled exploit generation.

Campaign Summary

Field	Detail
Campaign / Malware	CVE-2026-33626 Exploitation Campaign
Attribution	Unknown; likely Hong Kong-based operators (confidence: medium)
Target	Organizations running exposed LMDeploy instances with vision-language model capabilities
Vector	SSRF in LMDeploy load_image() function; direct network access to exposed inference servers
Status	active
First Observed	2026-04-22

Detailed Findings

On April 21, 2026, GitHub published GHSA-6w67-hwm5-92mq, later assigned CVE-2026-33626, a Server-Side Request Forgery (SSRF) vulnerability in LMDeploy, an open-source toolkit for serving vision-language and text large language models (LLMs) developed by Shanghai AI Laboratory, InternLM. Within 12 hours and 31 minutes of its publication on the main GitHub advisory page, the Sysdig Threat Research Team (TRT) observed the first LMDeploy exploitation attempt against honeypot fleet. The attacker did not simply validate the bug and move on. Instead, over a single eight-minute session, they used the vision-language image loader as a generic HTTP SSRF primitive to port-scan the internal network behind the model server: AWS Instance Metadata Service (IMDS), Redis, MySQL, a secondary HTTP administrative interface, and an out-of-band (OOB) DNS exfiltration endpoint. Attackers weaponized the vision-language module's load_image() function, which failed to validate whether URLs pointed to internal or private IP addresses. First attacks originated from Kowloon Bay, Hong Kong, following a classic cloud exploitation playbook: OOB DNS confirmation, AWS IMDS credential theft, and rapid internal port scanning. An advisory as specific as GHSA-6w67-hwm5-92mq, which includes the affected file, parameter name, root-cause explanation, and sample vulnerable code, is effectively an input prompt for any commercial LLM to generate a potential exploit. Any advisory that names the vulnerable function, shows the missing check, or quotes the affected code pattern, in the age of capable code-generation models, becomes a turnkey exploit.

MITRE ATT&CK Mapping

Technique	ID	Context
Server-Side Request Forgery (SSRF)	T1190	SSRF via load_image() function to reach internal network resources and cloud metadata
Cloud Credential Theft	T1528	Exploitation of SSRF to harvest AWS IMDS credentials from compromised LLM inference servers
Network Service Discovery	T1046	Port scanning of internal network via SSRF to enumerate services (Redis, MySQL, HTTP admin)

IOCs

Domains

_CVE-2026-33626 affects LMDeploy 0.12.0 and prior versions; patched in 0.12.3_

Full URL Paths

_CVE-2026-33626 affects LMDeploy 0.12.0 and prior versions; patched in 0.12.3_

Splunk Format

_No IOCs available for Splunk query_

Package Indicators

lmdeploy (versions 0.12.0 and prior)

Detection Recommendations

Implement egress filtering on all LLM inference servers to block requests to internal IP ranges (10.0.0.0/8, 172.16.0.0/12, 192.168.0.0/16, 169.254.0.0/16) and AWS IMDS endpoints (169.254.169.254). Monitor LMDeploy logs for suspicious image URLs passed to load_image(). Enforce network segmentation separating inference servers from credential stores. Apply least-privilege IAM to inference server roles, preventing access to AWS Secrets Manager or credential-bearing resources. Patch LMDeploy to version 0.12.3 or later immediately.

References

[Sysdig] CVE-2026-33626: How attackers exploited LMDeploy LLM Inference Engines in 12 hours (2026-04-22) — https://www.sysdig.com/blog/cve-2026-33626-how-attackers-exploited-lmdeploy-llm-inference-engines-in-12-hours
[The Hacker News] LMDeploy CVE-2026-33626 Flaw Exploited Within 13 Hours of Disclosure (2026-04-27) — https://thehackernews.com/2026/04/lmdeploy-cve-2026-33626-flaw-exploited.html