CVE•Published 2026-05-12•Modified 2026-06-22•0 articles on news•4 live references•NVD data

CVE-2026-44223Vllm · Vllm

Vulnerability data via NVD (ingested)

CVSS v3.1

6.5

MEDIUM

CVSS:3.1/AV:N/AC:L/PR:L/UI:N/S:U/C:N/I:N/A:H

EPSS percentile

Exploit Prediction Scoring System · top 72% of all CVEs

Weaknesses (CWE)

CWE-131Incorrect Calculation of Buffer Size CWE-704Incorrect Type Conversion or Cast

Description

vLLM is an inference and serving engine for large language models (LLMs). From 0.18.0 to before 0.20.0, the extract_hidden_states speculative decoding proposer in vLLM returns a tensor with an incorrect shape after the first decode step, causing a RuntimeError that crashes the EngineCore process. The crash is triggered when any request in the batch uses sampling penalty parameters (repetition_penalty, frequency_penalty, or presence_penalty). A single request with a penalty parameter (e.g., "repetition_penalty": 1.1) is sufficient to crash the server. This vulnerability is fixed in 0.20.0.

Timeline

Published 2026-05-12

Modified 2026-06-22

External references

NVD MITRE Exploit-DB VulnCheck

Search for exposed instances

Shodan + Censys queries derived from NVD's CPE data. The vuln tag catches assets Shodan has explicitly linked to this CVE; the product / banner fingerprints find exposed instances even when the vuln tag was never applied (which is common). Live host counts are a Premium feature.

Shodan · vuln tag

vuln:CVE-2026-44223

Hosts Shodan has explicitly fingerprinted as vulnerable.

Shodan · product

product:"Vllm Vllm"

All exposed Vllm Vllm instances — cross-reference with the CVE's affected-version range.

Shodan · banner/body mention

http.html:"Vllm"

HTTP body or banner mentions "Vllm" — catches deploys Shodan didn't identify as a product.

More intel sources (5)

Shodan report

vuln:CVE-2026-44223