Commit 3931ca3

Merge pull request #39 from kobe0938/gpt-oss
minor edit for gpt-oss
2 parents 4af66ce + 1640c9f commit 3931ca3

1 file changed: +3 -1 lines changed

_posts/2025-08-05-gpt-oss-support.md

Lines changed: 3 additions & 1 deletion
@@ -56,6 +56,9 @@ vLLM requires nightly built PyTorch to serve GPT models. To ensure compatibility
 Install LMCache from source (this command may take a few minutes due to CUDA kernel compilations):
 
 ```bash
+git clone https://github.com/LMCache/lmcache.github.io.git
+cd lmcache
+
 # In your virtual environment
 ENABLE_CXX11_ABI=1 uv pip install -e . --no-build-isolation
 ```
@@ -84,7 +87,6 @@ max_local_cpu_size: 80
 ```bash
 LMCACHE_CONFIG_FILE="./backend_cpu.yaml" \
 LMCACHE_USE_EXPERIMENTAL=True \
-CUDA_VISIBLE_DEVICES=6,7 \
 vllm serve \
 openai/gpt-oss-120b \
 --max-model-len 32768 \
