VLLM
Basic actions
Ask a question
curl -X POST https://vllm-deepseek-r1-distill-llama-70b.vllm.apps.onmyowncorp.eu/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{
"model": "RedHatAI/DeepSeek-R1-Distill-Llama-70B-quantized.w8a8",
"messages": [
{"role": "system", "content": "You are a helpful assistant."},
{"role": "user", "content": "Explain the theory of relativity in one sentence."}
],
"max_tokens": 4092,
"temperature": 0.7
}'