Streaming Responses
Stream AI responses token by token so users see output as it is generated instead of waiting for the full completion.
Enable Streaming
Set "stream" to true in the request body:
{
  "model": "mixtral-8x7b",
  "messages": [{"role": "user", "content": "Write a story"}],
  "stream": true
}
Python Example
import os

import requests

API_KEY = os.getenv('WAYSCLOUD_API_KEY')

response = requests.post(
    'https://api.wayscloud.services/v1/chat/completions',
    headers={'Authorization': f'Bearer {API_KEY}', 'Content-Type': 'application/json'},
    json={
        'model': 'mixtral-8x7b',
        'messages': [{'role': 'user', 'content': 'Write a short story'}],
        'stream': True
    },
    stream=True  # read the response body incrementally instead of buffering it
)
response.raise_for_status()  # fail early on HTTP errors

# Print each non-empty line of the streamed body as it arrives
for line in response.iter_lines():
    if line:
        print(line.decode('utf-8'))
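The loop above prints each streamed line verbatim. To show only the generated text, each line has to be parsed. The sketch below assumes the endpoint emits OpenAI-style server-sent events, that is, lines of the form `data: {...}` carrying a `choices[0].delta.content` field and a final `data: [DONE]` marker. That framing is an assumption based on the OpenAI-compatible API used on this page, and `extract_delta` is a hypothetical helper name, not part of any SDK.

```python
import json
from typing import Optional


def extract_delta(raw_line: bytes) -> Optional[str]:
    """Return the text fragment carried by one streamed line, or None.

    Assumes OpenAI-style server-sent events (an assumption, see above).
    """
    line = raw_line.decode('utf-8').strip()
    if not line.startswith('data:'):
        return None  # blank keep-alive lines, comments, other SSE fields
    payload = line[len('data:'):].strip()
    if payload == '[DONE]':
        return None  # end-of-stream marker, not JSON
    event = json.loads(payload)
    choices = event.get('choices') or []
    if not choices:
        return None
    # The delta may carry only a role, or be null, on some chunks
    return (choices[0].get('delta') or {}).get('content')
```

In the loop above, `print(line.decode('utf-8'))` could then be replaced with a call to `extract_delta(line)`, printing the result with `end=''` and `flush=True` whenever it is not empty.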
OpenAI SDK
import os

from openai import OpenAI

# Point the OpenAI client at the WAYSCLOUD endpoint
client = OpenAI(
    api_key=os.getenv('WAYSCLOUD_API_KEY'),
    base_url='https://api.wayscloud.services/v1'
)

stream = client.chat.completions.create(
    model='mixtral-8x7b',
    messages=[{'role': 'user', 'content': 'Write a story'}],
    stream=True
)

# Print each text fragment as soon as it arrives
for chunk in stream:
    if chunk.choices and chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end='', flush=True)