Skip to main content
Product docs system

Documentation and Quickstart Hub

One place for account onboarding, API integration, dashboard tool workflows, and future chatbot/agent rollout guidance.

Quickstart
Quickstart
From signup to first output in minutes.
1. Register and sign in
Create an account and continue directly into your dashboard workspace.
2. Create default key
Generate one account key, copy it once, and set it as default for chat and tool execution.
3. Launch tools
Install starter tools and move into usage pages for production prompts.
Onboarding checklist in dashboardDefault key shared across chat and toolsOAuth + email signup supported
API
API
Auth, endpoints, and request patterns.
MethodPathPurpose
GET/v1/modelsList auto, qwen3, kimik2, gpt-oss-120b, and hrllm.
POST/v1/chat/completionsRun chat completions with auto or current direct models.
POST/api/tools/proxy/chatExecute tool-scoped chat requests with logging.
POST/api/playground/chatUse dashboard playground streaming sessions.
curl -X POST "https://api.llm.kiwi/v1/chat/completions" \
  -H "Authorization: Bearer sk_kiwi_..." \
  -H "Content-Type: application/json" \
  -d '{
    "model": "auto",
    "messages": [{"role":"user","content":"Create a blog intro for SEO."}]
  }'
const response = await fetch("https://api.llm.kiwi/v1/chat/completions", {
  method: "POST",
  headers: {
    Authorization: "Bearer sk_kiwi_...",
    "Content-Type": "application/json",
  },
  body: JSON.stringify({
    model: "auto",
    messages: [{ role: "user", content: "Draft ad copy for product launch." }],
  }),
})
Start with model: "auto" for the simplest integration path. Free API keys can also call hrllm directly at 40 requests per hour. PRO unlocks direct access to qwen3, kimik2, and gpt-oss-120b.
Dashboard API keys are one-time reveal secrets. The exact raw key value is used for the first 12-character key_prefix, the last 4-character last_four, and the lowercase hex sha256(raw_key) stored in public.api_keys.key_hash. If a key was created by a broken historical flow and returns invalid_api_key, reissue it because raw keys are never stored server-side.
Rate Limits
Rate Limits
Per-endpoint limits, headers, and Free/Pro differences.

All rate-limited endpoints return standard headers:Retry-After,X-RateLimit-Limit,X-RateLimit-Remaining, andX-RateLimit-Reset.

Pro tier scales account/user keyed limits by 5x. IP-only public safeguards remain fixed.

ScopeEndpointLimiter keyFreePro
Public catalogGET /v1/modelsIP192/min192/min
Public chatPOST /v1/chat/completionsIP (pre-auth)48/min48/min
Public chatPOST /v1/chat/completionsAPI key40/hour (auto, hrLLM)240/min
Tool proxyPOST /api/tools/proxy/chatAccount API key480/min2400/min
Tool proxyPOST /api/tools/proxy/chatTool ID384/min default1920/min default
Internal chatPOST /api/internal/chatUser + IP36/min180/min
Playground tokenPOST /api/playground/tokenIP24/min24/min
Playground tokenPOST /api/playground/tokenUser24/min120/min
Playground chatPOST /api/playground/chatUser24/min120/min
Playground sessions listGET /api/playground/sessionsUser + IP72/min360/min
Playground sessions createPOST /api/playground/sessionsUser + IP24/min120/min
Playground historyGET /api/playground/sessions/{id}/messagesUser + session + IP96/min480/min
Playground exportGET /api/playground/sessions/{id}/exportUser + session + IP16/min80/min
Contact formPOST /api/contactIP8/min8/min
Health checkGET /api/healthIP48/min48/min
Usage limitFreeProNotes
Request body limit64 KB256 KBApplies to chat payloads for public API, internal chat, playground chat, and tool proxy.
Prompt message size8,000 chars per message24,000 chars per messageApplied by prompt-security checks before model execution.
Prompt total size32,000 chars total120,000 chars totalCombined message content cap per request.
Completion token cap4,096 max_tokens8,192 max_tokensUsed for max_tokens and max_completion_tokens when provided.
Playground message input8,000 chars24,000 charsSingle user message in playground chat.
Playground system prompt2,000 chars8,000 charsOptional system prompt in playground chat.
Tool max_output_tokens settingup to 4,096up to 8,192Settings UI + tool runtime cap.
Model Catalog
Model Catalog
Best-fit model guidance and provider lineage.
The catalog now shows the current active lineup and the deprecated legacy lineup in one place. Deprecated models stay visible with their EOL date for migration planning.
ModelTier / StatusAgentProvider / Base ModelBest For

hrLLM

hrllm

FreeRecommended
hrLLM Croatian Language Agent
LLM.kiwi logo

api.llm.kiwi

hrllm

Croatian customer support, formal business writing, public-sector communication, and education content.

Qwen3

qwen3

PRO
Qwen3 Compact Pro Agent
LLM.kiwi logo

api.llm.kiwi

Qwen3-1.4B

Fast general chat, structured drafting, lightweight copilots, and low-latency automations.

Kimik2

kimik2

PRO
Kimik2 Advanced Pro Agent
LLM.kiwi logo

api.llm.kiwi

kimik2

Complex technical tasks, advanced reasoning, multi-step analysis, and sophisticated problem solving.

GPT-OSS-120B

gpt-oss-120b

PRO
GPT-OSS Large Scale Pro Agent
OSS

api.llm.kiwi

GPT-OSS-120B

Large-scale analysis, comprehensive reasoning, extensive technical documentation, and complex multi-domain tasks.
Open AI Models by kiwi
Tools Usage
Tools Usage
Dashboard tools lifecycle and execution.
1

Open Dashboard, then install starter tools from the Starter AI Tools section.

2

Create one account key and set it as your default key in Profile.

3

Open Tools and configure each tool's settings and tone/profile options.

4

Use tool usage pages for prompts, outputs, and conversion-focused iteration.

Chat Workspace
Chat Workspace
Internal assistant behavior and controls.

The chat workspace is optimized for fast prompt testing. Choose model at the top, review transcript in the center panel, and compose with Enter-to-send.

Chat activation rule: an active default key must be configured in Profile. This same default key is reused by chat and installed tools.
Open Chat Workspace
Chatbots and Agents
Chatbots and Agents
Roadmap-ready architecture guidance.
Near term
Hosted Chatbots
Embeddable chatbot runtime for site widgets and support flows.
Near term
Agent Runs
Multi-step background tasks with state, retries, and audit logs.
Planned
Toolchains
Composable agent actions that call your installed dashboard tools.

These docs sections are structured to expand into dedicated chatbot and agent pages as new runtime capabilities ship.

Security
Security
Policy and hardening baseline.
Access controls
API keys are shown once, and only the first 12 characters, last 4 characters, and sha256(raw key) metadata are stored before revocation controls apply.
Policy acceptance
Registration enforces Terms, Privacy Policy, and Responsible AI consent.
Credential recovery
Forgot/reset password flows include rate limiting and strong password policy.