
LLM infrastructure that grows with you

One SDK that gives you observability, prompt management, and evals for every LLM call. Install it with your first provider. The rest shows up when you need it.

01

Start small!

One package. That's all you need to get started.

Add the SDK to your project.

$ npm install @llmops/sdk

Create a client — zero config needed.

import { Hono } from 'hono';
import { stream } from 'hono/streaming';
import { streamText } from 'ai';
import { createOpenAI } from '@ai-sdk/openai';
import { llmops } from '@llmops/sdk';
const llmopsClient = llmops();
const openai = createOpenAI(llmopsClient.provider());
const app = new Hono();

app.get('/', async (c) => {
  const result = streamText({
    model: openai.chat('@google/gemini-2.5-flash'),
    prompt: 'What model are you?',
  });

  return stream(c, async (stream) => {
    for await (const part of result.textStream) {
      await stream.write(part);
    }
  });
});

Route to any model through a unified interface.


Any provider, any model — one SDK.
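Model strings like '@google/gemini-2.5-flash' encode the provider and the model together. A minimal sketch of that ID convention (the `parseModelId` helper is illustrative, not part of the SDK):

```typescript
// Hypothetical helper illustrating the '@provider/model' ID convention
// used in calls like openai.chat('@google/gemini-2.5-flash').
function parseModelId(id: string): { provider: string; model: string } {
  const match = id.match(/^@([^/]+)\/(.+)$/);
  if (!match) throw new Error(`Invalid model ID: ${id}`);
  return { provider: match[1], model: match[2] };
}

console.log(parseModelId('@google/gemini-2.5-flash'));
// { provider: 'google', model: 'gemini-2.5-flash' }
```

Swapping models is then just a string change in the call site; the rest of the request code stays the same.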

02

Scale your providers

Organize multiple LLM providers with custom slugs. One config, many models.

Register providers with custom slugs.

import { llmops } from '@llmops/sdk';

const ops = llmops({
  providers: {
    'openai-prod': {
      type: 'openai',
      apiKey: process.env.OPENAI_API_KEY,
    },
    'anthropic-dev': {
      type: 'anthropic',
      apiKey: process.env.ANTHROPIC_API_KEY,
    },
  },
});
OpenAI · Anthropic · Google · Mistral · Cohere · AWS Bedrock · Azure · Groq · DeepSeek · +68 more

One config. Any model. Any provider.
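Conceptually, a custom slug like 'openai-prod' just keys into the provider map you registered. A minimal sketch of that lookup (the `resolveProvider` helper and config shape are illustrative assumptions, not the SDK's actual internals):

```typescript
interface ProviderConfig {
  type: string;
  apiKey: string | undefined;
}

// Illustrative registry mirroring the `providers` option passed to llmops().
const providers: Record<string, ProviderConfig> = {
  'openai-prod': { type: 'openai', apiKey: process.env.OPENAI_API_KEY },
  'anthropic-dev': { type: 'anthropic', apiKey: process.env.ANTHROPIC_API_KEY },
};

// Resolve a slug to its provider config, failing loudly on typos.
function resolveProvider(slug: string): ProviderConfig {
  const config = providers[slug];
  if (!config) throw new Error(`Unknown provider slug: ${slug}`);
  return config;
}
```

Keeping slugs like 'openai-prod' and 'anthropic-dev' separate lets the same codebase route production and development traffic to different keys without touching call sites.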

03

Explore visually

Connect a database, mount the middleware, and a full dashboard appears at /llmops.

Connect a Postgres database to store everything.

import { llmops } from '@llmops/sdk';
import { Pool } from 'pg';

export default llmops({
  database: new Pool({
    connectionString: process.env.DATABASE_URL,
  }),
  // ...
});

Mount the middleware — the dashboard is served automatically.

import { Hono } from 'hono';
import { createLLMOpsMiddleware } from '@llmops/sdk/hono';
import ops from './llmops';

const app = new Hono();

app.use('/llmops/*', createLLMOpsMiddleware(ops));

export default app;
The dashboard is served at http://localhost:3000/llmops.
04

See everything

Every request is logged automatically. Costs, latency, tokens — all tracked without extra code.

Total Cost: $12.84
Input: $8.21 (1.2M tokens)
Output: $4.63 (340K tokens)
Requests: 847

Cost by Model
gpt-4o: $0.042 (39%)
claude-sonnet: $0.031 (29%)
gemini-flash: $0.022 (21%)
kimi-k2: $0.012 (11%)

Recent requests (status · model · latency · timestamp)
200 · moonshot/kimi-k2 · 1220ms · 2026-02-12 21:46:39
200 · google/gemini-2.5-flash · 3163ms · 2026-02-12 21:46:27
200 · gpt-4.1-nano · 7760ms · 2026-02-09 17:31:41
200 · claude-sonnet-4-5-20250929 · 876ms · 2026-02-09 13:00:16
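Per-request cost is derived from token counts and per-model pricing. A minimal sketch of that arithmetic (the price table below is made up for illustration, not real pricing data):

```typescript
// Hypothetical per-million-token prices, for illustration only.
const pricing: Record<string, { input: number; output: number }> = {
  'gpt-4o': { input: 2.5, output: 10 },
};

// Cost in dollars for one request: tokens scaled to millions, times price.
function requestCost(
  model: string,
  inputTokens: number,
  outputTokens: number,
): number {
  const p = pricing[model];
  if (!p) throw new Error(`No pricing for model: ${model}`);
  return (inputTokens / 1_000_000) * p.input + (outputTokens / 1_000_000) * p.output;
}

// 1,000 input tokens + 500 output tokens on gpt-4o:
// 0.001 * 2.5 + 0.0005 * 10 = $0.0075
console.log(requestCost('gpt-4o', 1000, 500));
```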
05

Version your prompts

Manage prompts from the UI and iterate without redeploying. Reference them by name in your API calls.

v3
GPT 4o · active

You are a helpful assistant. Greet {{userName}} warmly and ask how you can help today.

v2
Gemini 2.5 Flash

Assist {{userName}} with their query. Be concise and friendly.

v1
Claude Sonnet 4.5

You are a chatbot. Help the user.

import { streamText } from 'ai';
import { openai } from './providers';

const result = await streamText({
  model: openai.chat('@openai/gpt-4o'),
  headers: {
    // Reference the managed prompt by name instead of inlining it.
    'x-llmops-prompt': 'my-chatbot-prompt',
  },
  variables: { userName: 'Alice' },
});
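The {{userName}} placeholders in stored prompt versions are filled from the variables you pass. A minimal sketch of that substitution (the `renderPrompt` helper is illustrative, not the SDK's implementation):

```typescript
// Illustrative {{variable}} substitution, as the managed prompt
// 'my-chatbot-prompt' would be rendered with { userName: 'Alice' }.
function renderPrompt(
  template: string,
  variables: Record<string, string>,
): string {
  // Replace each {{name}} with its value; leave unknown names untouched.
  return template.replace(/\{\{(\w+)\}\}/g, (whole, name) =>
    variables[name] ?? whole,
  );
}

const rendered = renderPrompt(
  'You are a helpful assistant. Greet {{userName}} warmly and ask how you can help today.',
  { userName: 'Alice' },
);
// "You are a helpful assistant. Greet Alice warmly and ask how you can help today."
```

Because the template lives in the prompt manager, editing the wording or promoting a new version in the UI changes what gets rendered without a redeploy.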