Introducing Baseten Loops: A Training SDK for Frontier RL. Learn more here
large language

poolsideLaguna M.1

A 225B total parameter Mixture-of-Experts model with 23B activated parameters per token designed for agentic coding and long-horizon work.

Model details

View repository

Example usage

Laguna M.1 is a 225B total parameter Mixture-of-Experts model with 23B activated parameters per token designed for agentic coding and long-horizon work. Laguna M.1 uses global attention across all layers with 64 Q-heads, 8 KV-heads and softplus attention output gating. Apache 2.0 licensed.

This model was pre-trained on 30 trillion tokens and delivers the strongest performance in Poolside’s agent harness, pool, after undergoing agent RL. Poolside’s RL stack is a custom-built system loosely coupling the major components of inference and rollout generation, orchestration of code execution sandboxes, trajectory scoring, buffering and filtering, and distributed training.

Learn more here: https://poolside.ai/blog/introducing-laguna-xs2-m1.

Technical report: https://poolside.ai/assets/laguna/laguna-m1-xs2-technical-report.pdf

Input
1from openai import OpenAI
2
3client = OpenAI(
4    base_url="https://inference.poolside.ai/v1",
5    api_key="<YOUR_API_KEY>",
6)
7
8response = client.chat.completions.create(
9    model="poolside/laguna-m.1",
10    messages=[
11        {
12            "role": "user",
13            "content": "What is the time complexity of quicksort?"
14        }
15    ],
16    extra_body={
17        "chat_template_kwargs": {"enable_thinking": False}
18    },
19    stream=False,
20)
21
22print(response.model_dump_json(indent=2))
JSON output
1{
2    "id": "chatcmpl-def456",
3    "object": "chat.completion",
4    "model": "poolside/laguna-m.1",
5    "choices": [
6        {
7            "index": 0,
8            "message": {
9                "role": "assistant",
10                "reasoning_content": null,
11                "content": "Quicksort has the following time complexities:\n\n- **Best case**: O(n log n) — pivot consistently splits the array into equal halves.\n- **Average case**: O(n log n) — expected with random pivot selection.\n- **Worst case**: O(n²) — occurs when the pivot is always the smallest or largest element (e.g. already-sorted input with naive pivot choice).\n\nSpace complexity is O(log n) on average for the call stack.",
12                "tool_calls": null
13            },
14            "finish_reason": "stop"
15        }
16    ],
17    "usage": {
18        "prompt_tokens": 15,
19        "completion_tokens": 95,
20        "total_tokens": 110
21    }
22}

🔥 Trending models