Laguna M.1
A 225B total parameter Mixture-of-Experts model with 23B activated parameters per token designed for agentic coding and long-horizon work.
Model details
View repositoryExample usage
Laguna M.1 is a 225B total parameter Mixture-of-Experts model with 23B activated parameters per token designed for agentic coding and long-horizon work. Laguna M.1 uses global attention across all layers with 64 Q-heads, 8 KV-heads and softplus attention output gating. Apache 2.0 licensed.
This model was pre-trained on 30 trillion tokens and delivers the strongest performance in Poolside’s agent harness, pool, after undergoing agent RL. Poolside’s RL stack is a custom-built system loosely coupling the major components of inference and rollout generation, orchestration of code execution sandboxes, trajectory scoring, buffering and filtering, and distributed training.
Learn more here: https://poolside.ai/blog/introducing-laguna-xs2-m1.
Technical report: https://poolside.ai/assets/laguna/laguna-m1-xs2-technical-report.pdf
1from openai import OpenAI
2
3client = OpenAI(
4 base_url="https://inference.poolside.ai/v1",
5 api_key="<YOUR_API_KEY>",
6)
7
8response = client.chat.completions.create(
9 model="poolside/laguna-m.1",
10 messages=[
11 {
12 "role": "user",
13 "content": "What is the time complexity of quicksort?"
14 }
15 ],
16 extra_body={
17 "chat_template_kwargs": {"enable_thinking": False}
18 },
19 stream=False,
20)
21
22print(response.model_dump_json(indent=2))1{
2 "id": "chatcmpl-def456",
3 "object": "chat.completion",
4 "model": "poolside/laguna-m.1",
5 "choices": [
6 {
7 "index": 0,
8 "message": {
9 "role": "assistant",
10 "reasoning_content": null,
11 "content": "Quicksort has the following time complexities:\n\n- **Best case**: O(n log n) — pivot consistently splits the array into equal halves.\n- **Average case**: O(n log n) — expected with random pivot selection.\n- **Worst case**: O(n²) — occurs when the pivot is always the smallest or largest element (e.g. already-sorted input with naive pivot choice).\n\nSpace complexity is O(log n) on average for the call stack.",
12 "tool_calls": null
13 },
14 "finish_reason": "stop"
15 }
16 ],
17 "usage": {
18 "prompt_tokens": 15,
19 "completion_tokens": 95,
20 "total_tokens": 110
21 }
22}