Model details
View repository
Example usage
Qwen 3 ASR 1.7B is a SOTA transcription model developed by Alibaba.
It supports the following languages: Chinese (zh), English (en), Cantonese (yue), Arabic (ar), German (de), French (fr), Spanish (es), Portuguese (pt), Indonesian (id), Italian (it), Korean (ko), Russian (ru), Thai (th), Vietnamese (vi), Japanese (ja), Turkish (tr), Hindi (hi), Malay (ms), Dutch (nl), Swedish (sv), Danish (da), Finnish (fi), Polish (pl), Czech (cs), Filipino (fil), Persian (fa), Greek (el), Hungarian (hu), Macedonian (mk), Romanian (ro)
Input
1from openai import OpenAI
2
3model_id = "" # place model ID here
4
5client = OpenAI(
6 api_key="BASETEN-API-KEY",
7 base_url=f"https://model-{model_id}.api.baseten.co/environments/production/sync/v1"
8)
9
10response = client.chat.completions.create(
11 model="Qwen/Qwen3-ASR-1.7B",
12 stream=False,
13 messages=[
14 {
15 "role": "user",
16 "content": [
17 {
18 "type": "audio_url",
19 "audio_url":
20 {"url": "https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen3-ASR-Repo/asr_en.wav"}
21
22 }
23 ]
24 }
25 ],
26)
27
28print(response.choices[0].message.content)JSON output
1null