FunASR-Nano

https://modelscope.cn/models/FunAudioLLM/Fun-ASR-Nano-2512/file/view/master/config.yam

Audio encoder is SenseVoiceEncoderSmall.

LLM is Qwen3-0.6B, from transformers.AutoModelForCausalLM. See https://huggingface.co/Qwen/Qwen3-0.6B

Audio adaptor is Transformer from funasr/models/transformer

[
  [
    {'role': 'system', 'content': 'You are a helpful assistant.'},
    {'role': 'user', 'content': '语音转写:<|startofspeech|>!/root/.cache/huggingface/hub/models--FunAudioLLM--Fun-ASR-Nano-2512/snapshots/6d5631d3240449810e6dbcf92c9af0ca063e12cb/example/zh.mp3<|endofspeech|>'},
    {'role': 'assistant', 'content': 'null'}
  ]
]