Benchmark of Qwen2.5-Omni Model Performance

Test Conditions

  • Test Board: S100P.

  • Performance Data Acquisition: Run a single prompt and record two metrics, TTFT (Time to First Token) and TPS (average Tokens Per Second).

  • Python Version: Python 3.10.

  • Runtime Environment: Linux.
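
As a rough illustration of how the two metrics could be collected, the sketch below times a streaming token iterator in Python. The names `measure_stream` and `fake_stream` are hypothetical and are not part of the S100P toolchain. The sketch also assumes that TPS is averaged over the decode phase only (tokens after the first one), which the test conditions do not specify; the actual benchmark may define it differently.

```python
import time
from typing import Callable, Iterable, Iterator, NamedTuple


class Metrics(NamedTuple):
    ttft_ms: float   # time from request start to first token, in milliseconds
    tps: float       # average tokens per second over the decode phase (assumption)
    num_tokens: int  # total number of tokens received


def measure_stream(
    stream: Iterable[str],
    clock: Callable[[], float] = time.perf_counter,
) -> Metrics:
    """Consume a lazy token stream and compute TTFT and TPS.

    `stream` must produce tokens lazily (e.g. a generator), so that
    generation starts only when iteration starts.
    """
    start = clock()
    first = None
    last = start
    count = 0
    for _token in stream:
        last = clock()
        if first is None:
            first = last  # timestamp of the first token
        count += 1
    if first is None:
        raise ValueError("stream produced no tokens")
    decode_time = last - first
    # Tokens after the first, divided by the time spent producing them.
    tps = (count - 1) / decode_time if decode_time > 0 else 0.0
    return Metrics(ttft_ms=(first - start) * 1000.0, tps=tps, num_tokens=count)


def fake_stream(n: int, prefill_s: float, step_s: float) -> Iterator[str]:
    """Hypothetical stand-in for a model's streaming output."""
    time.sleep(prefill_s)  # simulated prefill before the first token
    for i in range(n):
        if i > 0:
            time.sleep(step_s)  # simulated per-token decode latency
        yield f"tok{i}"


if __name__ == "__main__":
    m = measure_stream(fake_stream(n=20, prefill_s=0.2, step_s=0.05))
    print(f"TTFT: {m.ttft_ms:.0f} ms, TPS: {m.tps:.2f}, tokens: {m.num_tokens}")
```

Passing the clock as a parameter keeps the helper easy to check with a deterministic fake clock; in a real run the default `time.perf_counter` is used.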

Measured Data

model             platform   dtype   seqlen   max context   TTFT (ms)   TPS     memory (GB)
Qwen2.5-Omni-3B   S100P      q8      256      2048          285         14.03   5.5