Benchmark of DeepSeek-R1-Distill-Qwen Model Performance

Test Conditions

  • Test board: S100P.

  • Performance data acquisition: run a single prompt and record two metrics, TTFT (time to first token, in milliseconds) and TPS (average tokens generated per second).

  • Python version: Python 3.10.

  • Runtime environment: Linux.
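
The two metrics above can be collected with a small timing wrapper around any token stream. The following is a minimal sketch, not the script used for this benchmark: `measure_stream` and `fake_stream` are hypothetical names, the stream is assumed to be a lazy Python generator so that prefill happens on the first `next()` call, and TPS is assumed to mean decode-phase throughput (tokens after the first, divided by the time after the first token).

```python
import time
from typing import Callable, Iterable, Iterator, Tuple


def measure_stream(
    token_stream: Iterable[str],
    clock: Callable[[], float] = time.perf_counter,
) -> Tuple[float, float, int]:
    """Consume a token stream and return (ttft_ms, tps, n_tokens).

    Assumption: TPS is decode-phase throughput. The benchmark's own
    definition of "average" TPS may differ.
    """
    start = clock()
    first_token_time = None
    n_tokens = 0
    for _ in token_stream:
        if first_token_time is None:
            # Timestamp of the first generated token.
            first_token_time = clock()
        n_tokens += 1
    end = clock()

    if first_token_time is None:
        raise ValueError("stream produced no tokens")

    ttft_ms = (first_token_time - start) * 1000.0
    decode_time = end - first_token_time
    if n_tokens > 1 and decode_time > 0:
        tps = (n_tokens - 1) / decode_time
    else:
        tps = 0.0
    return ttft_ms, tps, n_tokens


def fake_stream(n_tokens: int, prefill_s: float, per_token_s: float) -> Iterator[str]:
    """Hypothetical stand-in for a model: sleeps instead of computing."""
    time.sleep(prefill_s)  # simulated prefill before the first token
    for i in range(n_tokens):
        if i > 0:
            time.sleep(per_token_s)  # simulated per-token decode time
        yield f"tok{i}"


if __name__ == "__main__":
    ttft_ms, tps, n = measure_stream(fake_stream(5, 0.05, 0.01))
    print(f"tokens={n} TTFT={ttft_ms:.1f} ms TPS={tps:.2f}")
```

To time a real model, replace `fake_stream` with the runtime's own streaming generator. If the runtime performs prefill before it returns the iterator, start the clock before that call instead.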

Measured Data

Model                          Platform  Dtype  Seqlen  Max context  TTFT (ms)  TPS    Memory (GB)
DeepSeek-R1-Distill-Qwen-1.5B  S100P     q8     256     1024         109        27.08  1.7
DeepSeek-R1-Distill-Qwen-1.5B  S100P     q4     256     1024         108        39.49  1.1
DeepSeek-R1-Distill-Qwen-1.5B  S100P     q8     256     4096         226        23.80  1.8
DeepSeek-R1-Distill-Qwen-1.5B  S100P     q4     256     4096         224        32.35  1.2
DeepSeek-R1-Distill-Qwen-7B    S100P     q8     256     1024         544        6.76   7.4
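
To compare quantization levels, the measured rows can be loaded into a small structure and the q4-to-q8 throughput ratio computed per configuration. This is only an illustrative sketch: `Row`, `ROWS`, and `q4_speedup` are hypothetical names, and the values are copied from the measured data above.

```python
from typing import List, NamedTuple


class Row(NamedTuple):
    model: str
    dtype: str
    max_context: int
    ttft_ms: float
    tps: float
    memory_gb: float


# Values copied from the measured data (platform S100P, seqlen 256).
ROWS: List[Row] = [
    Row("DeepSeek-R1-Distill-Qwen-1.5B", "q8", 1024, 109, 27.08, 1.7),
    Row("DeepSeek-R1-Distill-Qwen-1.5B", "q4", 1024, 108, 39.49, 1.1),
    Row("DeepSeek-R1-Distill-Qwen-1.5B", "q8", 4096, 226, 23.80, 1.8),
    Row("DeepSeek-R1-Distill-Qwen-1.5B", "q4", 4096, 224, 32.35, 1.2),
    Row("DeepSeek-R1-Distill-Qwen-7B", "q8", 1024, 544, 6.76, 7.4),
]


def q4_speedup(rows: List[Row], model: str, max_context: int) -> float:
    """Return TPS(q4) / TPS(q8) for one model and max-context setting."""
    by_dtype = {
        r.dtype: r
        for r in rows
        if r.model == model and r.max_context == max_context
    }
    if "q4" not in by_dtype or "q8" not in by_dtype:
        raise KeyError("need both q4 and q8 rows for this configuration")
    return by_dtype["q4"].tps / by_dtype["q8"].tps


if __name__ == "__main__":
    for ctx in (1024, 4096):
        ratio = q4_speedup(ROWS, "DeepSeek-R1-Distill-Qwen-1.5B", ctx)
        print(f"max context {ctx}: q4 is {ratio:.2f}x the q8 TPS")
```

For the 1.5B model this gives a ratio of roughly 1.46 at a max context of 1024 and roughly 1.36 at 4096. The 7B model has only a q8 row, so no ratio can be computed for it.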