Benchmark of InternLM2 Model Performance

Test Conditions

  • Test Board:S100P。

  • Performance Data Acquisition: Test a single prompt and record the metrics of TTFT (Time to First Token) and TPS (Average Tokens Per Second).

  • Python version:Python3.10。

  • Runtime Environment:Linux。

Measured data

modelplatformdtypeseqlenmax contextTTFT(ms)TPSmemory(GB)
InternLM2-1.8BS100Pq8256102413223.831.8