This repository was archived by the owner on Jul 21, 2025. It is now read-only.

Is it normal that A10 inference speed is lower than 2080ti? #523

@qinbo23


Hello, I tested Transformer-base inference speed on different devices. Strangely, the A10 is slower than the 2080ti.

MODEL: Transformer-base
DATA: fp16
SPEED: (number of src characters / second)
3090 7.5k/s
2080 4.5k/s
A10 2.0k/s
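
For anyone trying to reproduce these numbers, here is a minimal sketch of how "src characters / second" could be measured consistently across devices. It is not the script used for the figures above. The `translate` callable is a hypothetical stand-in for whatever inference call is being benchmarked, and the sketch assumes it blocks until the output is back on the host (otherwise a GPU synchronization would be needed before stopping the timer).

```python
import time
from typing import Callable, Sequence


def chars_per_second(
    translate: Callable[[Sequence[str]], object],  # hypothetical inference call
    sentences: Sequence[str],
    warmup: int = 3,
    repeats: int = 10,
) -> float:
    """Return source characters processed per second of wall-clock time."""
    # Warm-up runs are excluded from timing so one-time costs
    # (kernel loading, memory allocation) do not skew the result.
    for _ in range(warmup):
        translate(sentences)

    n_chars = sum(len(s) for s in sentences)

    # Assumes translate() returns only after the work is finished.
    start = time.perf_counter()
    for _ in range(repeats):
        translate(sentences)
    elapsed = time.perf_counter() - start

    return n_chars * repeats / elapsed
```

Running the same input file, batch size, and precision on each card with a helper like this makes the per-device numbers directly comparable.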
