
issue/1041 - feat: use template to replace int64_t in paged_attention_prefill kernel with test pass#1045

Merged
spike-zhu merged 1 commit into main from issue/1041
Mar 4, 2026


spike-zhu commented Mar 3, 2026

Use templates so that `total_kv_lens` and `cu_seqlens` in the paged_attention_prefill kernel support both int32_t and int64_t.
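As a rough illustration of the approach described above, the following host-side C++ sketch templates an index lookup on the integer type and dispatches on a runtime dtype tag. It is not the code from this PR: `IndexDtype`, `find_seq_id`, and `find_seq_id_dispatch` are hypothetical names, and the assumed layout (`cu_seqlens` holding `num_seqs + 1` cumulative lengths starting at 0) is a common convention rather than something confirmed on this page.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical dtype tag; the project's real dtype enum is not shown in this PR.
enum class IndexDtype { I32, I64 };

// Find which sequence a packed token belongs to.
// Assumed layout: cu_seqlens has num_seqs + 1 entries, cu_seqlens[0] == 0, and
// cu_seqlens[i + 1] - cu_seqlens[i] is the length of sequence i.
template <typename Tindex>
std::size_t find_seq_id(const Tindex *cu_seqlens, std::size_t num_seqs, std::size_t token_idx) {
    assert(num_seqs > 0);
    assert(token_idx < static_cast<std::size_t>(cu_seqlens[num_seqs]));
    // Binary search with invariant: cu_seqlens[lo] <= token_idx < cu_seqlens[hi].
    std::size_t lo = 0, hi = num_seqs;
    while (hi - lo > 1) {
        std::size_t mid = lo + (hi - lo) / 2;
        if (static_cast<std::size_t>(cu_seqlens[mid]) <= token_idx) {
            lo = mid;
        } else {
            hi = mid;
        }
    }
    return lo;
}

// Runtime dispatch: one templated body serves both index widths, so an int32_t
// buffer no longer has to be converted to int64_t before the call.
inline std::size_t find_seq_id_dispatch(const void *cu_seqlens, IndexDtype dtype,
                                        std::size_t num_seqs, std::size_t token_idx) {
    switch (dtype) {
    case IndexDtype::I32:
        return find_seq_id(static_cast<const std::int32_t *>(cu_seqlens), num_seqs, token_idx);
    case IndexDtype::I64:
        return find_seq_id(static_cast<const std::int64_t *>(cu_seqlens), num_seqs, token_idx);
    }
    return 0; // not reached for valid dtype values
}
```

The same pattern should carry over to a device kernel by adding the index type as a template parameter and selecting the instantiation at launch time, though the actual kernel signature in this PR may differ.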

Python tests (two screenshots attached in the original PR):

[image]

[image]

spike-zhu requested review from a team and PanZezhong1725 March 3, 2026 09:13
spike-zhu self-assigned this Mar 3, 2026
spike-zhu merged commit 811ffab into main Mar 4, 2026
10 checks passed

Development

Successfully merging this pull request may close these issues.

[DEV] paged attention prefill kernel: support int64_t and int32_t for prefix sums and other input arguments

3 participants