Spaces: Running on Zero
Commit: Feature(MInference): fix the func name
Browse files
Browse files
minference/ops/block_sparse_flash_attention.py
CHANGED
@@ -444,7 +444,7 @@ def test_flash_attention(
 444     print('========================================\n')
 445
 446
 447 -def (                                                  <!-- old function name lost in page extraction -->
 447 +def block_sparse_attention(
 448     query: torch.Tensor, # [BATCH, N_HEADS, N_CTX, D_HEAD]
 449     key: torch.Tensor, # [BATCH, N_HEADS, N_CTX, D_HEAD]
 450     value: torch.Tensor, # [BATCH, N_HEADS, N_CTX, D_HEAD]
minference/ops/pit_sparse_flash_attention_v2.py
CHANGED
@@ -693,7 +693,7 @@ def test_flash_attention(
 693     torch.testing.assert_close(output_flash, output_triton_sparse, atol=1e-2, rtol=0)
 694
 695
 696 -def (                                                  <!-- old function name lost in page extraction -->
 696 +def vertical_slash_sparse_attention(
 697     query: torch.Tensor, # [BATCH, N_HEADS, N_CTX, D_HEAD]
 698     key: torch.Tensor, # [BATCH, N_HEADS, N_CTX, D_HEAD]
 699     value: torch.Tensor, # [BATCH, N_HEADS, N_CTX, D_HEAD]