forked from Dao-AILab/flash-attention
WIP: causal prefix mask with adjusted tests #2
Draft
timt51 wants to merge 24 commits into main from ttruong/causal-prefix-mask
DO NOT LAND ON MAIN BRANCH
Implementation of a causal prefix mask for cross attention (see Dao-AILab#20 (comment) for more info).
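To make the masking scheme concrete, here is a minimal dense-tensor sketch of a causal prefix mask. The exact alignment convention is defined by the kernel changes and the discussion in Dao-AILab#20, so the `prefix_len` parameter and the index comparison below are illustrative assumptions, not the PR's definition.

```python
import torch

def causal_prefix_mask(seqlen_q: int, seqlen_k: int, prefix_len: int) -> torch.Tensor:
    """Boolean (seqlen_q, seqlen_k) mask; True means attention is allowed.

    Assumed semantics: every query may attend to the first `prefix_len` keys
    unconditionally; outside that prefix, the usual causal rule
    (key index <= query index) applies.
    """
    q_idx = torch.arange(seqlen_q).unsqueeze(1)  # (seqlen_q, 1)
    k_idx = torch.arange(seqlen_k).unsqueeze(0)  # (1, seqlen_k)
    return (k_idx < prefix_len) | (k_idx <= q_idx)
```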
The original FlashAttention tests have been partially adjusted to account for the new causal prefix masking scheme. The modified tests should correctly verify that the output of `flash_attn_unpadded_*_func`, `out`, is correct, and that the gradients `dq`, `dk`, and `dv` are correct.

They have not been adjusted to properly test the output `S_dmask` (which contains information about the attention values and dropout), because doing so requires figuring out the format of `S_dmask` (a non-standard layout; see `convert_flash_attn_S_to_softmax` in the test file). This means we cannot be sure (1) whether the returned attention values are correct, and (2) whether causal prefix masking works with dropout. My guess is that it does, assuming one can figure out how the data is formatted, but it hasn't been proven.

There may also be an effect on performance: I've seen the backward pass taking longer in some runs, but it's hard to say for sure.
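For context on what testing `out` and the gradients amounts to, the sketch below shows one way such a check could look: compare the kernel output against a dense PyTorch reference that applies the same mask, then compare gradients from autograd. The shapes, helper names, and tolerances are assumptions for illustration; the actual checks live in the adjusted test file.

```python
import math
import torch

def attention_reference(q, k, v, mask):
    """Dense reference attention (no dropout), used only to check `out` and
    the gradients `dq`, `dk`, `dv` against the fused kernel.

    q: (seqlen_q, nheads, d); k, v: (seqlen_k, nheads, d);
    mask: boolean (seqlen_q, seqlen_k), True = attention allowed.
    """
    scores = torch.einsum("qhd,khd->hqk", q, k) / math.sqrt(q.shape[-1])
    scores = scores.masked_fill(~mask, float("-inf"))
    attn = torch.softmax(scores, dim=-1)
    return torch.einsum("hqk,khd->qhd", attn, v)

# Hypothetical usage, mirroring the structure of the adjusted tests:
#   out_ref = attention_reference(q, k, v, causal_prefix_mask(Sq, Sk, prefix_len))
#   torch.testing.assert_close(out_flash, out_ref, atol=1e-3, rtol=1e-3)
#   g = torch.randn_like(out_ref)
#   dq_ref, dk_ref, dv_ref = torch.autograd.grad(out_ref, (q, k, v), g)
```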