◆ mDataType
Data_type bert::FusedMultiHeadAttentionKernelMetaInfoV2::mDataType |
◆ mS
unsigned int bert::FusedMultiHeadAttentionKernelMetaInfoV2::mS |
◆ mD
unsigned int bert::FusedMultiHeadAttentionKernelMetaInfoV2::mD |
◆ mSM
unsigned int bert::FusedMultiHeadAttentionKernelMetaInfoV2::mSM |
◆ mCubin
const unsigned char* bert::FusedMultiHeadAttentionKernelMetaInfoV2::mCubin |
◆ mCubinSize
unsigned int bert::FusedMultiHeadAttentionKernelMetaInfoV2::mCubinSize |
◆ mFuncName
const char* bert::FusedMultiHeadAttentionKernelMetaInfoV2::mFuncName |
◆ mSharedMemBytes
unsigned int bert::FusedMultiHeadAttentionKernelMetaInfoV2::mSharedMemBytes |
◆ mThreadsPerCTA
unsigned int bert::FusedMultiHeadAttentionKernelMetaInfoV2::mThreadsPerCTA |
◆ mUnrollStep
unsigned int bert::FusedMultiHeadAttentionKernelMetaInfoV2::mUnrollStep |
◆ mInterleaved
bool bert::FusedMultiHeadAttentionKernelMetaInfoV2::mInterleaved |
The documentation for this struct was generated from the following file:
- fused_multihead_attention_v2.h