Struct definition:
/// Predicated tile access iterator descriptor object containing template dependent state
struct PredicatedTileAccessIteratorDesc {
  int element_size_bits;                          // size of one element, in bits
  int advance_rank;                               // rank along which the iterator advances
  layout::PitchLinearCoord threadblock_shape;     // threadblock tile shape
  layout::PitchLinearCoord threadmap_iterations;  // iterations taken from the thread map
  layout::PitchLinearCoord threadmap_delta;       // delta taken from the thread map
};

/// Helper template to construct a PredicatedTileAccessIteratorDesc from
/// template dependent state
template <
  typename Shape, typename Element, typename Layout,
  int AdvanceRank, typename ThreadMap>
struct MakePredicatedTileAccessIteratorDesc;
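PredicatedTileAccessIteratorDesc is the descriptor struct that the CUTLASS library uses for predicated tile access iterators. It is more than a plain collection of parameters: it is a metadata abstraction of a GPU memory access pattern. The template struct MakePredicatedTileAccessIteratorDesc produces a concrete PredicatedTileAccessIteratorDesc from its template arguments. Because those values are fixed at compile time, filling in the descriptor adds no runtime overhead, and the descriptor carries the template-derived information that the later PredicatedTileAccessIteratorParams computation needs.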
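To make the "template arguments in, plain struct out" idea concrete, here is a minimal standalone sketch. It does not include or reproduce CUTLASS: `PitchLinearCoord`, `PitchLinearShape`, `ExampleThreadMap`, `MakeDescSketch` and `example_desc` are simplified, hypothetical stand-ins written for this illustration. The sketch also drops the `Layout` parameter and assumes the element width is `sizeof(Element) * 8`, which does not cover sub-byte element types.

```cpp
// Hypothetical stand-in for layout::PitchLinearCoord: a (contiguous, strided) pair.
struct PitchLinearCoord {
  int contiguous;
  int strided;
};

// Hypothetical compile-time shape, loosely modelled on a pitch-linear tile shape.
template <int Contiguous, int Strided>
struct PitchLinearShape {
  static constexpr int kContiguous = Contiguous;
  static constexpr int kStrided = Strided;
};

// Descriptor with the same five fields as the struct discussed in the text.
struct PredicatedTileAccessIteratorDesc {
  int element_size_bits;
  int advance_rank;
  PitchLinearCoord threadblock_shape;
  PitchLinearCoord threadmap_iterations;
  PitchLinearCoord threadmap_delta;
};

// Hypothetical thread map exposing only the two members the sketch reads.
struct ExampleThreadMap {
  using Iterations = PitchLinearShape<1, 4>;
  using Delta = PitchLinearShape<1, 8>;
};

// Sketch of a "Make...Desc" helper: every field comes from a template argument,
// so the whole descriptor is a compile-time constant.
template <typename Shape, typename Element, int AdvanceRank, typename ThreadMap>
struct MakeDescSketch {
  static constexpr PredicatedTileAccessIteratorDesc get() {
    return PredicatedTileAccessIteratorDesc{
        static_cast<int>(sizeof(Element) * 8),  // assumption: whole-byte elements
        AdvanceRank,
        {Shape::kContiguous, Shape::kStrided},
        {ThreadMap::Iterations::kContiguous, ThreadMap::Iterations::kStrided},
        {ThreadMap::Delta::kContiguous, ThreadMap::Delta::kStrided}};
  }
};

// Example instantiation: a 64x8 tile of float, advancing along rank 1.
constexpr PredicatedTileAccessIteratorDesc example_desc() {
  return MakeDescSketch<PitchLinearShape<64, 8>, float, 1, ExampleThreadMap>::get();
}

// The fields can be checked by the compiler, with no runtime work involved.
static_assert(example_desc().advance_rank == 1, "advance rank comes from the template");
static_assert(example_desc().threadblock_shape.contiguous == 64, "shape comes from the template");
```

The result is an ordinary struct of integers, so non-template code can accept it as a normal argument, while the values themselves are still decided by the template instantiation.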




