`gpu` dialect.
Structs
- An `all_reduce` operation. Reduce values among workgroup.
- A builder for an `all_reduce` operation.
- An `alloc` operation. GPU memory allocation operation.
- A builder for an `alloc` operation.
- A `barrier` operation. Synchronizes all work items of a workgroup.
- A builder for a `barrier` operation.
- A `block_dim` operation.
- A builder for a `block_dim` operation.
- A `block_id` operation.
- A builder for a `block_id` operation.
- A `create_2to4_spmat` operation. Create sparse matrix with 2:4 sparsity operation.
- A builder for a `create_2to4_spmat` operation.
- A `create_coo_aos` operation. Create sparse matrix in COO format operation (AoS).
- A builder for a `create_coo_aos` operation.
- A `create_coo` operation. Create sparse matrix in COO format operation.
- A builder for a `create_coo` operation.
- A `create_csr` operation. Create sparse matrix in CSR format operation.
- A builder for a `create_csr` operation.
- A `create_dn_tensor` operation. Create dense tensor operation.
- A builder for a `create_dn_tensor` operation.
- A `dealloc` operation. GPU memory deallocation operation.
- A builder for a `dealloc` operation.
- A `destroy_dn_tensor` operation. Destroy dense tensor operation.
- A builder for a `destroy_dn_tensor` operation.
- A `destroy_sp_mat` operation. Destroy sparse matrix operation.
- A builder for a `destroy_sp_mat` operation.
- A `func` operation. Function executable on a GPU.
- A builder for a `func` operation.
- A `module` operation. A top-level compilation unit containing code to be run on a GPU.
- A builder for a `module` operation.
- A `global_id` operation.
- A builder for a `global_id` operation.
- A `grid_dim` operation.
- A builder for a `grid_dim` operation.
- A `host_register` operation. Registers a memref for access from device.
- A builder for a `host_register` operation.
- A `host_unregister` operation. Unregisters a memref for access from device.
- A builder for a `host_unregister` operation.
- A `lane_id` operation.
- A builder for a `lane_id` operation.
- A `launch_func` operation. Launches a function as a GPU kernel.
- A builder for a `launch_func` operation.
- A `launch` operation. GPU kernel launch operation.
- A builder for a `launch` operation.
- A `memcpy` operation. GPU memcpy operation.
- A builder for a `memcpy` operation.
- A `memset` operation. GPU memset operation.
- A builder for a `memset` operation.
- A `module_end` operation. A pseudo op that marks the end of a `gpu.module`.
- A builder for a `module_end` operation.
- A `num_subgroups` operation.
- A builder for a `num_subgroups` operation.
- A `printf` operation. Device-side printf, as in CUDA or OpenCL, for debugging.
- A builder for a `printf` operation.
- A `return` operation. Terminator for GPU functions.
- A builder for a `return` operation.
- A `sddmm_buffer_size` operation. Precompute buffer size for SDDMM operation.
- A builder for a `sddmm_buffer_size` operation.
- A `sddmm` operation. SDDMM operation.
- A builder for a `sddmm` operation.
- A `set_default_device` operation. Set default GPU for operations after this by index.
- A builder for a `set_default_device` operation.
- A `shuffle` operation. Shuffles values within a subgroup.
- A builder for a `shuffle` operation.
- A `spmm_buffer_size` operation. Precompute buffer size for SpMM operation.
- A builder for a `spmm_buffer_size` operation.
- A `spmm` operation. SpMM operation.
- A builder for a `spmm` operation.
- A `spmv_buffer_size` operation. Precompute buffer size for SpMV operation.
- A builder for a `spmv_buffer_size` operation.
- A `spmv` operation. SpMV operation.
- A builder for a `spmv` operation.
- A `subgroup_id` operation.
- A builder for a `subgroup_id` operation.
- A `subgroup_mma_compute` operation. GPU warp synchronous matrix multiply accumulate.
- A builder for a `subgroup_mma_compute` operation.
- A `subgroup_mma_constant_matrix` operation. GPU warp synchronous constant matrix.
- A builder for a `subgroup_mma_constant_matrix` operation.
- A `subgroup_mma_elementwise` operation. GPU warp elementwise operation on a matrix.
- A builder for a `subgroup_mma_elementwise` operation.
- A `subgroup_mma_load_matrix` operation. GPU warp synchronous matrix load.
- A builder for a `subgroup_mma_load_matrix` operation.
- A `subgroup_mma_store_matrix` operation. GPU warp synchronous matrix store.
- A builder for a `subgroup_mma_store_matrix` operation.
- A `subgroup_reduce` operation. Reduce values among subgroup.
- A builder for a `subgroup_reduce` operation.
- A `subgroup_size` operation.
- A builder for a `subgroup_size` operation.
- A `terminator` operation. Terminator for GPU launch regions.
- A builder for a `terminator` operation.
- A `thread_id` operation.
- A builder for a `thread_id` operation.
- A `wait` operation. Wait for async GPU ops to complete.
- A builder for a `wait` operation.
- A `yield` operation. GPU yield operation.
- A builder for a `yield` operation.
Functions
- Creates an `all_reduce` operation.
- Creates an `alloc` operation.
- Creates a `barrier` operation.
- Creates a `block_dim` operation.
- Creates a `block_id` operation.
- Creates a `create_2to4_spmat` operation.
- Creates a `create_coo` operation.
- Creates a `create_coo_aos` operation.
- Creates a `create_csr` operation.
- Creates a `create_dn_tensor` operation.
- Creates a `dealloc` operation.
- Creates a `destroy_dn_tensor` operation.
- Creates a `destroy_sp_mat` operation.
- Creates a `func` operation.
- Creates a `global_id` operation.
- Creates a `grid_dim` operation.
- Creates a `host_register` operation.
- Creates a `host_unregister` operation.
- Creates a `lane_id` operation.
- Creates a `launch` operation.
- Creates a `launch_func` operation.
- Creates a `memcpy` operation.
- Creates a `memset` operation.
- Creates a `module` operation.
- Creates a `module_end` operation.
- Creates a `num_subgroups` operation.
- Creates a `printf` operation.
- Creates a `return` operation.
- Creates a `sddmm` operation.
- Creates a `sddmm_buffer_size` operation.
- Creates a `set_default_device` operation.
- Creates a `shuffle` operation.
- Creates a `spmm` operation.
- Creates a `spmm_buffer_size` operation.
- Creates a `spmv` operation.
- Creates a `spmv_buffer_size` operation.
- Creates a `subgroup_id` operation.
- Creates a `subgroup_mma_compute` operation.
- Creates a `subgroup_mma_constant_matrix` operation.
- Creates a `subgroup_mma_elementwise` operation.
- Creates a `subgroup_mma_load_matrix` operation.
- Creates a `subgroup_mma_store_matrix` operation.
- Creates a `subgroup_reduce` operation.
- Creates a `subgroup_size` operation.
- Creates a `terminator` operation.
- Creates a `thread_id` operation.
- Creates a `wait` operation.
- Creates a `yield` operation.
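
The listing above pairs every operation with a builder struct and a free function that creates it. As a rough illustration of how such a trio commonly fits together, here is a self-contained sketch. Every name in it (`Operation`, `MemcpyOperation`, `MemcpyOperationBuilder`, the `dst` and `src` setters, and the signature of `memcpy`) is a hypothetical stand-in chosen for this example and is not taken from this module; the real types, fields, and signatures may differ, so check the individual item pages.

```rust
/// Toy stand-in for a generic IR operation (hypothetical, not this crate's type).
#[derive(Debug, Clone, PartialEq)]
pub struct Operation {
    pub name: String,
    pub operands: Vec<String>,
}

/// Typed wrapper playing the role of "a `memcpy` operation".
#[derive(Debug, Clone, PartialEq)]
pub struct MemcpyOperation(Operation);

impl MemcpyOperation {
    /// Entry point to the builder.
    pub fn builder() -> MemcpyOperationBuilder {
        MemcpyOperationBuilder::default()
    }

    /// Borrow the underlying generic operation.
    pub fn as_operation(&self) -> &Operation {
        &self.0
    }
}

/// Plays the role of "a builder for a `memcpy` operation".
/// The `dst` and `src` fields are assumptions made for this sketch.
#[derive(Debug, Default)]
pub struct MemcpyOperationBuilder {
    dst: Option<String>,
    src: Option<String>,
}

impl MemcpyOperationBuilder {
    /// Set the (assumed) destination operand.
    pub fn dst(mut self, value: &str) -> Self {
        self.dst = Some(value.to_string());
        self
    }

    /// Set the (assumed) source operand.
    pub fn src(mut self, value: &str) -> Self {
        self.src = Some(value.to_string());
        self
    }

    /// Assemble the operation, failing if a required operand is missing.
    pub fn build(self) -> Result<MemcpyOperation, String> {
        let dst = self.dst.ok_or_else(|| "missing dst operand".to_string())?;
        let src = self.src.ok_or_else(|| "missing src operand".to_string())?;
        Ok(MemcpyOperation(Operation {
            name: "gpu.memcpy".to_string(),
            operands: vec![dst, src],
        }))
    }
}

/// Plays the role of the free function that "creates a `memcpy` operation":
/// a shortcut that drives the builder with all required operands.
pub fn memcpy(dst: &str, src: &str) -> MemcpyOperation {
    MemcpyOperation::builder()
        .dst(dst)
        .src(src)
        .build()
        .expect("both operands were supplied")
}
```

In this sketch, `memcpy("%dst", "%src")` and the equivalent builder chain produce the same value. The builder form suits operations with many optional parts, while the free function suits the common case where every required operand is already at hand.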