cuda编程之atomicAdd

最新推荐文章于 2025-06-27 23:07:12 发布

原创最新推荐文章于 2025-06-27 23:07:12 发布 · 1.3w 阅读

6 ·

CC 4.0 BY-SA版权

文章标签：

#cuda

CUDA编程专栏收录该内容

4 篇文章

订阅专栏

本文详细介绍了CUDA中的原子操作atomicAdd()函数。该函数能够读取32位或64位整数或浮点数，计算旧值加上输入值，并将结果存回同一内存地址。这些操作在一个原子交易中完成，返回旧值。32位浮点数版本的atomicAdd()仅支持计算能力为2.x及更高版本的设备；64位浮点数版本则要求计算能力至少为6.x。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

以下内容来源于cuda c programming guide

注意：函数运算完之后，会将运算结果保存在第一个参数指针指定的空间位置，返回的是old值。

B.12.1.1. atomicAdd()

int atomicAdd(int* address, int val);
unsigned int atomicAdd(unsigned int* address,

                       unsigned int val);
unsigned long long int atomicAdd(unsigned long long int* address,

                                 unsigned long long int val);
float atomicAdd(float* address, float val);

double atomicAdd(double* address, double val);

reads the 32-bit or 64-bit word old located at the address address in global or shared memory, computes (old + val), and stores the result back to memory at the same address. These three operations are performed in one atomic transaction. The function returns old.

The 32-bit floating-point version of atomicAdd() is only supported by devices ofcompute capability 2.x and higher.

The 64-bit floating-point version of atomicAdd() is only supported by devices ofcompute capability 6.x and higher.