Edge Computing ~ The Brain of the AI Box ~ GPU
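- TFLOPS: tera floating-point operations per second. 1 TFLOPS is one trillion (10^12) floating-point operations per second. (The F stands for floating-point.)
- TOPS: tera operations per second. 1 TOPS is one trillion (10^12) operations per second; the figure is typically quoted for integer formats such as INT8.
- FP: denotes floating-point data formats, including double precision (FP64), single precision (FP32), half precision (FP16), and FP8. INT denotes integer formats, including INT8 and INT4. The trailing number is the bit width: in general, more bits mean higher precision, which lets the format support more numerically demanding computation and a wider range of workloads.
- FP32: also called float32; the two names are fully equivalent. The full name is single-precision floating-point.
- BF16: also called BFLOAT16 (the most common name). The full name is brain floating point; it uses 16 binary bits and was developed by Google Brain.
- FP16: also called float16. The full name is half-precision floating-point.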
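
To make the definitions above concrete, here is a minimal Python sketch using only the standard library. The helper names (`round_to_fp32`, `round_to_fp16`, `round_to_bf16`, `ideal_seconds`) are hypothetical and chosen for illustration. The BF16 conversion assumes the usual layout (the top 16 bits of an FP32 value) with round-to-nearest-even and does not treat NaN specially. `ideal_seconds` assumes the hardware sustains its peak rating, which real workloads rarely do.

```python
import struct


def fp32_bits(x: float) -> int:
    """Return the 32-bit IEEE 754 single-precision bit pattern of x."""
    return struct.unpack("<I", struct.pack("<f", x))[0]


def round_to_fp32(x: float) -> float:
    """Round a Python float (FP64) to the nearest FP32 value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def round_to_fp16(x: float) -> float:
    """Round a Python float to the nearest FP16 value (struct format 'e')."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def round_to_bf16(x: float) -> float:
    """Round to BF16 by keeping the top 16 bits of the FP32 pattern.

    Assumption: round-to-nearest-even on the discarded low 16 bits;
    NaN inputs are not handled specially in this sketch.
    """
    bits = fp32_bits(x)
    bias = 0x7FFF + ((bits >> 16) & 1)  # ties go to the even neighbour
    bits = (((bits + bias) >> 16) << 16) & 0xFFFFFFFF
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def ideal_seconds(num_ops: float, peak_tera_ops: float) -> float:
    """Lower bound on runtime: operation count / (peak rating in TFLOPS or TOPS)."""
    return num_ops / (peak_tera_ops * 1e12)


if __name__ == "__main__":
    x = 1 / 3
    print("FP32:", round_to_fp32(x))
    print("FP16:", round_to_fp16(x))  # 0.333251953125 (1365/4096)
    print("BF16:", round_to_bf16(x))  # 0.333984375 (171/512)
    # BF16 keeps the FP32 exponent range but has a coarser mantissa.
    print("BF16(70000):", round_to_bf16(70000.0))  # 70144.0
    print("2e12 ops at 1 TFLOPS:", ideal_seconds(2e12, 1.0), "s")
```

FP16 and BF16 are both 16 bits wide but spend them differently: FP16 has more mantissa bits, so it lands closer to 1/3, while BF16 keeps the 8-bit exponent of FP32 and can therefore represent 70000, which is beyond the largest finite FP16 value (65504).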
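NVIDIA has long dominated the high-end GPU market, with a market share that at one point exceeded 90%. Chinese domestic companies still have a long way to go before they can break the dominance of NVIDIA and other foreign vendors.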
- T4: https://www.nvidia.com/en-us/data-center/tesla-t4/
- A10: https://www.nvidia.com/en-us/data-center/products/a10-gpu/
- A30: https://www.nvidia.com/en-us/data-center/products/a30-gpu/
- A100: https://www.nvidia.com/en-us/data-center/a100
- H100: https://www.nvidia.com/en-us/data-center/h100/
- Huawei Ascend-910B: (404; see HUAWEI Ascend)
- 310: https://www.hisilicon.com/cn/products/Ascend/Ascend-310
- 910 paper: Ascend: a Scalable and Unified Architecture for Ubiquitous Deep Neural Network Computing, HPCA, 2021: https://ieeexplore.ieee.org/abstract/document/9407221
