Mali Offline Compiler 的 FMA CVT等GPU Shader结果参数说明

我是不会看文档的,这辈子都不可能看文档

曾经很不喜欢看文档,更不喜欢看教学视频

原因可能有如下

太多框架可学习

太多团队可管理

太多游戏可玩

太多BUG要跟进

太多经典大神作品

总之,编程的事情好玩可多了去了,不比看文档实际?

直到我需要用Mali做gpu shader分析

 

基于Mali Shader Offline C

我需要知道A FMA CVT sfu

百度也百度不到任何资料

由此可推断,估计是官方的简写,非通用,大概率是官方的资料

所以,只能看官方文档

 

 

文档还是挺精彩的,不看可惜了

意外发现,官方资料还很全(都很有用,可惜人生 苦短,进了librery,还是不能尽阅)

Mali Offline Compiler 的 FMA CVT等GPU Shader结果参数说明

直接跳去第二章节

Mali Offline Compiler 的 FMA CVT等GPU Shader结果参数说明

最终文档里找到了想要的参数说明

3.3 Mali Valhall architecture
    Mali Valhall GPU shader cores have six parallel pipeline classes, comprising three arithmetic pipelines
and three fixed-function support pipelines.
    All Valhall GPUs implement two parallel processing engines, each containing their own set of arithmetic
pipelines. Data presented in the tool is normalized based on the number of engines in the design, to give
an overall cost for the targeted shader core, not just for a single engine.
Arithmetic fused multiply accumulate unit (FMA)
The FMA pipelines are the main arithmetic pipelines, implementing the floating-point
multipliers that are widely used in shader code. Each FMA pipeline implements a 16-wide warp,
and can issue a single 32-bit operation or two 16-bit operations per thread and per clock cycle.
Most programs that are arithmetic-limited are limited by the performance of the FMA pipeline.
Arithmetic convert unit (CVT)
The CVT pipelines implement simple operations, such as format conversion and integer
addition. Each FMA pipeline implements a 16-wide warp, and can issue a single 32-bit
operation or two 16-bit operations per thread and per clock cycle.
Arithmetic special functions unit (SFU)
The SFU pipelines implement a special functions unit for computation of complex functions
such as reciprocals and transcendental functions. Each SFU pipeline implements a 4-wide issue
path, executing a 16-wide warp over 4 clock cycles.
Load/store unit (LS)
The load/store pipeline handles all non-texture memory access, including buffer access, image
access, and atomic operations.
Varying unit (V)
The varying pipeline is a dedicated pipeline which implements the varying interpolator.
Texture unit (T)
The texture pipeline handles all texture sampling and filtering operations.

看文档还是不看?

现在的我的答案是,看文档还是必须的,但是视频还是千万不要看