TFMOD

指令示意图

TFMOD tile operation

简介

两个 Tile 的逐元素余数,余数符号与被除数相同。

数学语义

对每个元素 (i, j) 在有效区域内:

\[\mathrm{dst}_{i,j} = \mathrm{fmod}(\mathrm{src0}_{i,j}, \mathrm{src1}_{i,j})\]

汇编语法

PTO-AS 形式:参见 PTO-AS Specification.

同步形式:

%dst = tfmod %src0, %src1 : !pto.tile<...>

AS Level 1 (SSA)

%dst = pto.tfmod %src0, %src1 : !pto.tile<...>

AS Level 2 (DPS)

pto.tfmod ins(%src0, %src1 : !pto.tile_buf<...>) outs(%dst : !pto.tile_buf<...>)

AS Level 1(SSA)

%dst = pto.tfmod %src0, %src1 : !pto.tile<...>

AS Level 2(DPS)

pto.tfmod ins(%src0, %src1 : !pto.tile_buf<...>) outs(%dst : !pto.tile_buf<...>)

C++ 内建接口

声明于 include/pto/common/pto_instr.hpp:

template <typename TileDataDst, typename TileDataSrc0, typename TileDataSrc1, typename... WaitEvents>
PTO_INST RecordEvent TFMOD(TileDataDst &dst, TileDataSrc0 &src0, TileDataSrc1 &src1, WaitEvents &... events);

约束

  • The op iterates over dst.GetValidRow() / dst.GetValidCol().
  • Division-by-zero behavior is target-defined; the CPU simulator asserts in debug builds.

示例

#include <pto/pto-inst.hpp>

using namespace pto;

void example() {
  using TileT = Tile<TileType::Vec, int32_t, 16, 16>;
  TileT out, a, b;
  TFMOD(out, a, b);
}