The TensorRT API version 1 namespace. More...
Namespaces | |
anonymous_namespace{NvInfer.h} | |
anonymous_namespace{NvInferRuntime.h} | |
plugin | |
utility | |
utils | |
Classes | |
class | CUDADriverWrapper |
class | Dims |
Structure to define the dimensions of a tensor. More... | |
class | Dims2 |
Descriptor for two-dimensional data. More... | |
class | Dims3 |
Descriptor for three-dimensional data. More... | |
class | Dims4 |
Descriptor for four-dimensional data. More... | |
class | DimsExprs |
class | DimsHW |
Descriptor for two-dimensional spatial data. More... | |
class | DynamicPluginTensorDesc |
class | IActivationLayer |
An Activation layer in a network definition. More... | |
class | IAlgorithm |
Describes a variation of execution of a layer. An algorithm is represented by IAlgorithmVariant and the IAlgorithmIOInfo for each of its inputs and outputs. An algorithm can be selected or reproduced using AlgorithmSelector::selectAlgorithms().". More... | |
class | IAlgorithmContext |
Describes the context and requirements, that could be fulfilled by one or more instances of IAlgorithm. More... | |
class | IAlgorithmIOInfo |
Carries information about input or output of the algorithm. IAlgorithmIOInfo for all the input and output along with IAlgorithmVariant denotes the variation of algorithm and can be used to select or reproduce an algorithm using IAlgorithmSelector::selectAlgorithms(). More... | |
class | IAlgorithmSelector |
Interface implemented by application for selecting and reporting algorithms of a layer provided by the builder. More... | |
class | IAlgorithmVariant |
provides a unique 128-bit identifier, which along with the input and output information denotes the variation of algorithm and can be used to select or reproduce an algorithm, using IAlgorithmSelector::selectAlgorithms() More... | |
class | IBuilder |
Builds an engine from a network definition. More... | |
class | IBuilderConfig |
Holds properties for configuring a builder to produce an engine. More... | |
class | IConcatenationLayer |
A concatenation layer in a network definition. More... | |
class | IConstantLayer |
Layer that represents a constant value. More... | |
class | IConvolutionLayer |
A convolution layer in a network definition. More... | |
class | ICudaEngine |
An engine for executing inference on a built network, with functionally unsafe features. More... | |
class | IDeconvolutionLayer |
A deconvolution layer in a network definition. More... | |
class | IDimensionExpr |
class | IElementWiseLayer |
A elementwise layer in a network definition. More... | |
class | IErrorRecorder |
Reference counted application-implemented error reporting interface for TensorRT objects. More... | |
class | IExecutionContext |
Context for executing inference using an engine, with functionally unsafe features. More... | |
class | IExprBuilder |
class | IFillLayer |
Generate an output tensor with specified mode. More... | |
class | IFullyConnectedLayer |
A fully connected layer in a network definition. This layer expects an input tensor of three or more non-batch dimensions. The input is automatically reshaped into an MxV tensor X , where V is a product of the last three dimensions and M is a product of the remaining dimensions (where the product over 0 dimensions is defined as 1). For example: More... | |
class | IGatherLayer |
class | IGpuAllocator |
Application-implemented class for controlling allocation on the GPU. More... | |
class | IHostMemory |
Class to handle library allocated memory that is accessible to the user. More... | |
class | IIdentityLayer |
A layer that represents the identity function. More... | |
class | IInt8Calibrator |
Application-implemented interface for calibration. More... | |
class | IInt8EntropyCalibrator |
Entropy calibrator. More... | |
class | IInt8EntropyCalibrator2 |
Entropy calibrator 2. More... | |
class | IInt8LegacyCalibrator |
Legacy calibrator left for backward compatibility with TensorRT 2.0. More... | |
class | IInt8MinMaxCalibrator |
MinMax Calibrator. More... | |
class | IIteratorLayer |
class | ILayer |
Base class for all layer classes in a network definition. More... | |
class | ILogger |
Application-implemented logging interface for the builder, engine and runtime. More... | |
class | ILoop |
Helper for creating a recurrent subgraph. More... | |
class | ILoopBoundaryLayer |
class | ILoopOutputLayer |
An ILoopOutputLayer is the sole way to get output from a loop. More... | |
class | ILRNLayer |
A LRN layer in a network definition. More... | |
class | IMatrixMultiplyLayer |
Layer that represents a Matrix Multiplication. More... | |
class | INetworkDefinition |
A network definition for input to the builder. More... | |
class | IOptimizationProfile |
Optimization profile for dynamic input dimensions and shape tensors. More... | |
class | IPaddingLayer |
Layer that represents a padding operation. More... | |
class | IParametricReLULayer |
Layer that represents a parametric ReLU operation. More... | |
class | IPlugin |
Plugin class for user-implemented layers. More... | |
class | IPluginCreator |
Plugin creator class for user implemented layers. More... | |
class | IPluginExt |
Plugin class for user-implemented layers. More... | |
class | IPluginFactory |
Plugin factory for deserialization. More... | |
class | IPluginRegistry |
Single registration point for all plugins in an application. It is used to find plugin implementations during engine deserialization. Internally, the plugin registry is considered to be a singleton so all plugins in an application are part of the same global registry. Note that the plugin registry is only supported for plugins of type IPluginV2 and should also have a corresponding IPluginCreator implementation. More... | |
class | IPluginV2 |
Plugin class for user-implemented layers. More... | |
class | IPluginV2DynamicExt |
class | IPluginV2Ext |
Plugin class for user-implemented layers. More... | |
class | IPluginV2IOExt |
Plugin class for user-implemented layers. More... | |
class | IPluginV2Layer |
Layer type for pluginV2. More... | |
class | IPoolingLayer |
A Pooling layer in a network definition. More... | |
class | IProfiler |
Application-implemented interface for profiling. More... | |
class | IRaggedSoftMaxLayer |
A RaggedSoftmax layer in a network definition. More... | |
class | IRecurrenceLayer |
class | IReduceLayer |
Layer that represents a reduction operator across Shape, Int32, Float, and Half tensors. More... | |
class | IRefitter |
Updates weights in an engine. More... | |
class | IResizeLayer |
A resize layer in a network definition. More... | |
class | IRuntime |
Allows a serialized functionally unsafe engine to be deserialized. More... | |
class | IScaleLayer |
A Scale layer in a network definition. More... | |
class | ISelectLayer |
class | IShapeLayer |
Layer type for getting shape of a tensor. More... | |
class | IShuffleLayer |
Layer type for shuffling data. More... | |
class | ISliceLayer |
Slices an input tensor into an output tensor based on the offset and strides. More... | |
class | ISoftMaxLayer |
A Softmax layer in a network definition. More... | |
class | ITensor |
A tensor in a network definition. More... | |
class | ITopKLayer |
Layer that represents a TopK reduction. More... | |
class | ITripLimitLayer |
class | IUnaryLayer |
Layer that represents an unary operation. More... | |
struct | Permutation |
class | PluginField |
Structure containing plugin attribute field names and associated data This information can be parsed to decode necessary plugin metadata. More... | |
struct | PluginFieldCollection |
class | PluginRegistrar |
Register the plugin creator to the registry The static registry object will be instantiated when the plugin library is loaded. More... | |
struct | PluginTensorDesc |
Fields that a plugin might see for an input or output. More... | |
class | Weights |
An array of weights used as a layer parameter. More... | |
Typedefs | |
typedef uint32_t | QuantizationFlags |
Represents a collection of one or more QuantizationFlag values using binary OR operations. More... | |
typedef uint32_t | BuilderFlags |
Represents a collection of one or more QuantizationFlag values using binary OR operations, e.g., 1U << BuilderFlag::kFP16 | 1U << BuilderFlag::kDEBUG. More... | |
using | TacticSources = uint32_t |
Represents a collection of one or more TacticSource values combine using bitwise-OR operations. More... | |
typedef uint32_t | NetworkDefinitionCreationFlags |
This bitset is capable of representing one or more NetworkDefinitionCreationFlag flags constructed with binary OR operations. e.g., 1U << NetworkDefinitionCreationFlag::kEXPLICIT_BATCH. More... | |
typedef uint32_t | TensorFormats |
It is capable of representing one or more TensorFormat by binary OR operations, e.g., 1U << TensorFormats::kCHW4 | 1U << TensorFormats::kCHW32. More... | |
using | PluginFormat = TensorFormat |
PluginFormat is reserved for backward compatibility. More... | |
Functions | |
template<> | |
constexpr int32_t | EnumMax< LayerType > () |
Maximum number of elements in LayerType enum. More... | |
template<> | |
constexpr int32_t | EnumMax< PaddingMode > () |
Maximum number of elements in PaddingMode enum. More... | |
template<> | |
constexpr int32_t | EnumMax< PoolingType > () |
Maximum number of elements in PoolingType enum. More... | |
template<> | |
constexpr int32_t | EnumMax< ScaleMode > () |
Maximum number of elements in ScaleMode enum. More... | |
template<> | |
constexpr int32_t | EnumMax< ElementWiseOperation > () |
Maximum number of elements in ElementWiseOperation enum. More... | |
template<> | |
constexpr int32_t | EnumMax< RNNOperation > () |
Maximum number of elements in RNNOperation enum. More... | |
template<> | |
constexpr int32_t | EnumMax< RNNDirection > () |
Maximum number of elements in RNNDirection enum. More... | |
template<> | |
constexpr int32_t | EnumMax< RNNInputMode > () |
Maximum number of elements in RNNInputMode enum. More... | |
template<> | |
constexpr int32_t | EnumMax< RNNGateType > () |
Maximum number of elements in RNNGateType enum. More... | |
class | __attribute__ ((deprecated)) IOutputDimensionsFormula |
template<> | |
constexpr int32_t | EnumMax< UnaryOperation > () |
Maximum number of elements in UnaryOperation enum. More... | |
template<> | |
constexpr int32_t | EnumMax< ReduceOperation > () |
Maximum number of elements in ReduceOperation enum. More... | |
template<> | |
constexpr int32_t | EnumMax< SliceMode > () |
Maximum number of elements in SliceMode enum. More... | |
template<> | |
constexpr int32_t | EnumMax< TopKOperation > () |
Maximum number of elements in TopKOperation enum. More... | |
template<> | |
constexpr int32_t | EnumMax< MatrixOperation > () |
Maximum number of elements in MatrixOperation enum. More... | |
template<> | |
constexpr int32_t | EnumMax< ResizeMode > () |
Maximum number of elements in ResizeMode enum. More... | |
template<> | |
constexpr int32_t | EnumMax< LoopOutput > () |
Maximum number of elements in LoopOutput enum. More... | |
template<> | |
constexpr int32_t | EnumMax< TripLimit > () |
Maximum number of elements in TripLimit enum. More... | |
template<> | |
constexpr int32_t | EnumMax< FillOperation > () |
Maximum number of elements in FillOperation enum. More... | |
template<> | |
constexpr int32_t | EnumMax< CalibrationAlgoType > () |
Maximum number of elements in CalibrationAlgoType enum. More... | |
template<> | |
constexpr int32_t | EnumMax< QuantizationFlag > () |
Maximum number of quantization flags in QuantizationFlag enum. More... | |
template<> | |
constexpr int32_t | EnumMax< BuilderFlag > () |
Maximum number of builder flags in BuilderFlag enum. More... | |
template<> | |
constexpr int32_t | EnumMax< ProfilingVerbosity > () |
Maximum number of profile verbosity levels in ProfilingVerbosity enum. More... | |
template<> | |
constexpr int32_t | EnumMax< TacticSource > () |
Maximum number of tactic sources in TacticSource enum. More... | |
template<> | |
constexpr int32_t | EnumMax< NetworkDefinitionCreationFlag > () |
Maximum number of elements in NetworkDefinitionCreationFlag enum. More... | |
template<> | |
constexpr int32_t | EnumMax< PluginType > () |
Maximum number of elements in PluginType enum. More... | |
template<> | |
constexpr int32_t | EnumMax< EngineCapability > () |
Maximum number of elements in EngineCapability enum. More... | |
template<> | |
constexpr int32_t | EnumMax< DimensionOperation > () |
Maximum number of elements in DimensionOperation enum. More... | |
template<> | |
constexpr int32_t | EnumMax< WeightsRole > () |
Maximum number of elements in WeightsRole enum. More... | |
template<> | |
constexpr int32_t | EnumMax< DeviceType > () |
Maximum number of elements in DeviceType enum. More... | |
template<> | |
constexpr int32_t | EnumMax< OptProfileSelector > () |
template<typename T > | |
constexpr int32_t | EnumMax () |
Forward declare IGpuAllocator for use in other interfaces. More... | |
template<> | |
constexpr int32_t | EnumMax< ActivationType > () |
Maximum number of elements in ActivationType enum. More... | |
template<> | |
constexpr int32_t | EnumMax< DataType > () |
Maximum number of elements in DataType enum. More... | |
template<> | |
constexpr int32_t | EnumMax< DimensionType > () |
Maximum number of elements in DimensionType enum. More... | |
template<> | |
constexpr int32_t | EnumMax< TensorFormat > () |
Maximum number of elements in TensorFormat enum. More... | |
template<> | |
constexpr int32_t | EnumMax< TensorLocation > () |
Maximum number of elements in TensorLocation enum. More... | |
template<> | |
constexpr int32_t | EnumMax< ILogger::Severity > () |
Maximum number of elements in ILogger::Severity enum. More... | |
template<> | |
constexpr int32_t | EnumMax< ErrorCode > () |
Maximum number of elements in ErrorCode enum. More... | |
void | cuErrCheck_ (CUresult stat, const CUDADriverWrapper &wrap, const char *file, int line) |
The TensorRT API version 1 namespace.
typedef uint32_t nvinfer1::QuantizationFlags |
Represents a collection of one or more QuantizationFlag values using binary OR operations.
typedef uint32_t nvinfer1::BuilderFlags |
Represents a collection of one or more QuantizationFlag values using binary OR operations, e.g., 1U << BuilderFlag::kFP16 | 1U << BuilderFlag::kDEBUG.
using nvinfer1::TacticSources = typedef uint32_t |
Represents a collection of one or more TacticSource values combine using bitwise-OR operations.
This bitset is capable of representing one or more NetworkDefinitionCreationFlag flags constructed with binary OR operations. e.g., 1U << NetworkDefinitionCreationFlag::kEXPLICIT_BATCH.
typedef uint32_t nvinfer1::TensorFormats |
It is capable of representing one or more TensorFormat by binary OR operations, e.g., 1U << TensorFormats::kCHW4 | 1U << TensorFormats::kCHW32.
using nvinfer1::PluginFormat = typedef TensorFormat |
PluginFormat is reserved for backward compatibility.
|
strong |
The type values of layer classes.
|
strong |
Enumerates the modes of padding to perform in convolution, deconvolution and pooling layer, padding mode takes precedence if setPaddingMode() and setPrePadding() are also used.
There are three padding styles, EXPLICIT, SAME, and CAFFE, with each style having two variants. The EXPLICIT and CAFFE styles determine if the final sampling location is used or not. The SAME style determine if the asymmetry in the padding is on the pre or post padding.
Formulas for Convolution:
Formulas for Deconvolution:
Formulas for Pooling:
Pooling Example 1:
Pooling Example 2:
The sample points are {0, 2, 4, 6, 8} in each dimension.
CAFFE_ROUND_DOWN and CAFFE_ROUND_UP have two restrictions each on usage with pooling operations. This will cause getDimensions to return an empty dimension and also to reject the network at validation time.
For more information on original reference code, see https://github.com/BVLC/caffe/blob/master/src/caffe/layers/pooling_layer.cpp
|
strong |
|
strong |
Controls how shift, scale and power are applied in a Scale layer.
Enumerator | |
---|---|
kUNIFORM | Identical coefficients across all elements of the tensor. |
kCHANNEL | Per-channel coefficients. |
kELEMENTWISE | Elementwise coefficients. |
|
strong |
Enumerates the binary operations that may be performed by an ElementWise layer.
|
strong |
Enumerates the RNN operations that may be performed by an RNN layer.
Equation definitions
In the equations below, we use the following naming convention:
Equations
Depending on the value of RNNOperation chosen, each sub-layer of the RNN layer will perform one of the following operations:
|
strong |
Enumerates the RNN direction that may be performed by an RNN layer.
Enumerator | |
---|---|
kUNIDIRECTION | Network iterations from first input to last input. |
kBIDIRECTION | Network iterates from first to last and vice versa and outputs concatenated. |
|
strong |
Enumerates the RNN input modes that may occur with an RNN layer.
If the RNN is configured with RNNInputMode::kLINEAR, then for each gate g
in the first layer of the RNN, the input vector X[t]
(length E
) is left-multiplied by the gate's corresponding weight matrix W[g]
(dimensions HxE
) as usual, before being used to compute the gate output as described by RNNOperation.
If the RNN is configured with RNNInputMode::kSKIP, then this initial matrix multiplication is "skipped" and W[g]
is conceptually an identity matrix. In this case, the input vector X[t]
must have length H
(the size of the hidden state).
Enumerator | |
---|---|
kLINEAR | Perform the normal matrix multiplication in the first recurrent layer. |
kSKIP | No operation is performed on the first recurrent layer. |
|
strong |
Identifies an individual gate within an RNN cell.
Enumerator | |
---|---|
kINPUT | Input gate (i). |
kOUTPUT | Output gate (o). |
kFORGET | Forget gate (f). |
kUPDATE | Update gate (z). |
kRESET | Reset gate (r). |
kCELL | Cell gate (c). |
kHIDDEN | Hidden gate (h). |
|
strong |
Enumerates the unary operations that may be performed by a Unary layer.
|
strong |
Enumerates the reduce operations that may be performed by a Reduce layer.
The table shows the result of reducing across an empty volume of a given type.
Operation | kFLOAT and kHALF | kINT32 | kINT8 |
---|---|---|---|
kSUM | 0 | 0 | 0 |
kPROD | 1 | 1 | 1 |
kMAX | negative infinity | INT_MIN | -128 |
kMIN | positive infinity | INT_MAX | 127 |
kAVG | NaN | 0 | -128 |
The current version of TensorRT usually performs reduction for kINT8 via kFLOAT or kHALF. The kINT8 values show the quantized representations of the floating-point values.
Enumerator | |
---|---|
kSUM | |
kPROD | |
kMAX | |
kMIN | |
kAVG |
|
strong |
Controls how ISliceLayer handles out of bounds coordinates.
Enumerator | |
---|---|
kDEFAULT | Fail with error when the coordinates are out of bounds. This is the default. |
kWRAP | Coordinates wrap around periodically. |
|
strong |
|
strong |
Enumerates the operations that may be performed on a tensor by IMatrixMultiplyLayer before multiplication.
Enumerator | |
---|---|
kNONE | Treat x as a matrix if it has two dimensions, or as a collection of matrices if x has more than two dimensions, where the last two dimensions are the matrix dimensions. x must have at least two dimensions. |
kTRANSPOSE | Like kNONE, but transpose the matrix dimensions. |
kVECTOR | Treat x as a vector if it has one dimension, or as a collection of vectors if x has more than one dimension. x must have at least one dimension. The first input tensor with dimensions [M,K] used with MatrixOperation::kVECTOR is equivalent to a tensor with dimensions [M, 1, K] with MatrixOperation::kNONE, i.e. is treated as M row vectors of length K. If MatrixOperation::kTRANSPOSE is specified, then the dimensions are [M, K, 1]. The second input tensor with dimensions [M,K] used with MatrixOperation::kVECTOR is equivalent to a tensor with dimensions [M, K, 1] with MatrixOperation::kNONE, i.e. is treated as M column vectors of length K. If MatrixOperation::kTRANSPOSE is specified, then the dimensions are [M, 1, K]. |
|
strong |
|
strong |
Enum that describes kinds of loop outputs.
|
strong |
|
strong |
Enumerates the tensor fill operations that may performed by a fill layer.
Enumerator | |
---|---|
kLINSPACE | Generate evenly spaced numbers over a specified interval. |
kRANDOM_UNIFORM | Generate a tensor with random values drawn from a uniform distribution. |
|
strong |
|
strong |
List of valid flags for quantizing the network to int8.
Enumerator | |
---|---|
kCALIBRATE_BEFORE_FUSION | Run int8 calibration pass before layer fusion. Only valid for IInt8LegacyCalibrator and IInt8EntropyCalibrator. We always run int8 calibration pass before layer fusion for IInt8MinMaxCalibrator and IInt8EntropyCalibrator2. Disabled by default. |
|
strong |
List of valid modes that the builder can enable when creating an engine from a network definition.
Enumerator | |
---|---|
kFP16 | Enable FP16 layer selection, with FP32 fallback. |
kINT8 | Enable Int8 layer selection, with FP32 fallback with FP16 fallback if kFP16 also specified. |
kDEBUG | Enable debugging of layers via synchronizing after every layer. |
kGPU_FALLBACK | Enable layers marked to execute on GPU if layer cannot execute on DLA. |
kSTRICT_TYPES | Enables strict type constraints. |
kREFIT | Enable building a refittable engine. |
kDISABLE_TIMING_CACHE | Disable reuse of timing information across identical layers. |
kTF32 | Allow (but not require) computations on tensors of type DataType::kFLOAT to use TF32. TF32 computes inner products by rounding the inputs to 10-bit mantissas before multiplying, but accumulates the sum using 23-bit mantissas. Enabled by default. |
|
strong |
|
strong |
|
strong |
List of immutable network properties expressed at network creation time. NetworkDefinitionCreationFlag is used with createNetworkV2 to specify immutable properties of the network. The createNetwork() function always had an implicit batch dimension being specified by the maxBatchSize builder parameter. createNetworkV2 with kDEFAULT flag mimics that behaviour.
Enumerator | |
---|---|
kEXPLICIT_BATCH | Dynamic shape support requires that the kEXPLICIT_BATCH flag is set. With dynamic shapes, any of the input dimensions can vary at run-time, and there are no implicit dimensions in the network specification. This is specified by using the wildcard dimension value -1. Mark the network to be an explicit batch network |
kEXPLICIT_PRECISION | Setting the network to be an explicit precision network has the following implications: 1) Precision of all input tensors to the network have to be specified with ITensor::setType() function 2) Precision of all layer output tensors in the network have to be specified using ILayer::setOutputType() function 3) The builder will not quantize the weights of any layer including those running in lower precision(INT8). It will simply cast the weights into the required precision. 4) Dynamic ranges must not be provided to run the network in int8 mode. Dynamic ranges of each tensor in the explicit precision network is [-127,127]. 5) Quantizing and dequantizing activation values between higher (FP32) and lower (INT8) precision will be performed using explicit Scale layers with input/output precision set appropriately. Mark the network to be an explicit precision network |
|
strong |
The type values for the various plugins.
|
strong |
Forward declaration of IPluginFactory for use by other interfaces.
List of supported engine capability flows.
The EngineCapability determines the restrictions of a network during build time for what can be executed at runtime. EngineCapability::kDEFAULT does not provide any restrictions on functionality and the resulting serialized engine can be executed with TensorRT's standard runtime APIs in the nvinfer1 namespace. EngineCapabiltiy::kSAFE_GPU provides a restricted subset of network operations that are safety certified and the resulting serialized engine can be executed with TensorRT's safe runtime APIs in the nvinfer1::safe namespace. EngineCapability::kSAFE_DLA provides a restricted subset of network operations that are DLA compatible and the resulting serialized engine can be executed using NvMediaDLA's runtime APIs. See sampleNvmedia for an example of integrating NvMediaDLA APIs with TensorRT APIs.
Enumerator | |
---|---|
kDEFAULT | Full capability, TensorRT mode without any restrictions using TensorRT nvinfer1 APIs. |
kSAFE_GPU | Safety restricted capability, TensorRT flow that can only run on GPU devices via TensorRT nvinfer1::safe APIs. |
kSAFE_DLA | Safety restricted capability, TensorRT flow that can only run on DLA devices via NvMediaDLA APIs. |
|
strong |
An operation on two IDimensionExpr, which represent integer expressions used in dimension computations.
For example, given two IDimensionExpr x and y and an IExprBuilder& eb, eb.operation(DimensionOperation::kSUM, x, y) creates a representation of x+y.
|
strong |
How a layer uses particular Weights.
The power weights of an IScaleLayer are omitted. Refitting those is not supported.
Enumerator | |
---|---|
kKERNEL | kernel for IConvolutionLayer, IDeconvolutionLayer, or IFullyConnectedLayer |
kBIAS | bias for IConvolutionLayer, IDeconvolutionLayer, or IFullyConnectedLayer |
kSHIFT | shift part of IScaleLayer |
kSCALE | scale part of IScaleLayer |
kCONSTANT | weights for IConstantLayer |
|
strong |
|
strong |
When setting or querying optimization profile parameters (such as shape tensor inputs or dynamic dimensions), select whether we are interested in the minimum, optimum, or maximum values for these parameters. The minimum and maximum specify the permitted range that is supported at runtime, while the optimum value is used for the kernel selection. This should be the "typical" value that is expected to occur at runtime.
|
strong |
Enumerates the types of activation to perform in an activation layer.
|
strong |
|
strong |
|
strong |
Format of the input/output tensors.
This enum is extended to be used by both plugins and reformat-free network I/O tensors.
For more information about data formats, see the topic "Data Format Description" located in the TensorRT Developer Guide (https://docs.nvidia.com/deeplearning/sdk/tensorrt-developer-guide/index.html).
|
strong |
|
strong |
Enumerator | |
---|---|
kFLOAT16 | FP16 field type. |
kFLOAT32 | FP32 field type. |
kFLOAT64 | FP64 field type. |
kINT8 | INT8 field type. |
kINT16 | INT16 field type. |
kINT32 | INT32 field type. |
kCHAR | char field type. |
kDIMS | nvinfer1::Dims field type. |
kUNKNOWN |
|
strong |
|
strong |
Error codes that can be returned by TensorRT during execution.
the type of parser error
|
inlineconstexpr |
Maximum number of elements in LayerType enum.
|
inlineconstexpr |
Maximum number of elements in PaddingMode enum.
|
inlineconstexpr |
Maximum number of elements in PoolingType enum.
|
inlineconstexpr |
Maximum number of elements in ScaleMode enum.
|
inlineconstexpr |
Maximum number of elements in ElementWiseOperation enum.
|
inlineconstexpr |
Maximum number of elements in RNNOperation enum.
|
inlineconstexpr |
Maximum number of elements in RNNDirection enum.
|
inlineconstexpr |
Maximum number of elements in RNNInputMode enum.
|
inlineconstexpr |
Maximum number of elements in RNNGateType enum.
class nvinfer1::__attribute__ | ( | (deprecated) | ) |
Application-implemented interface to compute the HW output dimensions of a layer from the layer input and parameters.
inputDims | The input dimensions of the layer. |
kernelSize | The kernel size (or window size, for a pooling layer) parameter of the layer operation. |
stride | The stride parameter for the layer. |
padding | The padding parameter of the layer. |
dilation | The dilation parameter of the layer (only applicable to convolutions). |
layerName | The name of the layer. |
Note that for dilated convolutions, the dilation is applied to the kernel size before this routine is called.
|
inlineconstexpr |
Maximum number of elements in UnaryOperation enum.
|
inlineconstexpr |
Maximum number of elements in ReduceOperation enum.
|
inlineconstexpr |
Maximum number of elements in SliceMode enum.
|
inlineconstexpr |
Maximum number of elements in TopKOperation enum.
|
inlineconstexpr |
Maximum number of elements in MatrixOperation enum.
|
inlineconstexpr |
Maximum number of elements in ResizeMode enum.
|
inlineconstexpr |
Maximum number of elements in LoopOutput enum.
|
inlineconstexpr |
Maximum number of elements in TripLimit enum.
|
inlineconstexpr |
Maximum number of elements in FillOperation enum.
|
inlineconstexpr |
Maximum number of elements in CalibrationAlgoType enum.
|
inlineconstexpr |
Maximum number of quantization flags in QuantizationFlag enum.
|
inlineconstexpr |
Maximum number of builder flags in BuilderFlag enum.
|
inlineconstexpr |
Maximum number of profile verbosity levels in ProfilingVerbosity enum.
|
inlineconstexpr |
Maximum number of tactic sources in TacticSource enum.
|
inlineconstexpr |
Maximum number of elements in NetworkDefinitionCreationFlag enum.
|
inlineconstexpr |
Maximum number of elements in PluginType enum.
|
inlineconstexpr |
Maximum number of elements in EngineCapability enum.
|
inlineconstexpr |
Maximum number of elements in DimensionOperation enum.
|
inlineconstexpr |
Maximum number of elements in WeightsRole enum.
|
inlineconstexpr |
Maximum number of elements in DeviceType enum.
|
inlineconstexpr |
|
inlineconstexpr |
Forward declare IGpuAllocator for use in other interfaces.
Maximum number of elements in an enumeration type.
|
inlineconstexpr |
Maximum number of elements in ActivationType enum.
|
inlineconstexpr |
Maximum number of elements in DataType enum.
|
inlineconstexpr |
Maximum number of elements in DimensionType enum.
|
inlineconstexpr |
Maximum number of elements in TensorFormat enum.
|
inlineconstexpr |
Maximum number of elements in TensorLocation enum.
|
inlineconstexpr |
Maximum number of elements in ILogger::Severity enum.
|
inlineconstexpr |
Maximum number of elements in ErrorCode enum.
|
inline |