TensorRT  7.2.1.6
NVIDIA TensorRT
TensorRT OSS Release Changelog

7.2.1 - 2020-10-20

Added

  • Polygraphy v0.20.13 - Deep Learning Inference Prototyping and Debugging Toolkit
  • PyTorch-Quantization Toolkit v2.0.0
  • Updated BERT plugins for variable sequence length inputs
    • Added optimized kernels for sequence lengths of 64 and 96
  • Added Tacotron2 + WaveGlow TTS demo #677
  • Re-enabled GridAnchorRect_TRT plugin with rectangular feature maps #679
  • Updated batchedNMS plugin to the IPluginV2DynamicExt interface #738
  • Added support for 3D inputs in the InstanceNormalization plugin #745
  • Added this CHANGELOG.md

Changed

  • ONNX GraphSurgeon v0.2.7 with bug fixes and new examples
  • demo/BERT bugfixes for Jetson Xavier
  • Updated build Dockerfile to cuda-11.1
  • Updated ClangFormat style specification according to TensorRT coding guidelines

Removed

  • N/A
sample::BuildOptions::tf32
bool tf32
Definition: sampleOptions.h:112
nvcaffeparser1::IPluginFactoryExt
Plugin factory used to configure plugins with added support for TRT versioning.
Definition: NvCaffeParser.h:116
sample::anonymous_namespace{sampleInference.cpp}::Iteration::mEnqueue
EnqueueFunction mEnqueue
Definition: sampleInference.cpp:460
xy_t::xy_t
xy_t()
Definition: maskRCNNKernels.h:260
chptToBin.chpt_to_dict_arrays_simple
def chpt_to_dict_arrays_simple(file_name)
Definition: chptToBin.py:30
nvinfer1::plugin::BatchTilePlugin::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: batchTilePlugin.cpp:156
SampleMNISTAPI::loadWeights
std::map< std::string, nvinfer1::Weights > loadWeights(const std::string &file)
Loads weights from weights file.
Definition: sampleMNISTAPI.cpp:389
polygraphy.backend.tf.loader.GraphFromKeras.__init__
def __init__(self, path)
Definition: backend/tf/loader.py:112
pytorch_quantization.tensor_quant._tensor_quant
def _tensor_quant(inputs, amax, num_bits=8, unsigned=False, narrow_range=True)
Definition: tensor_quant.py:318
bert::Fused_multihead_attention_params_v2::scale_softmax
uint32_t scale_softmax
Definition: fused_multihead_attention_v2.h:58
nvinfer1::IPluginV2DynamicExt::enqueue
virtual int32_t enqueue(const PluginTensorDesc *inputDesc, const PluginTensorDesc *outputDesc, const void *const *inputs, void *const *outputs, void *workspace, cudaStream_t stream)=0
Execute the layer.
SampleMLP::teardown
bool teardown()
Cleans up any state created in the sample class.
Definition: sampleMLP.cpp:349
nvinfer1::plugin::ProposalPlugin::mAnchorSizeNum
size_t mAnchorSizeNum
Definition: proposalPlugin.h:120
FrcnnArgs
Definition: frcnnUtils.h:51
polygraphy.tools.surgeon.subtool.base
Definition: surgeon/subtool/base.py:1
test_ir.TestNode.input_tensor
input_tensor
Definition: test_ir.py:136
isolate.inputs
inputs
Definition: isolate.py:54
std::numeric_limits< half_float::half >::infinity
static half_float::half infinity()
Positive infinity.
Definition: samples/common/half.h:4246
nvinfer1::IInt8EntropyCalibrator2
Entropy calibrator 2.
Definition: NvInfer.h:6078
nvinfer1::plugin::BatchTilePlugin::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: batchTilePlugin.cpp:113
nvinfer1::LoopOutput
LoopOutput
Enum that describes kinds of loop outputs.
Definition: NvInfer.h:4487
bert::TFusedMHAKernelFactory::hashID
uint64_t hashID(Data_type type, unsigned int sm) const
Definition: fused_multihead_attention.h:322
bert::fused_multihead_attention_int8_384_64_kernel_sm80_cu_o
unsigned char fused_multihead_attention_int8_384_64_kernel_sm80_cu_o[]
Definition: fused_multihead_attention_int8_384_64_kernel.sm80.cpp:20
nmtSample::SLPEmbedder::mWeights
ComponentWeights::ptr mWeights
Definition: slpEmbedder.h:51
polygraphy.backend.onnx.loader.BytesFromOnnx.__init__
def __init__(self, model)
Definition: backend/onnx/loader.py:179
nmtSample::Attention
calculates attention vector from context and decoder output vectors
Definition: attention.h:32
model.WaveGlow.n_early_size
n_early_size
Definition: demo/Tacotron2/waveglow/model.py:168
inference_trt.init_decoder_tensors
def init_decoder_tensors(decoder_inputs, decoder_outputs)
Definition: inference_trt.py:118
SampleErrorRecorder::decRefCount
IErrorRecorder::RefCount decRefCount() noexcept final
Decrements the refcount for the current ErrorRecorder.
Definition: ErrorRecorder.h:105
polygraphy.backend.trt_legacy.TrtLegacyRunner.engine
engine
Definition: backend/trt_legacy.py:270
half_float::detail::operator>>
std::basic_istream< charT, traits > & operator>>(std::basic_istream< charT, traits > &in, half &arg)
Input operator.
Definition: ieee_half.h:2218
SampleOnnxMNIST::SampleUniquePtr
std::unique_ptr< T, samplesCommon::InferDeleter > SampleUniquePtr
Definition: sampleOnnxMNIST.cpp:49
builder.BertConfig.use_qat
use_qat
Definition: builder.py:104
LogStreamConsumerBuffer::setShouldLog
void setShouldLog(bool shouldLog)
Definition: pluginLogging.h:92
RefineNMSParameters::numClasses
int numClasses
Definition: maskRCNNKernels.h:86
nvinfer1::PoolingType::kMAX
@ kMAX
tools.test_surgeon.TestSurgeonInsert
Definition: test_surgeon.py:45
LogStreamConsumerBase
Convenience object used to initialize LogStreamConsumerBuffer before std::ostream in LogStreamConsume...
Definition: pluginLogging.h:107
example.load_network
def load_network(builder, network, parser)
Definition: Polygraphy/examples/api/03_interoperating_with_tensorrt/example.py:39
polygraphy.tools.args.trt_legacy.TrtLegacyArgs.add_to_parser
def add_to_parser(self, parser)
Definition: tools/args/trt_legacy.py:25
anonymous_namespace{gridAnchorPlugin.cpp}::GRID_ANCHOR_PLUGIN_NAMES
std::string GRID_ANCHOR_PLUGIN_NAMES[]
Definition: gridAnchorPlugin.cpp:29
samplesCommon::vPPM::buffer
std::vector< uint8_t > buffer
Definition: samples/common/common.h:664
anonymous_namespace{instanceNormalizationPlugin.cpp}::INSTANCE_PLUGIN_VERSION
constexpr const char * INSTANCE_PLUGIN_VERSION
Definition: instanceNormalizationPlugin.cpp:64
polygraphy.backend.trt.util.build_profile
def build_profile(builder, network, profile)
Definition: trt/util.py:310
polygraphy.logger.logger
Definition: Polygraphy/polygraphy/logger/logger.py:1
tests.tensor_quant_test.TestFakeAffineTensorQuant.test_simple_run
def test_simple_run(self, verbose)
Definition: tensor_quant_test.py:354
nvinfer1::LayerType::kTOPK
@ kTOPK
TopK layer.
nvinfer1::plugin::ProposalPlugin::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: proposalPlugin.cpp:160
nvinfer1::plugin::Normalize::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: normalizePlugin.cpp:229
polygraphy.backend.trt.util.load_plugins
def load_plugins(plugins)
Definition: trt/util.py:28
anonymous_namespace{serialize.hpp}::Serializer< const char * >::deserialize
static void deserialize(void const **buffer, size_t *buffer_size, const char **value)
Definition: serialize.hpp:77
bert::TFusedMultiHeadAttentionXMMAKernel::~TFusedMultiHeadAttentionXMMAKernel
virtual ~TFusedMultiHeadAttentionXMMAKernel()=default
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer._learn_amax
_learn_amax
Definition: tensor_quantizer.py:78
polygraphy.backend.onnx.util.mark_outputs
def mark_outputs(model, outputs)
Definition: onnx/util.py:68
polygraphy.tools.surgeon.subtool.insert.Insert.add_parser_args
def add_parser_args(self, parser)
Definition: insert.py:30
nvinfer1::IRuntime::destroy
virtual void destroy() noexcept=0
Destroy this object.
half_float::detail::functions::scalbln
static half scalbln(half arg, long exp)
Scaling implementation.
Definition: samples/common/half.h:2188
nvinfer1::plugin::DetectionOutput::C1
int C1
Definition: nmsPlugin.h:90
model.WaveGlow.upsample
upsample
Definition: demo/Tacotron2/waveglow/model.py:161
onnx_graphsurgeon.ir.graph.Graph.fold_constants
def fold_constants(self)
Definition: graph.py:341
nvinfer1::Dims3::Dims3
Dims3()
Construct an empty Dims3 object.
Definition: NvInfer.h:170
sample::TrtCudaBuffer::TrtCudaBuffer
TrtCudaBuffer()=default
SampleConfig::getTolerance
float getTolerance() const
Definition: sampleConfig.h:296
tests.optim_helper_test.TestMatchParameters
Definition: optim_helper_test.py:33
nvinfer1::plugin::MultilevelCropAndResizePluginCreator::mPooledSize
int mPooledSize
Definition: multilevelCropAndResizePlugin.h:124
model.Tacotron2.postnet
postnet
Definition: demo/Tacotron2/tacotron2/model.py:609
nvinfer1::Dims::MAX_DIMS
static const int32_t MAX_DIMS
The maximum number of dimensions supported for a tensor.
Definition: NvInferRuntimeCommon.h:197
nvonnxparser::IParserError::~IParserError
virtual ~IParserError()
Definition: NvOnnxParser.h:112
nvinfer1::plugin::CodeTypeSSD::TF_CENTER
@ TF_CENTER
Use box centers and size but flip x and y coordinates.
nvinfer1::PluginField::type
PluginFieldType type
Plugin field attribute type.
Definition: NvInferRuntimeCommon.h:861
nvinfer1::IIdentityLayer::~IIdentityLayer
virtual ~IIdentityLayer()
Definition: NvInfer.h:4260
nvinfer1::plugin::PyramidROIAlign::detachFromContext
void detachFromContext() override
Detach the plugin object from its execution context.
Definition: pyramidROIAlignPlugin.cpp:292
text.cmudict.CMUDict.lookup
def lookup(self, word)
Definition: cmudict.py:51
FCPlugin::getWorkspaceSize
virtual size_t getWorkspaceSize(int maxBatchSize) const override
Definition: samples/opensource/samplePlugin/fcPlugin.h:164
SamplePlugin::constructNetwork
void constructNetwork(SampleUniquePtr< nvinfer1::IBuilder > &builder, SampleUniquePtr< nvcaffeparser1::ICaffeParser > &parser, SampleUniquePtr< nvinfer1::INetworkDefinition > &network)
Uses a Caffe parser to create the MNIST Network and marks the output layers.
Definition: samplePlugin.cpp:208
nvinfer1::plugin::RPROIPlugin::setPluginNamespace
void setPluginNamespace(const char *pluginNamespace) override
Set the namespace that this plugin object belongs to.
Definition: nvFasterRCNNPlugin.cpp:244
nmtSample::LSTMEncoder::~LSTMEncoder
~LSTMEncoder() override=default
helpers.calibrator.BertCalibrator.__init__
def __init__(self, squad_json, vocab_file, cache_file, batch_size, max_seq_length, num_inputs)
Definition: demo/BERT/helpers/calibrator.py:28
nvinfer1::plugin::BatchTilePlugin::isOutputBroadcastAcrossBatch
bool isOutputBroadcastAcrossBatch(int outputIndex, const bool *inputIsBroadcasted, int nbInputs) const override
Definition: batchTilePlugin.cpp:127
polygraphy.tools.args.tf2onnx.loader.Tf2OnnxLoaderArgs.parse
def parse(self, args)
Definition: tools/args/tf2onnx/loader.py:40
pytorch_quantization.nn.modules.quant_rnn.QuantRNNCell.weight_hh
weight_hh
Definition: modules/quant_rnn.py:347
nvinfer1::LayerType::kPLUGIN
@ kPLUGIN
Plugin layer.
tools.test_run.TestTf.test_tf_save_pb
def test_tf_save_pb(self)
Definition: test_run.py:149
stft.STFT.inverse
def inverse(self, magnitude, phase)
Definition: stft.py:107
pytorch_quantization.nn.modules.quant_rnn.GRUCell.input_size
input_size
Definition: modules/quant_rnn.py:428
tests.calibrator_test.TestMaxCalibrator.test_fine_grain
def test_fine_grain(self)
Definition: calibrator_test.py:49
SampleUffSSD::infer
bool infer()
Runs the TensorRT inference engine for this sample.
Definition: sampleUffSSD.cpp:232
Logger::reportTestStart
static void reportTestStart(TestAtom &testAtom)
Report that a test has started.
Definition: pluginLogging.h:345
polygraphy.comparator.compare.OutputCompareResult
Definition: compare.py:24
polygraphy.backend.pyt.runner.PytRunner
Definition: backend/pyt/runner.py:21
nvcaffeparser1::CaffeWeightFactory::generator
std::default_random_engine generator
Definition: caffeWeightFactory.h:64
ParserOnnxConfig::isDebug
virtual bool isDebug() const
Definition: parserOnnxConfig.h:137
pytorch_quantization.quant_modules.QuantModuleReplacementHelper.quant_map
quant_map
Definition: quant_modules.py:63
polygraphy.tools.args.trt.runner.TrtRunnerArgs
Definition: tools/args/trt/runner.py:21
nvinfer1::EnumMax< ElementWiseOperation >
constexpr int32_t EnumMax< ElementWiseOperation >()
Maximum number of elements in ElementWiseOperation enum.
Definition: NvInfer.h:2625
nvcaffeparser1::ICaffeParser::setPluginFactory
virtual void setPluginFactory(IPluginFactory *factory)=0
Set the IPluginFactory used to create the user defined plugins.
nvinfer1::plugin::MultilevelCropAndResize::canBroadcastInputAcrossBatch
bool canBroadcastInputAcrossBatch(int inputIndex) const override
Definition: multilevelCropAndResizePlugin.cpp:268
inference.main
def main()
Definition: Tacotron2/inference.py:181
samplesCommon::GenericBuffer::GenericBuffer
GenericBuffer(size_t size, nvinfer1::DataType type)
Construct a buffer with the specified allocation size in bytes.
Definition: buffers.h:69
SampleMNISTAPIParams::outputSize
int outputSize
The output size.
Definition: sampleMNISTAPI.cpp:50
text.numbers._expand_ordinal
def _expand_ordinal(m)
Definition: numbers.py:60
nvinfer1::plugin::MultilevelProposeROI::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: multilevelProposeROIPlugin.cpp:241
nvinfer1::INetworkDefinition::markOutput
virtual void markOutput(ITensor &tensor)=0
Mark a tensor as a network output.
polygraphy.tools.args.comparator.ComparatorRunArgs.add_to_parser
def add_to_parser(self, parser)
Definition: tools/args/comparator.py:29
polygraphy.backend.trt.loader.EngineFromNetwork.__call__
def __call__(self)
Definition: backend/trt/loader.py:410
sample::LogStreamConsumer::LogStreamConsumer
LogStreamConsumer(LogStreamConsumer &&other)
Definition: samples/common/logging.h:145
nvinfer1::IPluginV2::getNbOutputs
virtual int32_t getNbOutputs() const =0
Get the number of outputs from the layer.
sample::Logger::TestAtom::mCmdline
std::string mCmdline
Definition: samples/common/logging.h:283
data_function.TextMelLoader.__init__
def __init__(self, dataset_path, audiopaths_and_text, args)
Definition: tacotron2/data_function.py:32
nvinfer1::ActivationType::kHARD_SIGMOID
@ kHARD_SIGMOID
Hard sigmoid activation: max(0, min(1, alpha*x+beta))
polygraphy.tools.args.onnx.loader.OnnxLoaderArgs._write
_write
Definition: tools/args/onnx/loader.py:25
samplesCommon::GpuTimer::start
void start()
Definition: samples/common/common.h:821
nvinfer1::IInt8Calibrator::~IInt8Calibrator
virtual ~IInt8Calibrator()
Definition: NvInfer.h:6056
onnx_graphsurgeon.ir.node.Node
Definition: node.py:24
nvinfer1::ITensor::getAllowedFormats
virtual TensorFormats getAllowedFormats() const =0
Get a bitmask of TensorFormat values that the tensor supports.
nmtSample::SoftmaxLikelihood::SoftmaxLikelihood
SoftmaxLikelihood()=default
pytorch_quantization.nn.modules.quant_conv.QuantConv2d.padding_mode
padding_mode
Definition: quant_conv.py:121
test_infer_trt.parse_args
def parse_args(parser)
Definition: test_infer_trt.py:38
Logger::TestAtom::TestAtom
TestAtom(TestAtom &&)=default
nvinfer1::plugin::SpecialSlicePluginCreator
Definition: specialSlicePlugin.h:99
model.WaveGlow.remove_weightnorm
def remove_weightnorm(model)
Definition: demo/Tacotron2/waveglow/model.py:273
samplesCommon::GenericBuffer::allocFn
AllocFunc allocFn
Definition: buffers.h:175
sample::LogStreamConsumerBuffer::mShouldLog
bool mShouldLog
Definition: samples/common/logging.h:104
nvinfer1::plugin::SpecialSlice::isOutputBroadcastAcrossBatch
bool isOutputBroadcastAcrossBatch(int outputIndex, const bool *inputIsBroadcasted, int nbInputs) const override
Definition: specialSlicePlugin.cpp:182
nvinfer1::plugin::Reorg
Definition: reorgPlugin.h:29
pytorch_quantization.nn.modules.clip.Clip.__init__
def __init__(self, clip_value_min, clip_value_max, learn_min=False, learn_max=False)
Definition: clip.py:40
TLTMaskRCNNConfig::ANCHOR_RATIOS
static const std::vector< std::pair< float, float > > ANCHOR_RATIOS
Definition: tlt_mrcnn_config.h:68
setup
Definition: onnx-graphsurgeon/setup.py:1
nvinfer1::plugin::FlattenConcat::detachFromContext
void detachFromContext() override
Detach the plugin object from its execution context.
Definition: flattenConcat.cpp:195
onnx_graphsurgeon.importers.onnx_importer.OnnxImporter
Definition: onnx_importer.py:69
nvinfer1::IErrorRecorder
Reference counted application-implemented error reporting interface for TensorRT objects.
Definition: NvInferRuntimeCommon.h:1233
SampleConfig::mModelFilename
std::string mModelFilename
Definition: sampleConfig.h:36
sample::gLogWarning
LogStreamConsumer gLogWarning
Definition: logger.cpp:25
nvinfer1::plugin::RPROIPluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: nvFasterRCNNPlugin.cpp:339
MaskRCNNUtils::padPPM
void padPPM(const PPM< uint8_t > &src, PPM< uint8_t > &dst, int top, int bottom, int left, int right)
Definition: sampleUffMaskRCNN.cpp:144
nvinfer1::plugin::LogStream::buffer
Buf buffer
Definition: checkMacrosPlugin.h:55
BertInference::BertInference
BertInference(const std::string &enginePath, const int maxBatchSize, const int seqLength, const bool enableGraph=false)
Definition: bert_infer.h:35
polygraphy.backend.onnx.loader.BytesFromOnnx._model
_model
Definition: backend/onnx/loader.py:186
nvinfer1::plugin::DetectionOutputParameters::codeType
CodeTypeSSD codeType
Definition: NvInferPluginUtils.h:186
chptToBin.concatenate_layers
def concatenate_layers(params)
Definition: chptToBin.py:105
SampleMovieLensParams::embeddingVecSize
int32_t embeddingVecSize
Definition: sampleMovieLens.cpp:64
polygraphy.tools.args.tf2onnx.loader.Tf2OnnxLoaderArgs.add_to_script
def add_to_script(self, script, suffix=None)
Definition: tools/args/tf2onnx/loader.py:45
MaskRCNNUtils::PPM
Definition: sampleUffMaskRCNN.cpp:72
nvinfer1::plugin::RPROIPlugin::anchorsScalesHost
float * anchorsScalesHost
Definition: nvFasterRCNNPlugin.h:100
bert::TFusedMultiHeadAttentionXMMAKernel::hashID
virtual uint64_t hashID(const KernelMeta &kernelMeta) const
Definition: fused_multihead_attention.h:198
nvinfer1::IPluginV2DynamicExt::supportsFormatCombination
virtual bool supportsFormatCombination(int32_t pos, const PluginTensorDesc *inOut, int32_t nbInputs, int32_t nbOutputs)=0
Return true if plugin supports the format and datatype for the input/output indexed by pos.
sample::MirroredBuffer::deviceToHost
void deviceToHost(TrtCudaStream &stream)
Definition: sampleDevice.h:338
nvinfer1::IRuntime::~IRuntime
virtual ~IRuntime()
Definition: NvInferRuntime.h:768
models.init_bn
def init_bn(module)
Definition: demo/Tacotron2/models.py:46
sample::TrtCudaStream::sleep
void sleep(int *ms)
Definition: sampleDevice.h:90
nmtSample::BLEUScoreWriter::mReferenceLength
size_t mReferenceLength
Definition: bleuScoreWriter.h:54
export_tacotron2_onnx.DecoderIter.tacotron2
tacotron2
Definition: export_tacotron2_onnx.py:107
EntropyCalibratorImpl::mStream
BatchStream mStream
Definition: frcnnUtils.h:493
chptToBin.convert_weigts
def convert_weigts(dimensions, data, forget_bias=1.0)
Definition: chptToBin.py:314
SampleINT8APIParams::dynamicRangeFileName
std::string dynamicRangeFileName
Definition: sampleINT8API.cpp:64
tests.quant_pooling_test.TestQuantAvgPool1d.test_raise
def test_raise(self)
Definition: quant_pooling_test.py:130
nvinfer1::PluginType::kFASTERRCNN
@ kFASTERRCNN
FasterRCNN fused plugin (RPN + ROI pooling).
samplesCommon::CpuTimer::mStop
std::chrono::time_point< Clock > mStop
Definition: samples/common/common.h:856
util.test_misc.FindInDictCase.map
map
Definition: test_misc.py:37
polygraphy.backend.trt.runner.TrtRunner.set_shapes_from_feed_dict
def set_shapes_from_feed_dict(self, feed_dict)
Definition: backend/trt/runner.py:104
half_float::detail::functions::atan2
static expr atan2(float x, float y)
Arc tangent implementation.
Definition: samples/common/half.h:1935
nmtSample::Alignment::getAttentionKeySize
virtual int getAttentionKeySize()=0
get the size of the attention keys
tools.test_args.TestParseMeta.test_parse_dtype_only
def test_parse_dtype_only(self)
Definition: test_args.py:20
tests.tensor_quantizer_test.TestTensorQuantizer.test_clip_mode
def test_clip_mode(self)
Definition: tensor_quantizer_test.py:105
classification_flow.export_onnx
def export_onnx(model, onnx_filename, batch_onnx, per_channel_quantization)
Definition: classification_flow.py:309
test_infer.print_stats
def print_stats(measurements_all)
Definition: test_infer.py:72
ConcatTopKWorkSpace::concatedBBoxDims
nvinfer1::DimsHW concatedBBoxDims
Definition: maskRCNNKernels.h:196
nvinfer1::plugin::GenerateDetection::supportsFormat
bool supportsFormat(DataType type, PluginFormat format) const override
Check format support.
Definition: generateDetectionPlugin.cpp:160
tools.test_run.TestOther.test_custom_per_output_tolerance
def test_custom_per_output_tolerance(self)
Definition: test_run.py:233
Logger::Logger
Logger(Severity severity)
Definition: demo/BERT/infer_c/logging.h:31
nmtSample::LikelihoodCombinationOperator::combine
virtual float combine(float rayLikelihood, float optionLikelihood) const =0
tools.test_run.TestTrt.test_trt_exclude_outputs_with_layerwise
def test_trt_exclude_outputs_with_layerwise(self)
Definition: test_run.py:83
nvinfer1::plugin::Region::destroy
void destroy() override
Destroy the plugin object.
Definition: regionPlugin.cpp:398
nvinfer1::plugin::MultilevelProposeROI::setPluginNamespace
void setPluginNamespace(const char *libNamespace) override
Set the namespace that this plugin object belongs to.
Definition: multilevelProposeROIPlugin.cpp:226
half_float::detail::exp2
expr exp2(half arg)
Binary exponential.
Definition: ieee_half.h:2340
nvinfer1::IUnaryLayer::~IUnaryLayer
virtual ~IUnaryLayer()
Definition: NvInfer.h:3535
sample::Bindings::getBindings
std::unordered_map< std::string, int > getBindings() const
Definition: sampleUtils.h:430
polygraphy.backend.trt.util.str_from_network
def str_from_network(network, mode="full")
Definition: trt/util.py:96
nvinfer1::PluginType::kPRIORBOX
@ kPRIORBOX
PriorBox plugin.
TrtDestroyer
Definition: demo/BERT/infer_c/common.h:167
nvinfer1::PaddingMode::kEXPLICIT_ROUND_UP
@ kEXPLICIT_ROUND_UP
Use explicit padding, rounding output size up.
nmtSample::Component::~Component
virtual ~Component()=default
comparator.test_data_loader.TestDataLoaderCache
Definition: test_data_loader.py:98
classification_flow.collect_stats
def collect_stats(model, data_loader, num_batches)
Definition: classification_flow.py:374
SampleOnnxMNIST::mEngine
std::shared_ptr< nvinfer1::ICudaEngine > mEngine
The TensorRT engine used to run the network.
Definition: sampleOnnxMNIST.cpp:75
tests.optim_helper_test.TestFreezeParameters.test_simple
def test_simple(self, resnet18)
Definition: optim_helper_test.py:97
half_float::detail::binary_specialized::fmin
static expr fmin(float x, float y)
Minimum implementation.
Definition: samples/common/half.h:2552
bert::QKVToContextInterleavedPlugin::initialize
int initialize() override
Initialize the layer for execution.
Definition: qkvToContextInt8InterleavedPlugin.cpp:182
helpers.calibrator.BertCalibrator.free
def free(self)
Definition: demo/BERT/helpers/calibrator.py:48
Args::userToItemsMap
std::map< int32_t, std::vector< int32_t > > userToItemsMap
Definition: sampleMovieLensMPS.cpp:181
polygraphy.tools.args.comparator.ComparatorCompareArgs
Definition: tools/args/comparator.py:68
samplesCommon::DeviceAllocator
Definition: buffers.h:179
sample::InferenceTrace::stream
int stream
Definition: sampleReporting.h:91
sample::ReportingOptions::exportTimes
std::string exportTimes
Definition: sampleOptions.h:177
tests.quant_conv_test.TestQuantConv3D.test_fake_quant_quant_per_channel_bias
def test_fake_quant_quant_per_channel_bias(self)
Definition: quant_conv_test.py:509
pytorch_quantization.tensor_quant.ScaledQuantDescriptor._scale_amax
_scale_amax
Definition: tensor_quant.py:102
polygraphy.backend.trt_legacy.ParseNetworkFromOnnxLegacy.onnx_loader
onnx_loader
Definition: backend/trt_legacy.py:127
MNISTBatchStream::mBatchSize
int mBatchSize
Definition: BatchStream.h:146
pytorch_quantization.nn.modules.quant_rnn.QuantRNNBase.bidirectional
bidirectional
Definition: modules/quant_rnn.py:53
layers.LinearNorm.__init__
def __init__(self, in_dim, out_dim, bias=True, w_init_gain='linear')
Definition: layers.py:24
tests.quant_modules_test.TestQuantModuleReplace.test_with_custom_quant_modules
def test_with_custom_quant_modules(self)
Definition: quant_modules_test.py:58
UffPoolPluginV2::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: sampleUffPluginV2Ext.cpp:563
nvinfer1::plugin::DetectionLayerPluginCreator::mFC
static PluginFieldCollection mFC
Definition: detectionLayerPlugin.h:126
nmtSample::MultiplicativeAlignment::getAttentionKeySize
int getAttentionKeySize() override
get the size of the attention keys
Definition: multiplicativeAlignment.cpp:71
half_float::detail::builtin_isnan
bool builtin_isnan(T arg)
Check for NaN.
Definition: ieee_half.h:420
pytorch_quantization.nn.modules.quant_rnn.GRUCell.weight_hh
weight_hh
Definition: modules/quant_rnn.py:432
model.Decoder.__init__
def __init__(self, n_mel_channels, n_frames_per_step, encoder_embedding_dim, attention_dim, attention_location_n_filters, attention_location_kernel_size, attention_rnn_dim, decoder_rnn_dim, prenet_dim, max_decoder_steps, gate_threshold, p_attention_dropout, p_decoder_dropout, early_stopping)
Definition: demo/Tacotron2/tacotron2/model.py:240
MaskRCNNUtils::RawDetection::class_id
float class_id
Definition: sampleUffMaskRCNN.cpp:54
export_waveglow_onnx.parse_args
def parse_args(parser)
Definition: export_waveglow_onnx.py:26
nvinfer1::plugin::CropAndResizePluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that needs to be passed to createPlugin.
Definition: cropAndResizePlugin.cpp:263
nvinfer1::plugin::MultilevelCropAndResize::mFeatureSpatialSize
xy_t mFeatureSpatialSize[mFeatureMapCount]
Definition: multilevelCropAndResizePlugin.h:101
helpers.data_processing._compute_softmax
def _compute_softmax(scores)
Definition: data_processing.py:310
AlgorithmCacheReader::selectAlgorithms
int32_t selectAlgorithms(const nvinfer1::IAlgorithmContext &algoContext, const nvinfer1::IAlgorithm *const *algoChoices, int32_t nbChoices, int32_t *selection) override
Return value in [0, nbChoices] for a valid algorithm.
Definition: sampleAlgorithmSelector.cpp:128
samplesCommon::HostFree::operator()
void operator()(void *ptr) const
Definition: buffers.h:210
nvinfer1::anonymous_namespace{NvInferRuntime.h}::createInferRuntime
IRuntime * createInferRuntime(ILogger &logger)
Create an instance of an IRuntime class.
Definition: NvInferRuntime.h:1990
pytorch_quantization.tensor_quant.ScaledQuantDescriptor
Definition: tensor_quant.py:30
comparator.test_comparator.TestComparator
Definition: test_comparator.py:32
nvinfer1::plugin::BatchedNMSPlugin::isOutputBroadcastAcrossBatch
bool isOutputBroadcastAcrossBatch(int outputIndex, const bool *inputIsBroadcasted, int nbInputs) const override
Definition: batchedNMSPlugin.cpp:459
polygraphy.tools.args.logger.LoggerArgs.add_to_parser
def add_to_parser(self, parser)
Definition: Polygraphy/polygraphy/tools/args/logger.py:23
nvinfer1::plugin::ProposalLayer::configurePlugin
void configurePlugin(const Dims *inputDims, int nbInputs, const Dims *outputDims, int nbOutputs, const DataType *inputTypes, const DataType *outputTypes, const bool *inputIsBroadcast, const bool *outputIsBroadcast, PluginFormat floatFormat, int maxBatchSize) override
Definition: proposalLayerPlugin.cpp:349
batchPrepare.height
int height
Definition: batchPrepare.py:34
FrcnnArgs::inputHeight
int inputHeight
Definition: frcnnUtils.h:58
nvinfer1::CalibrationAlgoType::kMINMAX_CALIBRATION
@ kMINMAX_CALIBRATION
export_waveglow_onnx.main
def main()
Definition: export_waveglow_onnx.py:197
sample::defaultWarmUp
constexpr int defaultWarmUp
Definition: sampleOptions.h:47
nvinfer1::plugin::CudnnError
Definition: checkMacrosPlugin.h:110
parserutils::getCHWWithExpansion
nvinfer1::DimsCHW getCHWWithExpansion(const nvinfer1::Dims &d, int filler)
Definition: parserUtils.h:114
onnx_graphsurgeon.logger.logger.Logger.critical
def critical(self, message)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:244
nvinfer1::plugin::DetectionOutput::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: nmsPlugin.cpp:131
bert::EmbLayerNormVarSeqlenPlugin::enqueue
int enqueue(const nvinfer1::PluginTensorDesc *inputDesc, const nvinfer1::PluginTensorDesc *outputDesc, const void *const *inputs, void *const *outputs, void *workspace, cudaStream_t stream) override
Execute the layer.
Definition: embLayerNormVarSeqlenPlugin.cpp:260
pytorch_quantization.nn.modules.quant_instancenorm.QuantInstanceNorm2d.__init__
def __init__(self, int num_features, float eps=1e-5, float momentum=0.1, bool affine=False, bool track_running_stats=False, **kwargs)
Definition: quant_instancenorm.py:52
TLTMaskRCNNConfig::MODEL_MASK_SHAPE
static const Dims4 MODEL_MASK_SHAPE
Definition: tlt_mrcnn_config.h:175
nvinfer1::plugin::TRTException::status
int status
Definition: checkMacrosPlugin.h:96
FrcnnArgs::inputWidth
int inputWidth
Definition: frcnnUtils.h:59
sample::InferenceEnvironment::profiler
std::unique_ptr< Profiler > profiler
Definition: sampleInference.h:36
helpers.calibrator.BertCalibrator.tokenizer
tokenizer
Definition: demo/BERT/helpers/calibrator.py:41
audio_processing.window_sumsquare
def window_sumsquare(window, n_frames, hop_length=200, win_length=800, n_fft=800, dtype=np.float32, norm=None)
Definition: audio_processing.py:23
nmtSample::BeamSearchPolicy::Ray::backtrackId
int backtrackId
Definition: beamSearchPolicy.h:59
sample::Logger::TestResult::kRUNNING
@ kRUNNING
The test is running.
onnx_graphsurgeon.logger.logger.Logger.__init__
def __init__(self, severity=INFO, colors=True, letter=True, timestamp=False, line_info=False)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:89
nvinfer1::plugin::CoordConvACPluginCreator::createPlugin
IPluginV2Ext * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: coordConvACPlugin.cpp:195
nvinfer1::ILayer::~ILayer
virtual ~ILayer()
Definition: NvInfer.h:945
FrcnnArgs::dataDirs
std::vector< std::string > dataDirs
Definition: frcnnUtils.h:57
polygraphy.tools.args.comparator
Definition: tools/args/comparator.py:1
nvinfer1::plugin::ProposalLayer::mValidCnt
std::shared_ptr< CudaBind< int > > mValidCnt
Definition: proposalLayerPlugin.h:104
nvinfer1::plugin::CoordConvACPlugin::oH
int oH
Definition: coordConvACPlugin.h:87
nvinfer1::plugin::BatchTilePlugin::clone
IPluginV2Ext * clone() const override
Clone the plugin object.
Definition: batchTilePlugin.cpp:166
MaskRCNNUtils::writePPMFile
void writePPMFile(const std::string &filename, PPM< uint8_t > &ppm)
Definition: sampleUffMaskRCNN.cpp:92
parserutils::operator<<
std::ostream & operator<<(std::ostream &o, const nvinfer1::Dims &dims)
Definition: parserUtils.h:87
nvinfer1::plugin::PReLU
LReLU PReLU
Definition: lReluPlugin.h:99
half_float::half::operator*=
detail::enable< half &, T >::type operator*=(T rhs)
Arithmetic assignment.
Definition: ieee_half.h:1139
SampleConfig::mUseDLACore
int64_t mUseDLACore
Definition: sampleConfig.h:50
nvinfer1::plugin::ProposalLayer::mKeepTopK
int mKeepTopK
Definition: proposalLayerPlugin.h:99
nvinfer1::SliceMode
SliceMode
Controls how ISliceLayer handles out of bounds coordinates.
Definition: NvInfer.h:3881
Bbox
Definition: bboxUtils.h:26
BertInferenceRunner::prepare
void prepare(const int batchSize)
Definition: infer_c.cpp:31
bert::FusedMultiHeadAttentionKernelMetaInfoV1::mSharedMemBytes
unsigned int mSharedMemBytes
Definition: fused_multihead_attention.h:124
bert::QKVToContextInterleavedPlugin::mNamespace
std::string mNamespace
Definition: qkvToContextInt8InterleavedPlugin.h:79
BboxInfo::bbox_idx
int bbox_idx
Definition: bboxUtils.h:44
polygraphy.tools.args.tf.loader.TfLoaderArgs.freeze_graph
freeze_graph
Definition: tools/args/tf/loader.py:77
pytorch_quantization.tensor_quant.FakeAffineTensorQuantFunction.forward
def forward(ctx, inputs, min_range, max_range, num_bits=8)
Definition: tensor_quant.py:378
nvinfer1::PaddingMode::kSAME_LOWER
@ kSAME_LOWER
Use SAME padding, with prePadding >= postPadding.
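As an illustration of the kSAME_LOWER rule above, here is a minimal Python sketch of how the total SAME padding for one spatial axis could be split so that prePadding >= postPadding. This is not TensorRT code: the function name `same_lower_padding`, its parameters, and the clamping of the total to zero are assumptions made for the example.

```python
def same_lower_padding(input_size, kernel_size, stride=1, dilation=1):
    """Return (pre_padding, post_padding) for one spatial axis (hypothetical helper)."""
    dilated_kernel = 1 + dilation * (kernel_size - 1)
    # Total padding needed so that the output size is ceil(input_size / stride).
    # Clamping at zero is an assumption of this sketch.
    total = max(0, ((input_size - 1) // stride) * stride + dilated_kernel - input_size)
    post = total // 2   # the smaller half goes after the data
    pre = total - post  # so pre >= post, matching kSAME_LOWER
    return pre, post


print(same_lower_padding(5, 2))     # (1, 0)
print(same_lower_padding(7, 3, 2))  # (1, 1)
```

When the total padding is odd, the extra element lands before the data; a SAME_UPPER variant would swap the two halves.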
polygraphy.tools.surgeon.subtool.base.BaseSurgeonSubtool.import_graph
def import_graph(self, args)
Definition: surgeon/subtool/base.py:44
nvinfer1::plugin::Reorg::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: reorgPlugin.cpp:98
nvinfer1::RNNGateType::kUPDATE
@ kUPDATE
Update gate (z).
PluginFactory::createPlugin
nvinfer1::IPlugin * createPlugin(const char *layerName, const void *serialData, size_t serialLength) override
Create a plugin from serialized data.
Definition: samples/opensource/samplePlugin/fcPlugin.h:356
SampleMovieLens::printOutputParams
void printOutputParams(OutputParams &outParams)
Prints the expected recommendation results (ground truth) from the MovieLens dataset for a given user...
Definition: sampleMovieLens.cpp:458
onnx_graphsurgeon.util.misc.combine_dicts
def combine_dicts(dict0, dict1)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/util/misc.py:53
polygraphy.tools.args.trt_legacy.TrtLegacyArgs.save_uff
save_uff
Definition: tools/args/trt_legacy.py:62
nvinfer1::plugin::PyramidROIAlign::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: pyramidROIAlignPlugin.cpp:139
polygraphy.backend.tf.loader.UseTfTrt.int8
int8
Definition: backend/tf/loader.py:228
SampleDynamicReshape::mPreprocessorContext
SampleUniquePtr< nvinfer1::IExecutionContext > mPreprocessorContext
Definition: sampleDynamicReshape.cpp:86
half_float::detail::false_type
bool_type< false > false_type
Definition: ieee_half.h:302
UffPoolPluginV2::mInputDims
Dims mInputDims
Definition: sampleUffPluginV2Ext.cpp:638
polygraphy.backend.tf.runner.TfRunner.infer_impl
def infer_impl(self, feed_dict)
Definition: backend/tf/runner.py:78
nvinfer1::IErrorRecorder::ErrorDesc
const char * ErrorDesc
A typedef of a c-style string for reporting error descriptions.
Definition: NvInferRuntimeCommon.h:1239
classification_flow.res
def res
Definition: classification_flow.py:443
AlgorithmCacheReader::AlgorithmCacheItem::tactic
int64_t tactic
Definition: sampleAlgorithmSelector.cpp:238
nvinfer1::DimensionType::kCHANNEL
@ kCHANNEL
Elements correspond to different channels.
samplesCommon::GenericBuffer::GenericBuffer
GenericBuffer(GenericBuffer &&buf)
Definition: buffers.h:80
polygraphy.tools.args.onnx.runner
Definition: tools/args/onnx/runner.py:1
nmtSample::SLPProjection::mOutputChannelCount
int mOutputChannelCount
Definition: slpProjection.h:54
nvinfer1::IAlgorithmContext::getNbOutputs
virtual int32_t getNbOutputs() const =0
Return number of outputs of the algorithm.
nvinfer1::plugin::DetectionLayer::mValidCnt
std::shared_ptr< CudaBind< int > > mValidCnt
Definition: detectionLayerPlugin.h:101
nmtSample::SequenceProperties::~SequenceProperties
virtual ~SequenceProperties()=default
nvinfer1::plugin::GroupNormalizationPlugin::mChannelVolume
int mChannelVolume
Definition: groupNormalizationPlugin.h:100
nvinfer1::plugin::Normalize::getWorkspaceSize
size_t getWorkspaceSize(int maxBatchSize) const override
Definition: normalizePlugin.cpp:102
nvinfer1::plugin::ProposalPlugin::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: proposalPlugin.cpp:270
polygraphy.backend.trt.util._get_network_outputs
def _get_network_outputs(network)
Definition: trt/util.py:186
half_float::half::round_style
static const std::float_round_style round_style
Rounding mode to use.
Definition: ieee_half.h:1190
polygraphy.util.misc.insert_suffix
def insert_suffix(path, suffix)
File creation.
Definition: Polygraphy/polygraphy/util/misc.py:282
nvinfer1::plugin::SplitPlugin::setPluginNamespace
void setPluginNamespace(const char *) override
Set the namespace that this plugin object belongs to.
Definition: split.h:119
nvinfer1::plugin::MultilevelCropAndResize::enqueue
int enqueue(int batch_size, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: multilevelCropAndResizePlugin.cpp:196
nvinfer1::plugin::SplitPlugin::supportsFormatCombination
bool supportsFormatCombination(int pos, const nvinfer1::PluginTensorDesc *inOut, int nbInputs, int nbOutputs) override
onnx_graphsurgeon.ir.graph.Graph.doc_string
doc_string
Definition: graph.py:121
nvinfer1::IFillLayer::~IFillLayer
virtual ~IFillLayer()
Definition: NvInfer.h:4885
polygraphy.tools.args.tf2onnx.loader.Tf2OnnxLoaderArgs.opset
opset
Definition: tools/args/tf2onnx/loader.py:41
nvinfer1::PluginTensorDesc::type
DataType type
Definition: NvInferRuntimeCommon.h:370
nvinfer1::plugin::NMSParameters::scoreThreshold
float scoreThreshold
Definition: NvInferPluginUtils.h:251
nvinfer1::ErrorCode::kINVALID_ARGUMENT
@ kINVALID_ARGUMENT
An argument passed to the function is invalid in isolation.
tests.quant_linear_test.TestQuantLinear.test_set_default_quant_desc
def test_set_default_quant_desc(self)
Definition: quant_linear_test.py:211
tests.tensor_quant_test.TestFakeTensorQuant
Definition: tensor_quant_test.py:135
tests.quant_conv_test.TestQuantConv1D.test_fake_quant_per_channel_other_prec
def test_fake_quant_per_channel_other_prec(self)
Definition: quant_conv_test.py:383
backend.trt.test_loader.TestOnnxNetworkLoader
Definition: trt/test_loader.py:89
bert::EmbLayerNormVarSeqlenPlugin::mBeta
bert::WeightsWithOwnership mBeta
Definition: embLayerNormVarSeqlenPlugin.h:97
comparator.test_compare.TestBasicCompareFunc
Definition: test_compare.py:22
SampleUffFasterRcnn::mInputDims
nvinfer1::Dims mInputDims
The dimensions of the input to the network.
Definition: sampleUffFasterRCNN.cpp:123
half_float::detail::functions::remainder
static expr remainder(float x, float y)
Remainder implementation.
Definition: samples/common/half.h:1657
nmtSample::TextReader::getInfo
std::string getInfo() override
Get the textual description of the component.
Definition: textReader.cpp:65
backend.trt.test_loader.identity_identity_network
def identity_identity_network()
Definition: trt/test_loader.py:53
SampleMNISTAPIParams
The SampleMNISTAPIParams structure groups the additional parameters required by the SampleMNISTAPI sa...
Definition: sampleMNISTAPI.cpp:46
bert::fused_multihead_attention_fp16_128_64_kernel_sm80_cu_o
unsigned char fused_multihead_attention_fp16_128_64_kernel_sm80_cu_o[]
Definition: fused_multihead_attention_fp16_128_64_kernel.sm80.cpp:20
polygraphy.tools.args.tf2onnx.loader
Definition: tools/args/tf2onnx/loader.py:1
pytorch_quantization.nn.modules.quant_pooling.QuantAdaptiveAvgPool2d
Definition: quant_pooling.py:129
export_tacotron2_onnx.Encoder.infer
infer
Definition: export_tacotron2_onnx.py:68
text.cmudict._get_pronunciation
def _get_pronunciation(s)
Definition: cmudict.py:75
sample::TrtCudaBuffer
Managed buffer for host and device.
Definition: sampleDevice.h:214
nvinfer1::plugin::InstanceNormalizationPlugin::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: instanceNormalizationPlugin.cpp:143
nvinfer1::plugin::BatchedNMSBasePluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: batchedNMSPlugin.h:134
comparator.test_comparator.TestComparator.test_generator_as_data_loader
def test_generator_as_data_loader(self)
Definition: test_comparator.py:52
polygraphy.tools.args.comparator.ComparatorCompareArgs.rtol
rtol
Definition: tools/args/comparator.py:105
polygraphy.backend.trt_legacy.LoadNetworkFromCaffe.__init__
def __init__(self, deploy, model, outputs, batch_size=None, dtype=None)
Definition: backend/trt_legacy.py:144
test_ir.TestNodeIO.setup_method
def setup_method(self, field_names)
Definition: test_ir.py:222
test_examples.Example
Definition: Polygraphy/tests/test_examples.py:54
polygraphy.tools.precision.precision.STBisect.find
def find(self)
Definition: precision.py:258
ParserOnnxConfig::getModelDtype
virtual nvinfer1::DataType getModelDtype() const
Get the Model Data Type.
Definition: parserOnnxConfig.h:82
polygraphy.util.misc.indent_block
def indent_block(block, level=1)
Definition: Polygraphy/polygraphy/util/misc.py:416
polygraphy.logger.logger.Logger.colors
colors
Definition: Polygraphy/polygraphy/logger/logger.py:130
samplesCommon::GenericBuffer::size
size_t size() const
Returns the size (in number of elements) of the buffer.
Definition: buffers.h:128
nvinfer1::plugin::CropAndResizePlugin::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: cropAndResizePlugin.cpp:108
SampleFasterRCNN::mInputDims
nvinfer1::Dims mInputDims
The dimensions of the input to the network.
Definition: sampleFasterRCNN.cpp:85
onnx_graphsurgeon.logger.logger.LoggerIndent.__init__
def __init__(self, logger, indent)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:27
TLTMaskRCNNConfig::FPN_CLASSIF_FC_LAYERS_SIZE
static const int FPN_CLASSIF_FC_LAYERS_SIZE
Definition: tlt_mrcnn_config.h:50
nvinfer1::IPluginCreator::deserializePlugin
virtual IPluginV2 * deserializePlugin(const char *name, const void *serialData, size_t serialLength)=0
Called during deserialization of plugin layer.
onnx_graphsurgeon.ir.graph.Graph.tensors
def tensors(self, check_duplicates=False)
Definition: graph.py:301
nvcaffeparser1::BinaryProtoBlob::getDimensions
nvinfer1::DimsNCHW getDimensions() override
Definition: binaryProtoBlob.h:36
MNISTBatchStream::skip
void skip(int skipCount) override
Definition: BatchStream.h:68
bert::SkipLayerNormInterleavedPlugin::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: skipLayerNormInt8InterleavedPlugin.cpp:234
nvinfer1::Dims2::Dims2
Dims2(int32_t d0, int32_t d1)
Construct a Dims2 from 2 elements.
Definition: NvInfer.h:83
sample::exportJSONTrace
void exportJSONTrace(const std::vector< InferenceTrace > &trace, const std::string &fileName)
Printed format: [ value, ...] value ::= { "start enq" : time, "end enq" : time, "start in" : time,...
Definition: sampleReporting.cpp:221
SampleUffFasterRcnnParams::classifierRegressorStd
std::vector< float > classifierRegressorStd
Definition: sampleUffFasterRCNN.cpp:75
polygraphy.backend.common.loader.BytesFromPath.__call__
def __call__(self)
Definition: backend/common/loader.py:30
nmtSample::ngramCountIntersection
Count_t ngramCountIntersection(const Count_t &cnt0, const Count_t &cnt1)
Definition: bleuScoreWriter.cpp:95
nvinfer1::plugin::ProposalLayer::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: proposalLayerPlugin.cpp:187
polygraphy.backend.common.loader.BytesFromPath
Definition: backend/common/loader.py:19
nvinfer1::plugin::reduced_divisor::get_mul_shift
__host__ void get_mul_shift(unsigned &mul, unsigned &shift)
Definition: reducedMathPlugin.h:89
SampleUffSSD::processInput
bool processInput(const samplesCommon::BufferManager &buffers)
Reads the input and mean data, preprocesses, and stores the result in a managed buffer.
Definition: sampleUffSSD.cpp:286
nvinfer1::plugin::ResizeNearest::mOutputDims
Dims mOutputDims
Definition: resizeNearestPlugin.h:92
nvinfer1::plugin::PriorBoxPluginCreator::createPlugin
IPluginV2Ext * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: priorBoxPlugin.cpp:383
nmtSample::BeamSearchPolicy::Ray
Definition: beamSearchPolicy.h:56
nvinfer1::ErrorCode::kFAILED_ALLOCATION
@ kFAILED_ALLOCATION
An error occurred when performing an allocation of memory on the host or the device.
MinimumWorkspaceAlgorithmSelector::selectAlgorithms
int32_t selectAlgorithms(const nvinfer1::IAlgorithmContext &algoContext, const nvinfer1::IAlgorithm *const *algoChoices, int32_t nbChoices, int32_t *selection) override
Return value in [0, nbChoices] for a valid algorithm.
Definition: sampleAlgorithmSelector.cpp:283
nvinfer1::ActivationType::kSOFTPLUS
@ kSOFTPLUS
Parametric softplus activation: alpha*log(exp(beta*x)+1)
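To make the kSOFTPLUS formula above concrete, here is a minimal Python sketch that evaluates alpha*log(exp(beta*x)+1) for a scalar. It is not TensorRT code: the function name `softplus` and the default values of `alpha` and `beta` are assumptions for the example.

```python
import math


def softplus(x, alpha=1.0, beta=1.0):
    """Parametric softplus: alpha * log(exp(beta * x) + 1)."""
    z = beta * x
    # log(exp(z) + 1) == max(z, 0) + log1p(exp(-|z|));
    # this rearrangement avoids overflow in exp() for large |z|.
    return alpha * (max(z, 0.0) + math.log1p(math.exp(-abs(z))))


print(softplus(0.0))            # equals log(2)
print(softplus(2.0, 0.5, 3.0))  # equals 0.5 * log(exp(6) + 1)
```

The rearranged form gives the same values as the direct formula while staying finite for large inputs, where the function approaches alpha*beta*x.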
backend.trt.test_loader.TestSaveEngine.test_save_engine
def test_save_engine(self, identity_network)
Definition: trt/test_loader.py:271
nvinfer1::plugin::PyramidROIAlign::canBroadcastInputAcrossBatch
bool canBroadcastInputAcrossBatch(int inputIndex) const override
Definition: pyramidROIAlignPlugin.cpp:260
nvinfer1::plugin::CoordConvACPlugin::initialize
int initialize() override
Initialize the layer for execution.
Definition: coordConvACPlugin.cpp:64
samplesCommon::UffSampleParams
The UffSampleParams structure groups the additional parameters required by networks that use Uff.
Definition: argsParser.h:71
util.test_format.FormatTestCase.__init__
def __init__(self, shape, format)
Definition: test_format.py:24
SampleMaskRCNN::mOriginalPPMs
std::vector< MaskRCNNUtils::PPM< uint8_t > > mOriginalPPMs
Definition: sampleUffMaskRCNN.cpp:335
Semaphore::wait
void wait()
Definition: sampleMovieLensMPS.cpp:94
polygraphy.util.format.FormatManager.permutation
def permutation(old_format, new_format)
Definition: format.py:97
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer._narrow_range
_narrow_range
Definition: tensor_quantizer.py:80
nvinfer1::plugin::Region::hasSoftmaxTree
bool hasSoftmaxTree
Definition: regionPlugin.h:98
nmtSample::LSTMDecoder::LSTMDecoder
LSTMDecoder(ComponentWeights::ptr weights)
Definition: lstmDecoder.cpp:29
SampleConfig::setEngineFileName
void setEngineFileName(const char *engineFilename)
Definition: sampleConfig.h:133
SampleConfig::setTF32
void setTF32(bool enabled)
Definition: sampleConfig.h:99
nvinfer1::ITensor::getName
virtual const char * getName() const =0
Get the tensor name.
data_function.TextMelLoader.__len__
def __len__(self)
Definition: tacotron2/data_function.py:79
generate.gemm
def gemm(self, a, b, trans_a=False, trans_b=False)
Definition: 07_creating_a_model_with_the_layer_api/generate.py:51
export_tacotron2_onnx.lstmcell2lstm_params
def lstmcell2lstm_params(lstm_mod, lstmcell_mod)
Definition: export_tacotron2_onnx.py:85
AlgorithmCacheReader::AlgorithmCacheItem::nbInputs
int32_t nbInputs
Definition: sampleAlgorithmSelector.cpp:239
bert::EmbLayerNormVarSeqlenPlugin::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: embLayerNormVarSeqlenPlugin.cpp:337
nvinfer1::plugin::Reorg::initialize
int initialize() override
Initialize the layer for execution.
Definition: reorgPlugin.cpp:93
half_float::detail::operator<
enable< bool, T, U >::type operator<(T x, U y)
Comparison for less than.
Definition: ieee_half.h:2141
tests.quant_linear_test.TestQuantLinear.test_fake_quant_per_channel_other_precs
def test_fake_quant_per_channel_other_precs(self)
Definition: quant_linear_test.py:153
polygraphy.tools.precision.precision.STWorstFirst.__init__
def __init__(self)
Definition: precision.py:172
sample::anonymous_namespace{sampleInference.cpp}::Iteration::getEnqueueTime
TimePoint getEnqueueTime(bool start)
Definition: sampleInference.cpp:408
comparator.test_struct.TestRunResults.test_keys
def test_keys(self, run_results)
Definition: comparator/test_struct.py:36
nvinfer1::DeviceType::kGPU
@ kGPU
GPU Device.
SampleMaskRCNN::mInputDims
nvinfer1::Dims mInputDims
Definition: sampleUffMaskRCNN.cpp:332
backend.trt.test_loader.TestSerializedEngineLoader.test_serialized_engine_loader_from_lambda
def test_serialized_engine_loader_from_lambda(self, identity_engine)
Definition: trt/test_loader.py:73
bert::packedMaskSize384
constexpr size_t packedMaskSize384
Definition: embLayerNormVarSeqlenPlugin.cpp:43
nvinfer1::plugin::GenerateDetectionPluginCreator::mScoreThreshold
float mScoreThreshold
Definition: generateDetectionPlugin.h:133
nvinfer1::plugin::LReluPluginCreator::deserializePlugin
IPluginV2 * deserializePlugin(const char *name, const void *serialData, size_t serialLength) override
Called during deserialization of plugin layer.
Definition: lReluPlugin.cpp:165
SampleConfig::setModelFileName
void setModelFileName(const char *onnxFilename)
Set the Model File Name.
Definition: sampleConfig.h:109
tools.test_inspect.params
params
Definition: test_inspect.py:26
tests.quant_rnn_test.TestQuantLSTMCell.test_no_quant_input_hidden_bias
def test_no_quant_input_hidden_bias(self, verbose)
Definition: quant_rnn_test.py:109
BatchStream::mImageSize
int mImageSize
Definition: BatchStream.h:378
nvinfer1::plugin::ProposalLayer::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: proposalLayerPlugin.cpp:125
polygraphy.logger.logger.LoggerVerbosity.__enter__
def __enter__(self)
Definition: Polygraphy/polygraphy/logger/logger.py:64
nvinfer1::IReduceLayer::~IReduceLayer
virtual ~IReduceLayer()
Definition: NvInfer.h:3625
data_function.TextMelLoader.sampling_rate
sampling_rate
Definition: tacotron2/data_function.py:36
nvinfer1::plugin::reduced_divisor::divmod
__host__ __device__ __forceinline__ void divmod(int x, int &q, int &mod) const
Definition: reducedMathPlugin.h:80
nmtSample::LimitedSamplesDataReader::LimitedSamplesDataReader
LimitedSamplesDataReader(int maxSamplesToRead, DataReader::ptr originalDataReader)
Definition: limitedSamplesDataReader.cpp:24
nvinfer1::plugin::GenerateDetection::mMaxBatchSize
int mMaxBatchSize
Definition: generateDetectionPlugin.h:100
bert::TFusedMultiHeadAttentionXMMAKernel::mDriver
nvinfer1::CUDADriverWrapper mDriver
Definition: fused_multihead_attention.h:275
nvinfer1::plugin::Region::smTree
std::shared_ptr< softmaxTree > smTree
Definition: regionPlugin.h:96
bert::SkipLayerNormInterleavedPluginCreator::deserializePlugin
nvinfer1::IPluginV2 * deserializePlugin(const char *name, const void *serialData, size_t serialLength) override
Called during deserialization of plugin layer.
Definition: skipLayerNormInt8InterleavedPlugin.cpp:331
pytorch_quantization.nn.modules.quant_rnn.QuantLSTMCell.forward
def forward(self, input, hx=None)
Definition: modules/quant_rnn.py:407
nvinfer1::plugin::PriorBoxParameters::minSize
float * minSize
Definition: NvInferPluginUtils.h:93
onnx_graphsurgeon.util.exception.OnnxGraphSurgeonException
Definition: onnx-graphsurgeon/onnx_graphsurgeon/util/exception.py:17
pytorch_quantization.calib.calibrator._Calibrator
Definition: tools/pytorch-quantization/pytorch_quantization/calib/calibrator.py:21
nvinfer1::plugin::PyramidROIAlignPluginCreator
Definition: pyramidROIAlignPlugin.h:104
SampleUffSSDParams::calBatchSize
int32_t calBatchSize
The size of the calibration batch.
Definition: sampleUffSSD.cpp:53
sample::BuildOptions::shapesCalib
std::unordered_map< std::string, ShapeRange > shapesCalib
Definition: sampleOptions.h:124
tests.quant_rnn_test.TestQuantLSTM.test_quant_input_hidden_bias
def test_quant_input_hidden_bias(self, verbose)
Definition: quant_rnn_test.py:448
nvinfer1::PluginFieldCollection::fields
const PluginField * fields
Pointer to PluginField entries.
Definition: NvInferRuntimeCommon.h:879
nvinfer1::plugin::PriorBox::canBroadcastInputAcrossBatch
bool canBroadcastInputAcrossBatch(int inputIndex) const override
Definition: priorBoxPlugin.cpp:308
nvinfer1::IProfiler::reportLayerTime
virtual void reportLayerTime(const char *layerName, float ms)=0
Layer time reporting callback.
nmtSample::BeamSearchPolicy::getInfo
std::string getInfo() override
Get the textual description of the component.
Definition: beamSearchPolicy.cpp:166
nmtSample::SLPAttention::addToModel
void addToModel(nvinfer1::INetworkDefinition *network, nvinfer1::ITensor *inputFromDecoder, nvinfer1::ITensor *context, nvinfer1::ITensor **attentionOutput) override
Add the attention vector calculation to the network.
Definition: slpAttention.cpp:38
polygraphy.util.cuda.DeviceBuffer.resize
def resize(self, shape)
Definition: cuda.py:146
nvinfer1::plugin::CropAndResizePlugin::getOutputDataType
DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: cropAndResizePlugin.cpp:200
polygraphy.comparator.struct.AccuracyResult.percentage
def percentage(self, runner_pair=None)
Definition: comparator/struct.py:173
profiler
Definition: profiler.py:1
PoolParameters::mV
int mV
Definition: sampleUffPluginV2Ext.cpp:181
bert::EmbLayerNormVarSeqlenPlugin::destroy
void destroy() override
Destroy the plugin object.
Definition: embLayerNormVarSeqlenPlugin.cpp:395
BufferDesc::BufferDesc
BufferDesc(nvinfer1::Dims dims, int dataWidth, TensorFormat format)
Definition: sampleReformatFreeIO.cpp:69
nvinfer1::IRefitter::setDynamicRange
virtual bool setDynamicRange(const char *tensorName, float min, float max)=0
Update dynamic range for a tensor.
bert::TFusedMultiHeadAttentionXMMAKernel::FusedMultiHeadAttentionKernelInfo::mMetaInfoIndex
unsigned int mMetaInfoIndex
Definition: fused_multihead_attention.h:284
nvinfer1::LayerType::kRNN
@ kRNN
RNN layer.
pytorch_quantization.nn.modules.quant_rnn.QuantRNNBase.bias
bias
Definition: modules/quant_rnn.py:49
nvinfer1::plugin::FlattenConcat::~FlattenConcat
~FlattenConcat() override
Definition: flattenConcat.cpp:80
tests.calibrator_test.TestMaxCalibrator.test_repr
def test_repr(self)
Definition: calibrator_test.py:90
onnx_graphsurgeon.importers.onnx_importer
Definition: onnx_importer.py:1
tools.test_surgeon.TestSurgeonExtract.test_fallback_shape_inference
def test_fallback_shape_inference(self)
Definition: test_surgeon.py:31
polygraphy.tools.surgeon.surgeon.Surgeon.__init__
def __init__(self)
Definition: surgeon.py:25
SampleConfig::getPrintLayerInfo
bool getPrintLayerInfo() const
Get whether the layer information will be printed.
Definition: sampleConfig.h:161
nvinfer1::plugin::Normalize::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: normalizePlugin.cpp:87
bert::anonymous_namespace{skipLayerNormInt8InterleavedPlugin.cpp}::SKIP_LAYER_NORM_INTERLEAVED_VERSION
static const char * SKIP_LAYER_NORM_INTERLEAVED_VERSION
Definition: skipLayerNormInt8InterleavedPlugin.cpp:41
SampleUffPluginV2Ext::teardown
bool teardown()
Used to clean up any state created in the sample class.
Definition: sampleUffPluginV2Ext.cpp:326
pytorch_quantization.nn.modules.quant_linear.QuantLinear
Definition: quant_linear.py:28
MultilevelProposeROIWorkSpace::preRefineSortedScoreDims
nvinfer1::DimsHW preRefineSortedScoreDims
Definition: maskRCNNKernels.h:162
nvinfer1::plugin::GroupNormalizationPlugin::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: groupNormalizationPlugin.cpp:246
nvinfer1::plugin::RPROIParams::anchorsRatioCount
int32_t anchorsRatioCount
Definition: NvInferPluginUtils.h:123
nvonnxparser::EnumMax
int32_t EnumMax()
polygraphy.backend.trt_legacy.LoadUffFile.path
path
Definition: backend/trt_legacy.py:37
samplesCommon::BBox::y2
float y2
Definition: samples/common/common.h:669
nvinfer1::plugin::ProposalPlugin::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: proposalPlugin.cpp:248
SampleSSDParams::keepTopK
int keepTopK
The maximum number of detections post-NMS.
Definition: sampleSSD.cpp:50
sample::LogStreamConsumerBase::LogStreamConsumerBase
LogStreamConsumerBase(std::ostream &stream, const std::string &prefix, bool shouldLog)
Definition: samples/common/logging.h:114
anonymous_namespace{batchedNMSPlugin.cpp}::NMS_PLUGIN_VERSION
const char * NMS_PLUGIN_VERSION
Definition: batchedNMSPlugin.cpp:34
nvinfer1::plugin::PriorBoxParameters::numMaxSize
int32_t numMaxSize
Definition: NvInferPluginUtils.h:94
nvinfer1::IRuntime::deserializeCudaEngine
virtual nvinfer1::ICudaEngine * deserializeCudaEngine(const void *blob, std::size_t size, IPluginFactory *pluginFactory) noexcept=0
Deserialize an engine from a stream.
tools.test_args.TestParseMeta.test_parse_shape_dtype
def test_parse_shape_dtype(self)
Definition: test_args.py:27
sample::anonymous_namespace{sampleEngines.cpp}::RndInt8Calibrator::readCalibrationCache
const void * readCalibrationCache(size_t &length) override
Definition: sampleEngines.cpp:239
tests.quant_linear_test.TestQuantLinear.test_fake_quant_against_unquantized
def test_fake_quant_against_unquantized(self)
Definition: quant_linear_test.py:178
profiler.mergeHeaders
def mergeHeaders(features, skipFirst=True)
Definition: profiler.py:99
sample::InferenceTime::out
float out
Definition: sampleReporting.h:55
anonymous_namespace{proposalPlugin.cpp}::PROPOSAL_PLUGIN_NAME
static const char * PROPOSAL_PLUGIN_NAME
Definition: proposalPlugin.cpp:33
nvinfer1::Dims2
Descriptor for two-dimensional data.
Definition: NvInfer.h:65
nvinfer1::CUDADriverWrapper::cuLinkAddFile
CUresult cuLinkAddFile(CUlinkState state, CUjitInputType type, const char *path, unsigned int numOptions, CUjit_option *options, void **optionValues) const
Definition: cudaDriverWrapper.cpp:112
polygraphy.backend.trt.loader
Definition: backend/trt/loader.py:1
nvinfer1::RNNInputMode::kSKIP
@ kSKIP
No operation is performed on the first recurrent layer.
test_ir.TestConstant.output_node
output_node
Definition: test_ir.py:125
model.LocationLayer.__init__
def __init__(self, attention_n_filters, attention_kernel_size, attention_dim)
Definition: demo/Tacotron2/tacotron2/model.py:30
bert::packedMaskSize128
constexpr size_t packedMaskSize128
Definition: embLayerNormVarSeqlenPlugin.cpp:41
sample::MirroredBuffer::getSize
size_t getSize() const
Definition: sampleDevice.h:343
ConcatTopKWorkSpace::totalSize
size_t totalSize
Definition: maskRCNNKernels.h:205
nvinfer1::IDimensionExpr::getConstantValue
virtual int32_t getConstantValue() const =0
If isConstant(), returns value of the constant.
nvinfer1::plugin::ResizeNearestPluginCreator::createPlugin
IPluginV2Ext * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: resizeNearestPlugin.cpp:62
comparator.test_struct.make_iter_results
def make_iter_results(runner_name)
Definition: comparator/test_struct.py:9
polygraphy.util.cuda.Cuda
Definition: cuda.py:24
nvinfer1::plugin::GroupNormalizationPlugin::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: groupNormalizationPlugin.cpp:77
nvinfer1::plugin::GroupNormalizationPluginCreator
Definition: groupNormalizationPlugin.h:119
half_float::detail::functions::lgamma
static double lgamma(double arg)
Definition: samples/common/half.h:2485
sample::anonymous_namespace{sampleInference.cpp}::IterationStreams
std::vector< std::unique_ptr< Iteration > > IterationStreams
Definition: sampleInference.cpp:474
CudaBind::~CudaBind
~CudaBind()
Definition: maskRCNNKernels.h:74
tests.quant_pooling_test.TestQuantAvgPool2d.test_input_fake_quant_disable
def test_input_fake_quant_disable(self)
Definition: quant_pooling_test.py:186
nvinfer1::plugin::Normalize::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: normalizePlugin.cpp:117
UffPoolPluginV2
Definition: sampleUffPluginV2Ext.cpp:337
SampleCharRNNParams::hiddenSize
int hiddenSize
Definition: sampleCharRNN.cpp:117
polygraphy.comparator.data_loader.BaseDataLoader
Definition: comparator/data_loader.py:25
polygraphy.backend.common
Definition: tools/Polygraphy/polygraphy/backend/common/__init__.py:1
sample::InferenceOptions::skip
bool skip
Definition: sampleOptions.h:159
half_float::detail::rint
half rint(half arg)
Nearest integer using half's internal rounding mode.
Definition: ieee_half.h:2593
nvinfer1::plugin::CoordConvACPlugin::isOutputBroadcastAcrossBatch
bool isOutputBroadcastAcrossBatch(int outputIndex, const bool *inputIsBroadcasted, int nbInputs) const override
Definition: coordConvACPlugin.cpp:166
nvcaffeparser1::WeightType::kNVBIAS
@ kNVBIAS
model.Invertible1x1Conv.W_inverse
W_inverse
Definition: demo/Tacotron2/waveglow/model.py:77
sample::ModelFormat::kANY
@ kANY
tools.test_run.TestTrt.test_trt
def test_trt(self)
Definition: test_run.py:61
tracer.skipTrace
def skipTrace(trace, start)
Definition: tracer.py:53
nvinfer1::plugin::Normalize::isOutputBroadcastAcrossBatch
bool isOutputBroadcastAcrossBatch(int outputIndex, const bool *inputIsBroadcasted, int nbInputs) const override
Definition: normalizePlugin.cpp:184
nvinfer1::plugin::DetectionLayer::check_valid_inputs
void check_valid_inputs(const nvinfer1::Dims *inputs, int nbInputDims)
Definition: detectionLayerPlugin.cpp:217
pytorch_quantization.nn.modules.quant_bert.QuantBertSelfAttention
Definition: quant_bert.py:49
model.Prenet
Definition: demo/Tacotron2/tacotron2/model.py:108
infer.main
def main()
Definition: infer.py:43
polygraphy.tools.inspect.subtool.data.Data.__init__
def __init__(self)
Definition: data.py:28
nvinfer1::PaddingMode::kCAFFE_ROUND_DOWN
@ kCAFFE_ROUND_DOWN
Use CAFFE padding, rounding output size down, uses prePadding value.
nmtSample::LSTMEncoder::getStateSizes
std::vector< nvinfer1::Dims > getStateSizes() override
Get the sizes (as a vector) of the hidden state vectors.
Definition: lstmEncoder.cpp:105
onnx_graphsurgeon.logger.logger.Logger.indent
def indent(self, level=1)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:135
helpers.calibrator.BertCalibrator.read_histogram_cache
def read_histogram_cache(self, length)
Definition: demo/BERT/helpers/calibrator.py:107
nvinfer1::plugin::RegionPluginCreator::RegionPluginCreator
RegionPluginCreator()
Definition: regionPlugin.cpp:468
nvinfer1::LayerType::kGATHER
@ kGATHER
Gather layer.
nvinfer1::plugin::PriorBoxPluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: priorBoxPlugin.cpp:373
backend.onnx.test_loader.TestModifyOnnx
Definition: onnx/test_loader.py:53
nvinfer1::TensorFormat::kCDHW32
@ kCDHW32
Thirty-two wide channel vectorized row major format.
nvinfer1::PoolingType::kMAX_AVERAGE_BLEND
@ kMAX_AVERAGE_BLEND
nvinfer1::plugin::RPROIPluginCreator::~RPROIPluginCreator
~RPROIPluginCreator() override
Definition: nvFasterRCNNPlugin.cpp:334
polygraphy.backend.trt_legacy.ConvertToUff.uff_path
uff_path
Definition: backend/trt_legacy.py:51
SampleINT8APIParams::verbose
bool verbose
Definition: sampleINT8API.cpp:57
nvinfer1::IProfiler::~IProfiler
virtual ~IProfiler()
Definition: NvInferRuntime.h:676
nvinfer1::plugin::Normalize::supportsFormat
bool supportsFormat(DataType type, PluginFormat format) const override
Check format support.
Definition: normalizePlugin.cpp:139
CudaBind::CudaBind
CudaBind(size_t size)
Definition: maskRCNNKernels.h:68
onnx_graphsurgeon.util.exception
Definition: onnx-graphsurgeon/onnx_graphsurgeon/util/exception.py:1
nvinfer1::plugin::GridAnchorGenerator::initialize
int initialize() override
Initialize the layer for execution.
Definition: gridAnchorPlugin.cpp:186
nvinfer1::plugin::ResizeNearest::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: resizeNearestPlugin.cpp:119
pytorch_quantization.calib.histogram.HistogramCalibrator._skip_zeros
_skip_zeros
Definition: histogram.py:47
nvinfer1::TensorFormat::kCHW16
@ kCHW16
Sixteen wide channel vectorized row major format.
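The following Python sketch illustrates what a sixteen-wide channel-vectorized row-major layout means for indexing. It assumes the usual interpretation: channels are grouped into blocks of 16 (padded up to a multiple of 16), and the channel index within a block is the fastest-varying dimension, so an N x C x H x W tensor is stored like an array of shape [N][ceil(C/16)][H][W][16]. The function name `chw16_offset` and its parameters are hypothetical.

```python
def chw16_offset(n, c, h, w, C, H, W, vec=16):
    """Linear element offset of (n, c, h, w) in a channel-vectorized layout."""
    c_blocks = (C + vec - 1) // vec  # channel count rounded up to whole blocks
    # Array subscript [n][c // vec][h][w][c % vec], flattened row-major.
    return (((n * c_blocks + c // vec) * H + h) * W + w) * vec + c % vec


# Example with C=20 (two channel blocks), H=2, W=3.
print(chw16_offset(0, 1, 0, 0, C=20, H=2, W=3))   # 1: next channel is adjacent
print(chw16_offset(0, 0, 0, 1, C=20, H=2, W=3))   # 16: next pixel is one vector away
print(chw16_offset(0, 16, 0, 0, C=20, H=2, W=3))  # 96: next channel block
```

Passing `vec=32` gives the analogous indexing for a thirty-two-wide format over the same four dimensions.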
nmtSample::SoftmaxLikelihood::~SoftmaxLikelihood
~SoftmaxLikelihood() override=default
UffPoolPluginV2::~UffPoolPluginV2
virtual ~UffPoolPluginV2()
Definition: sampleUffPluginV2Ext.cpp:383
nvinfer1::plugin::Region::canBroadcastInputAcrossBatch
bool canBroadcastInputAcrossBatch(int inputIndex) const override
Definition: regionPlugin.cpp:438
nvinfer1::plugin::FlattenConcat::clone
IPluginV2Ext * clone() const override
Clone the plugin object.
Definition: flattenConcat.cpp:284
tests.optim_helper_test.TestQuantWeightInPlace.test_simple
def test_simple(self)
Definition: optim_helper_test.py:105
pytorch_quantization.calib.max.MaxCalibrator.__str__
def __str__(self)
Definition: max.py:93
half_float::detail::int2half
uint16 int2half(T value)
Convert integer to half-precision floating point.
Definition: ieee_half.h:737
export_onnx2trt.main
def main()
Definition: export_onnx2trt.py:48
util.test_cuda.TestDeviceBuffer.test_device_buffer_memcpy_async
def test_device_buffer_memcpy_async(self)
Definition: test_cuda.py:50
onnx_graphsurgeon.util.misc.SynchronizedList.extend
def extend(self, Sequence[object] iterable)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/util/misc.py:102
nvinfer1::DynamicPluginTensorDesc::max
Dims max
Upper bounds on tensor’s dimensions.
Definition: NvInferRuntime.h:396
nvinfer1::plugin::BatchedNMSBasePluginCreator::BatchedNMSBasePluginCreator
BatchedNMSBasePluginCreator()
Definition: batchedNMSPlugin.cpp:469
polygraphy.backend.pyt.runner
Definition: backend/pyt/runner.py:1
nmtSample::Projection::getOutputSize
virtual int getOutputSize()=0
Get the size of the raw logits vector.
polygraphy.tools.inspect.subtool.model.Model.__init__
def __init__(self)
Definition: tools/Polygraphy/polygraphy/tools/inspect/subtool/model.py:27
nvinfer1::IPluginV2::terminate
virtual void terminate()=0
Release resources acquired during plugin layer initialization.
SamplePlugin::mParams
samplesCommon::CaffeSampleParams mParams
The parameters for the sample.
Definition: samplePlugin.cpp:109
nvinfer1::ITensor::getDynamicRangeMax
virtual float getDynamicRangeMax() const =0
Get maximum of dynamic range.
tools.test_run.TestTrt.test_trt_custom_outputs
def test_trt_custom_outputs(self)
Definition: test_run.py:69
tools.test_inspect.run_inspect_model
def run_inspect_model(request)
Definition: test_inspect.py:27
nvonnxparser::IParser::destroy
virtual void destroy()=0
Destroy this object.
sample::LogStreamConsumer::severityOstream
static std::ostream & severityOstream(Severity severity)
Definition: samples/common/logging.h:160
MNISTBatchStream::mLabels
std::vector< float > mLabels
Definition: BatchStream.h:151
model.Tacotron2.decoder
decoder
Definition: demo/Tacotron2/tacotron2/model.py:600
nvinfer1::plugin::LogStream::Buf
Definition: checkMacrosPlugin.h:49
nvinfer1::plugin::BatchedNMSDynamicPluginCreator::createPlugin
IPluginV2DynamicExt * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: batchedNMSPlugin.cpp:569
polygraphy.backend.tf.loader.SaveGraph.engine_dir
engine_dir
Definition: backend/tf/loader.py:325
polygraphy.tools.args.onnx.loader.OnnxLoaderArgs.get_onnx_loader
def get_onnx_loader(self)
Definition: tools/args/onnx/loader.py:127
nvinfer1::TensorFormat::kCHW32
@ kCHW32
Thirty-two wide channel vectorized row major format.
nmtSample::SLPAttention::mKernelWeights
nvinfer1::Weights mKernelWeights
Definition: slpAttention.h:48
nvinfer1::PluginType::kSSDDETECTIONOUTPUT
@ kSSDDETECTIONOUTPUT
SSD DetectionOutput plugin.
tests.quant_rnn_test.TestQuantLSTM.test_basic_forward
def test_basic_forward(self, verbose)
Definition: quant_rnn_test.py:266
onnx_graphsurgeon.logger.logger.LoggerIndent.__enter__
def __enter__(self)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:32
polygraphy.tools.args.onnx.runner.OnnxtfRunnerArgs.onnx_loader_args
onnx_loader_args
Definition: tools/args/onnx/runner.py:25
pytorch_quantization.tensor_quant.ScaledQuantDescriptor.name
def name(self)
Definition: tensor_quant.py:136
bert::QKVToContextInterleavedPlugin::getOutputDimensions
nvinfer1::DimsExprs getOutputDimensions(int outputIndex, const nvinfer1::DimsExprs *inputs, int nbInputs, nvinfer1::IExprBuilder &exprBuilder) override
Definition: qkvToContextInt8InterleavedPlugin.cpp:102
sample::anonymous_namespace{sampleEngines.cpp}::UffBufferShutter
Definition: sampleEngines.cpp:51
SampleCharRNNBindingNames::CELL_OUT_BLOB_NAME
const char * CELL_OUT_BLOB_NAME
Definition: sampleCharRNN.cpp:91
polygraphy.tools.args.trt.loader.TrtLoaderArgs.exclude_outputs
exclude_outputs
Definition: tools/args/trt/loader.py:90
bert::Fused_multihead_attention_params::packed_mask_ptr
void * packed_mask_ptr
Definition: fused_multihead_attention.h:53
nmtSample::Decoder::addToModel
virtual void addToModel(nvinfer1::INetworkDefinition *network, nvinfer1::ITensor *inputData, nvinfer1::ITensor **inputStates, nvinfer1::ITensor **outputData, nvinfer1::ITensor **outputStates)=0
Add the memory, cell, and hidden states to the network.
nmtSample::SoftmaxLikelihood::SoftmaxLikelihoodCombinationOperator::combine
float combine(float rayLikelihood, float optionLikelihood) const override
Definition: softmaxLikelihood.cpp:82
sample::anonymous_namespace{sampleInference.cpp}::Iteration::recordEnqueueTime
void recordEnqueueTime()
Definition: sampleInference.cpp:402
model.LocationLayer.location_conv
location_conv
Definition: demo/Tacotron2/tacotron2/model.py:33
polygraphy.tools.args.tf2onnx.loader.Tf2OnnxLoaderArgs.register
def register(self, maker)
Definition: tools/args/tf2onnx/loader.py:29
nvinfer1::plugin::SpecialSlice::supportsFormat
bool supportsFormat(DataType type, PluginFormat format) const override
Check format support.
Definition: specialSlicePlugin.cpp:72
sample::InferenceTime::InferenceTime
InferenceTime(float q, float i, float c, float o, float e)
Definition: sampleReporting.h:36
nvinfer1::plugin::CudaError::CudaError
CudaError(const char *fl, const char *fn, int ln, int stat, const char *msg=nullptr)
Definition: checkMacrosPlugin.h:104
MaskRCNNUtils::PPM::buffer
std::vector< T > buffer
Definition: sampleUffMaskRCNN.cpp:76
tests.tensor_quant_test.TestFakeTensorQuant.test_cuda_ext
def test_cuda_ext(self)
Definition: tensor_quant_test.py:198
nvinfer1::plugin::SpecialSlice::SpecialSlice
SpecialSlice()
Definition: specialSlicePlugin.cpp:123
onnx_models.Model.opset
opset
Definition: onnx_models.py:40
nvonnxparser::ErrorCode::kINTERNAL_ERROR
@ kINTERNAL_ERROR
SampleOnnxMNIST::mOutputDims
nvinfer1::Dims mOutputDims
The dimensions of the output to the network.
Definition: sampleOnnxMNIST.cpp:72
nvinfer1::DimensionOperation
DimensionOperation
An operation on two IDimensionExpr, which represent integer expressions used in dimension computation...
Definition: NvInferRuntime.h:295
sample::defaultMinTiming
constexpr int defaultMinTiming
Definition: sampleOptions.h:37
nmtSample::DebugUtil::addDumpTensorToStream
static void addDumpTensorToStream(nvinfer1::INetworkDefinition *network, nvinfer1::ITensor *input, nvinfer1::ITensor **output, std::shared_ptr< std::ostream > out)
Definition: debugUtil.cpp:123
pytorch_quantization.quant_modules.QuantModuleReplacementHelper.orginal_func_map
orginal_func_map
Definition: quant_modules.py:56
nvinfer1::plugin::GridAnchorGenerator::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: gridAnchorPlugin.cpp:226
onnx_graphsurgeon.exporters.onnx_exporter.dtype_to_onnx
int dtype_to_onnx(np.dtype dtype)
Definition: onnx_exporter.py:27
nvinfer1::plugin::Reorg::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: reorgPlugin.cpp:110
nvinfer1::plugin::RegionPluginCreator::~RegionPluginCreator
~RegionPluginCreator() override=default
nvinfer1::plugin::FlattenConcatPluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: flattenConcat.cpp:306
anonymous_namespace{normalizePlugin.cpp}::NORMALIZE_PLUGIN_NAME
const char * NORMALIZE_PLUGIN_NAME
Definition: normalizePlugin.cpp:32
nmtSample::ComponentWeights
weights storage
Definition: componentWeights.h:31
nvinfer1::IAlgorithm::getAlgorithmIOInfo
virtual const IAlgorithmIOInfo & getAlgorithmIOInfo(int32_t index) const =0
Returns the format of an Algorithm input or output.
polygraphy.tools.args.trt.loader.TrtLoaderArgs.trt_min_shapes
trt_min_shapes
Definition: tools/args/trt/loader.py:92
nvinfer1::plugin::BatchTilePluginCreator::deserializePlugin
IPluginV2Ext * deserializePlugin(const char *name, const void *serialData, size_t serialLength) override
Called during deserialization of plugin layer.
Definition: batchTilePlugin.cpp:211
test_ir.TestGraph
Definition: test_ir.py:450
nvinfer1::plugin::PriorBox::destroy
void destroy() override
Destroy the plugin object.
Definition: priorBoxPlugin.cpp:244
nmtSample::LSTMEncoder::mRNNKind
bool mRNNKind
Definition: lstmEncoder.h:53
nvinfer1::ElementWiseOperation::kMIN
@ kMIN
Minimum of the two elements.
nvinfer1::plugin::InstanceNormalizationPlugin::_cudnn_handle
cudnnHandle_t _cudnn_handle
Definition: instanceNormalizationPlugin.h:97
nvinfer1::anonymous_namespace{NvInferRuntime.h}::createInferRefitter
IRefitter * createInferRefitter(ICudaEngine &engine, ILogger &logger)
Create an instance of an IRefitter class.
Definition: NvInferRuntime.h:2000
ParserOnnxConfig::setModelFileName
virtual void setModelFileName(const char *onnxFilename)
Set the Model File Name.
Definition: parserOnnxConfig.h:91
half_float::detail::unary_specialized::negate
static half negate(half arg)
Negation implementation.
Definition: samples/common/half.h:2516
polygraphy.tools.args.tf.config
Definition: tools/Polygraphy/polygraphy/tools/args/tf/config.py:1
nvinfer1::DynamicPluginTensorDesc::min
Dims min
Lower bounds on tensor’s dimensions.
Definition: NvInferRuntime.h:393
sample::setUpInference
bool setUpInference(InferenceEnvironment &iEnv, const InferenceOptions &inference)
Set up contexts and bindings for inference.
Definition: sampleInference.cpp:41
SampleUffFasterRcnnParams::nmsIouThresholdClassifier
float nmsIouThresholdClassifier
Definition: sampleUffFasterRCNN.cpp:73
helpers.tokenization.WordpieceTokenizer.__init__
def __init__(self, vocab, unk_token="[UNK]", max_input_chars_per_word=200)
Definition: tokenization.py:333
nvinfer1::IPluginV2Ext::~IPluginV2Ext
virtual ~IPluginV2Ext()
Definition: NvInferRuntimeCommon.h:654
SampleFasterRCNNParams::outputClsSize
int outputClsSize
The number of output classes.
Definition: sampleFasterRCNN.cpp:47
nvonnxparser::IParserError::code
virtual ErrorCode code() const =0
The error code.
export_tacotron2_onnx.DecoderIter.forward
def forward(self, decoder_input, attention_hidden, attention_cell, decoder_hidden, decoder_cell, attention_weights, attention_weights_cum, attention_context, memory, processed_memory, mask)
Definition: export_tacotron2_onnx.py:181
polygraphy.tools.args.tf.loader.TfLoaderArgs.save_tensorboard
save_tensorboard
Definition: tools/args/tf/loader.py:76
nvinfer1::OptProfileSelector::kMIN
@ kMIN
This is used to set or get the minimum permitted value for dynamic dimensions etc.
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer.scale
def scale(self)
Definition: tensor_quantizer.py:118
SampleReformatFreeIO::SampleUniquePtr
std::unique_ptr< T, samplesCommon::InferDeleter > SampleUniquePtr
Definition: sampleReformatFreeIO.cpp:207
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer.amax
def amax(self)
Definition: tensor_quantizer.py:126
sample::anonymous_namespace{sampleInference.cpp}::EnqueueImplicit::mBatch
int mBatch
Definition: sampleInference.cpp:207
test_func.TestExtend.value
value
Definition: test_func.py:121
pytorch_quantization.nn.modules.quant_pooling.QuantMaxPool2d.__init__
def __init__(self, kernel_size, stride=None, padding=0, dilation=1, return_indices=False, ceil_mode=False, **kwargs)
Definition: quant_pooling.py:51
pytorch_quantization.tensor_quant.ScaledQuantDescriptor.from_yaml
def from_yaml(cls, yaml_str)
Definition: tensor_quant.py:211
tests.quant_conv_transposed_test.TestQuantConvTranspose2D.test_fake_quant_per_channel_bias
def test_fake_quant_per_channel_bias(self)
Definition: quant_conv_transposed_test.py:190
export_tacotron2_onnx.DecoderIter.attention_rnn
attention_rnn
Definition: export_tacotron2_onnx.py:116
SampleReformatFreeIO::mDigit
int mDigit
Definition: sampleReformatFreeIO.cpp:272
sample::anonymous_namespace{sampleOptions.cpp}::checkEraseNegativeOption
bool checkEraseNegativeOption(Arguments &arguments, const std::string &option, bool &value)
Definition: sampleOptions.cpp:192
polygraphy.backend.trt.loader.SaveEngine.__call__
def __call__(self)
Definition: backend/trt/loader.py:505
comparator.test_struct.TestRunResults.test_values
def test_values(self, run_results)
Definition: comparator/test_struct.py:40
bert::FusedMultiHeadAttentionKernelMetaInfoV2::mS
unsigned int mS
Definition: fused_multihead_attention_v2.h:161
SampleMovieLensArgs::dlaCore
int dlaCore
Definition: sampleMovieLens.cpp:526
polygraphy.tools.args.onnx.loader.OnnxLoaderArgs._outputs
_outputs
Definition: tools/args/onnx/loader.py:26
nvinfer1::IPluginExt::~IPluginExt
virtual ~IPluginExt()
Definition: NvInferRuntime.h:273
nvuffparser::UffInputOrder::kNCHW
@ kNCHW
NCHW order.
nmtSample::Context::Context
Context()=default
sample::anonymous_namespace{sampleEngines.cpp}::RndInt8Calibrator::mBatches
int mBatches
Definition: sampleEngines.cpp:183
utils.to_gpu
def to_gpu(x)
Definition: demo/Tacotron2/common/utils.py:50
nvinfer1::plugin::LReLU::getWorkspaceSize
size_t getWorkspaceSize(int maxBatchSize) const override
Definition: lReluPlugin.cpp:105
nvcaffeparser1::CaffeParser::mDeploy
std::shared_ptr< trtcaffe::NetParameter > mDeploy
Definition: caffeParser.h:82
test_func.TestExtend.test_extend_named_parameters
def test_extend_named_parameters(self)
Definition: test_func.py:33
polygraphy.backend.trt_legacy.LoadUffFile.outputs
outputs
Definition: backend/trt_legacy.py:39
test_ir.toposort_multi_tier_output_graph
def toposort_multi_tier_output_graph()
Definition: test_ir.py:405
nvinfer1::plugin::NormalizePluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: normalizePlugin.cpp:267
samplesCommon::BufferManager::BufferManager
BufferManager(std::shared_ptr< nvinfer1::ICudaEngine > engine, const int batchSize=0, const nvinfer1::IExecutionContext *context=nullptr)
Create a BufferManager for handling buffer interactions with engine.
Definition: buffers.h:245
SampleMaskRCNN::mPPMs
std::vector< MaskRCNNUtils::PPM< uint8_t > > mPPMs
Definition: sampleUffMaskRCNN.cpp:338
polygraphy.backend.tf.loader.SessionFromGraph.__init__
def __init__(self, graph, config=None)
Definition: backend/tf/loader.py:393
pytorch_quantization.calib.histogram.HistogramCalibrator._num_bins
_num_bins
Definition: histogram.py:46
onnx_graphsurgeon.exporters.onnx_exporter.OnnxExporter
Definition: onnx_exporter.py:30
nvcaffeparser1::bnConvertWrap
bool bnConvertWrap(float scaleFactor, const Weights &variance, const Weights &mean, const Weights &scaleBlob, const Weights &biasBlob, Weights &shift, Weights &scale, float eps, bool nvCaffe, CaffeWeightFactory &weightFactory)
Definition: parseBatchNorm.cpp:24
onnx_graphsurgeon.logger.logger.LoggerIndent
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:26
tests.calibrator_test.TestPercentileCalibrator.test_unsigned_one_tensor
def test_unsigned_one_tensor(self, verbose)
Definition: calibrator_test.py:265
onnx_graphsurgeon.logger.logger.LoggerSuppress.__enter__
def __enter__(self)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:47
SampleCharRNNBase
The SampleCharRNNBase class implements the char_rnn sample.
Definition: sampleCharRNN.cpp:142
SampleCharRNNParams::layerCount
int layerCount
Definition: sampleCharRNN.cpp:116
nvinfer1::plugin::PyramidROIAlign::getWorkspaceSize
size_t getWorkspaceSize(int) const override
Definition: pyramidROIAlignPlugin.cpp:107
polygraphy.tools.args.comparator.ComparatorCompareArgs.add_to_parser
def add_to_parser(self, parser)
Definition: tools/args/comparator.py:69
onnx_models.Model.__str__
def __str__(self)
Definition: onnx_models.py:88
nvinfer1::PluginFieldType::kINT8
@ kINT8
INT8 field type.
nmtSample::TextWriter::finalize
void finalize() override
Called right after inference ends.
Definition: textWriter.cpp:38
anonymous_namespace{proposalPlugin.cpp}::PROPOSAL_PLUGIN_VERSION
static const char * PROPOSAL_PLUGIN_VERSION
Definition: proposalPlugin.cpp:32
pytorch_quantization.nn.modules.quant_rnn.QuantLSTMCell.bias_hh
bias_hh
Definition: modules/quant_rnn.py:394
nvinfer1::PluginTensorDesc::scale
float scale
Definition: NvInferRuntimeCommon.h:372
builder.set_output_name
def set_output_name(layer, prefix, name, out_idx=0)
Definition: builder.py:109
sample::TrtCudaStream::wait
void wait(TrtCudaEvent &event)
Definition: sampleDevice.h:156
tests.quant_pooling_test.TestQuantAdaptiveAvgPool1d
Definition: quant_pooling_test.py:218
nvinfer1::plugin::Reorg::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: reorgPlugin.cpp:105
nvinfer1::UnaryOperation::kFLOOR
@ kFLOOR
Floor.
bert::FusedMultiHeadAttentionKernelMetaInfoV1::mDataType
Data_type mDataType
Definition: fused_multihead_attention.h:117
nvinfer1::plugin::ProposalPlugin::mRpnWidth
size_t mRpnWidth
Definition: proposalPlugin.h:112
tests.optim_helper_test.TestGroupParameters.test_lr_momentum_decay
def test_lr_momentum_decay(self, resnet18)
Definition: optim_helper_test.py:57
FCPlugin::read
void read(const char *&buffer, T &val)
Definition: samples/opensource/samplePlugin/fcPlugin.h:237
nvinfer1::plugin::Reorg::C
int C
Definition: reorgPlugin.h:87
nvinfer1::IPluginV2DynamicExt::getTensorRTVersion
int32_t getTensorRTVersion() const
Return the API version with which this plugin was built.
Definition: NvInferRuntime.h:534
nvinfer1::ITensor::setDynamicRange
virtual bool setDynamicRange(float min, float max)=0
Set dynamic range for the tensor.
nvinfer1::plugin::RPROIPlugin::anchorsRatiosHost
float * anchorsRatiosHost
Definition: nvFasterRCNNPlugin.h:100
onnx_graphsurgeon.ir.tensor.Tensor.values
values
Definition: tensor.py:70
bert::EmbLayerNormVarSeqlenPlugin::getOutputDataType
nvinfer1::DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: embLayerNormVarSeqlenPlugin.cpp:324
polygraphy.tools.args.data_loader.DataLoaderArgs.float_range
float_range
Definition: tools/args/data_loader.py:61
sample::TrtCudaGraph::TrtCudaGraph
TrtCudaGraph()=default
nvinfer1::plugin::PyramidROIAlignPluginCreator::mFC
static PluginFieldCollection mFC
Definition: pyramidROIAlignPlugin.h:122
bert::Fused_multihead_attention_params_v2::qkv_stride_in_bytes
int64_t qkv_stride_in_bytes
Definition: fused_multihead_attention_v2.h:35
nvinfer1::plugin::BatchTilePluginCreator::createPlugin
IPluginV2Ext * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: batchTilePlugin.cpp:204
tests.tensor_quant_test.TestFakeTensorQuant.test_unsigned
def test_unsigned(self)
Definition: tensor_quant_test.py:191
trt_utils.is_dimension_dynamic
def is_dimension_dynamic(dim)
Definition: trt_utils.py:20
sample::Binding::fill
void fill(const std::string &fileName)
Definition: sampleUtils.h:222
nvinfer1::plugin::Normalize::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: normalizePlugin.cpp:81
sample::Logger::TestAtom::mStarted
bool mStarted
Definition: samples/common/logging.h:281
test_ir.TestGraph.test_cleanup_graph_input_producers
def test_cleanup_graph_input_producers(self)
Definition: test_ir.py:653
SampleINT8APIPreprocessing
Definition: sampleINT8API.cpp:44
SampleFasterRCNN::SampleFasterRCNN
SampleFasterRCNN(const SampleFasterRCNNParams &params)
Definition: sampleFasterRCNN.cpp:61
nvinfer1::DataType::kINT32
@ kINT32
Signed 32-bit integer format.
sample::LogStreamConsumer::setReportableSeverity
void setReportableSeverity(Severity reportableSeverity)
Definition: samples/common/logging.h:153
pytorch_quantization.nn.modules
Definition: tools/pytorch-quantization/pytorch_quantization/nn/modules/__init__.py:1
bert::EmbLayerNormVarSeqlenPluginCreator::createPlugin
nvinfer1::IPluginV2 * createPlugin(const char *name, const nvinfer1::PluginFieldCollection *fc) override
Return a plugin object.
Definition: embLayerNormVarSeqlenPlugin.cpp:440
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer.extra_repr
def extra_repr(self)
Definition: tensor_quantizer.py:362
onnx_graphsurgeon.ir.tensor.Constant.__init__
def __init__(self, str name, np.ndarray values)
Definition: tensor.py:187
PluginFactory::isPluginExt
bool isPluginExt(const char *name) override
A user implemented function that determines if a layer configuration is provided by an IPluginExt.
Definition: samples/opensource/samplePlugin/fcPlugin.h:330
test_func.TestExtend.test_extend_0_args_2_rv
def test_extend_0_args_2_rv(self)
Definition: test_func.py:57
nvinfer1::plugin::Region::H
int H
Definition: regionPlugin.h:97
half_float::detail::asinh
expr asinh(half arg)
Hyperbolic area sine.
Definition: ieee_half.h:2494
tests.quant_conv_transposed_test.TestQuantConvTranspose1D.test_fake_quant_per_tensor
def test_fake_quant_per_tensor(self)
Definition: quant_conv_transposed_test.py:412
polygraphy.tools.args.onnxrt.runner
Definition: tools/args/onnxrt/runner.py:1
test_func.TestExtend
Definition: test_func.py:19
nvinfer1::plugin::ProposalPlugin::destroy
void destroy() override
Destroy the plugin object.
Definition: proposalPlugin.cpp:250
helpers.data_processing.convert_doc_tokens
def convert_doc_tokens(paragraph_text)
Definition: data_processing.py:26
sample::anonymous_namespace{sampleInference.cpp}::Iteration::wait
void wait(TrtCudaEvent &gpuStart)
Definition: sampleInference.cpp:366
pytorch_quantization.utils.quant_logging.reset_logger_handler
def reset_logger_handler()
Definition: quant_logging.py:22
builder_varseqlen.emb_layernorm
def emb_layernorm(builder, network, config, weights_dict, builder_config, max_sequence_length, max_batch_size)
Definition: builder_varseqlen.py:510
polygraphy.backend.trt_legacy.LoadNetworkFromUff.__call__
def __call__(self)
Definition: backend/trt_legacy.py:87
pytorch_quantization.nn.modules.quant_rnn.GRU
Definition: modules/quant_rnn.py:299
onnx_graphsurgeon.ir.graph.UnseenTensor.seen_tensors
seen_tensors
Definition: graph.py:32
bert::fused_multihead_attention_v2_fp16_64_64_kernel_sm75_cubin
unsigned char fused_multihead_attention_v2_fp16_64_64_kernel_sm75_cubin[]
Definition: fused_multihead_attention_v2_fp16_64_64_kernel.sm75.cpp:19
nvinfer1::UnaryOperation::kABS
@ kABS
Absolute value.
nvinfer1::CUDADriverWrapper::handle
void * handle
Definition: cudaDriverWrapper.h:69
nmtSample::BenchmarkWriter::write
void write(const int *hOutputData, int actualOutputSequenceLength, int actualInputSequenceLength) override
Write the generated sequence.
Definition: benchmarkWriter.cpp:32
SampleSSDParams::outputClsSize
int outputClsSize
The number of output classes.
Definition: sampleSSD.cpp:49
helpers.tokenization.FullTokenizer.tokenize
def tokenize(self, text)
Definition: tokenization.py:169
sample::TrtCudaGraph::beginCapture
void beginCapture(TrtCudaStream &stream)
Definition: sampleDevice.h:186
nmtSample::Alignment::ptr
std::shared_ptr< Alignment > ptr
Definition: alignment.h:35
sample::anonymous_namespace{sampleEngines.cpp}::RndInt8Calibrator::mCacheFile
std::string mCacheFile
Definition: sampleEngines.cpp:185
sample::AllOptions::helps
bool helps
Definition: sampleOptions.h:193
mrcnn_to_trt_single.InferenceConfig
Definition: mrcnn_to_trt_single.py:93
samplesCommon::GenericBuffer::mBuffer
void * mBuffer
Definition: buffers.h:174
nvinfer1::ScaleMode::kCHANNEL
@ kCHANNEL
Per-channel coefficients.
nvinfer1::plugin::softmaxTree
softmaxTree
Definition: NvInferPluginUtils.h:220
bert::fused_multihead_attention_v2_fp16_128_64_kernel_sm75_cubin_len
unsigned int fused_multihead_attention_v2_fp16_128_64_kernel_sm75_cubin_len
Definition: fused_multihead_attention_v2_fp16_128_64_kernel.sm75.cpp:6578
builder_varseqlen.BertConfig.num_hidden_layers
num_hidden_layers
Definition: builder_varseqlen.py:96
nvinfer1::plugin::SpecialSlice::getOutputDataType
DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: specialSlicePlugin.cpp:172
BatchStream::mBatchSize
int mBatchSize
Definition: BatchStream.h:373
classification_flow.prepare_model
def prepare_model(model_name, data_dir, per_channel_quantization, batch_size_train, batch_size_test, batch_size_onnx, calibrator, pretrained=True, ckpt_path=None, ckpt_url=None)
Definition: classification_flow.py:91
nvinfer1::plugin::InstanceNormalizationPlugin::detachFromContext
void detachFromContext() override
Detach the plugin object from its execution context.
Definition: instanceNormalizationPlugin.cpp:290
tests.quant_rnn_test.TestQuantLSTMCell.test_quant_input_hidden_bias
def test_quant_input_hidden_bias(self, verbose)
Definition: quant_rnn_test.py:193
bert::QKVToContextInterleavedPluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: qkvToContextInt8InterleavedPlugin.cpp:287
nvinfer1::plugin::FlattenConcat::getWorkspaceSize
size_t getWorkspaceSize(int) const override
Definition: flattenConcat.cpp:128
nmtSample::Encoder::ptr
std::shared_ptr< Encoder > ptr
Definition: encoder.h:36
sample::Logger::testResultString
static const char * testResultString(TestResult result)
Returns an appropriate string for prefixing a test result message with the given result.
Definition: samples/common/logging.h:395
isolate.outputs
outputs
Definition: isolate.py:55
nvinfer1::plugin::Region::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: regionPlugin.cpp:239
SampleUffPluginV2Ext::SampleUniquePtr
std::unique_ptr< T, samplesCommon::InferDeleter > SampleUniquePtr
Definition: sampleUffPluginV2Ext.cpp:192
polygraphy.common.struct.TensorMetadata.__repr__
def __repr__(self)
Definition: common/struct.py:44
onnx_graphsurgeon.util.misc.SynchronizedList.pop
def pop(self, i=-1)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/util/misc.py:117
nvinfer1::plugin::GridAnchorGenerator::canBroadcastInputAcrossBatch
bool canBroadcastInputAcrossBatch(int inputIndex) const override
Definition: gridAnchorPlugin.cpp:316
polygraphy.tools.args.logger.LoggerArgs.add_to_script
def add_to_script(self, script)
Definition: Polygraphy/polygraphy/tools/args/logger.py:40
nvinfer1::plugin::ResizeNearest::configurePlugin
void configurePlugin(const Dims *inputDims, int nbInputs, const Dims *outputDims, int nbOutputs, const DataType *inputTypes, const DataType *outputTypes, const bool *inputIsBroadcast, const bool *outputIsBroadcast, PluginFormat floatFormat, int maxBatchSize) override
Definition: resizeNearestPlugin.cpp:240
bert::fused_multihead_attention_v2_int8_384_64_kernel_cubin_len
unsigned int fused_multihead_attention_v2_int8_384_64_kernel_cubin_len
Definition: fused_multihead_attention_v2_int8_384_64_kernel.sm72.cpp:19647
nvcaffeparser1::CaffeParser::parseLReLUParam
std::vector< nvinfer1::PluginField > parseLReLUParam(const trtcaffe::LayerParameter &msg, CaffeWeightFactory &weightFactory, BlobNameToTensor &tensors)
Definition: caffeParser.cpp:236
ProposalWorkSpace::sortClassSampleIdxDims
nvinfer1::DimsHW sortClassSampleIdxDims
Definition: maskRCNNKernels.h:134
nvinfer1::PluginType::kCONCAT
@ kCONCAT
Concat plugin.
nvonnxparser::IParser::parseFromFile
virtual bool parseFromFile(const char *onnxModelFile, int verbosity)=0
Parse an ONNX model file, which can be a binary protobuf or a text ONNX model. Calls the parse method internally.
SampleCharRNNParams::charMaps
SampleCharRNNMaps charMaps
Definition: sampleCharRNN.cpp:127
nvinfer1::UnaryOperation::kASINH
@ kASINH
Inverse hyperbolic sine.
nvcaffeparser1::CaffeWeightFactory::getNullWeights
nvinfer1::Weights getNullWeights()
Definition: caffeWeightFactory.cpp:194
bert::FusedMultiHeadAttentionXMMAKernelV2
Definition: fused_multihead_attention_v2.h:421
samplesCommon::SampleParams::outputTensorNames
std::vector< std::string > outputTensorNames
Definition: argsParser.h:44
onnx_graphsurgeon.ir.tensor.Tensor.__setattr__
def __setattr__(self, name, value)
Definition: tensor.py:35
anonymous_namespace{pyramidROIAlignPlugin.cpp}::PYRAMIDROIALGIN_PLUGIN_NAME
const char * PYRAMIDROIALGIN_PLUGIN_NAME
Definition: pyramidROIAlignPlugin.cpp:29
bert::SkipLayerNormInterleavedPlugin::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: skipLayerNormInt8InterleavedPlugin.cpp:262
nvinfer1::ITensor::setName
virtual void setName(const char *name)=0
Set the tensor name.
sample::Bindings::addBinding
void addBinding(int b, const std::string &name, bool isInput, int volume, nvinfer1::DataType dataType, const std::string &fileName="")
Definition: sampleUtils.h:307
nvinfer1::INetworkDefinition::addConcatenation
virtual IConcatenationLayer * addConcatenation(ITensor *const *inputs, int32_t nbInputs)=0
Add a concatenation layer to the network.
polygraphy.backend.onnxrt.loader.SessionFromOnnxBytes.__init__
def __init__(self, model_bytes)
Definition: backend/onnxrt/loader.py:23
nvinfer1::plugin::RPROIPlugin::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: nvFasterRCNNPlugin.cpp:122
nvcaffeparser1::ICaffeParser::destroy
virtual void destroy()=0
Destroy this ICaffeParser object.
nvinfer1::plugin::MultilevelProposeROI::mValidCnt
std::shared_ptr< CudaBind< int > > mValidCnt
Definition: multilevelProposeROIPlugin.h:103
tests.quant_utils_test.TestQuantUtils
Definition: quant_utils_test.py:30
AlgorithmCacheReader::AlgorithmCacheItem
Definition: sampleAlgorithmSelector.cpp:235
bert::Fused_multihead_attention_params_v2::b
int b
Definition: fused_multihead_attention_v2.h:56
nvinfer1::plugin::ProposalPlugin::mNmsIouThreshold
float mNmsIouThreshold
Definition: proposalPlugin.h:116
tests.tensor_quant_test.TestTensorQuant.test_unsigned
def test_unsigned(self)
Definition: tensor_quant_test.py:99
nvinfer1::plugin::MultilevelProposeROI::mBackgroundLabel
int mBackgroundLabel
Definition: multilevelProposeROIPlugin.h:94
nvinfer1::CUDADriverWrapper::_cuModuleLoadData
CUresult(* _cuModuleLoadData)(CUmodule *, const void *)
Definition: cudaDriverWrapper.h:76
backend.trt.test_loader.TestSaveEngine
Definition: trt/test_loader.py:270
inference.print_single_query
def print_single_query(eval_time_elapsed, prediction, nbest_json)
Definition: BERT/inference.py:216
polygraphy.backend.trt_legacy.TrtLegacyRunner.engine_path
engine_path
Definition: backend/trt_legacy.py:221
SamplePlugin::infer
bool infer()
Runs the TensorRT inference engine for this sample.
Definition: samplePlugin.cpp:233
nvinfer1::plugin::CropAndResizePlugin::initialize
int initialize() override
Initialize the layer for execution.
Definition: cropAndResizePlugin.cpp:126
bert::fused_multihead_attention_v2_int8_192_64_kernel_cubin_len
unsigned int fused_multihead_attention_v2_int8_192_64_kernel_cubin_len
Definition: fused_multihead_attention_v2_int8_192_64_kernel.sm72.cpp:15999
bert::QKVToContextInterleavedPluginCreator::getFieldNames
const nvinfer1::PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: qkvToContextInt8InterleavedPlugin.cpp:292
nmtSample::Likelihood::addToModel
virtual void addToModel(nvinfer1::INetworkDefinition *network, int beamWidth, nvinfer1::ITensor *inputLogits, nvinfer1::ITensor *inputLikelihoods, nvinfer1::ITensor **newCombinedLikelihoods, nvinfer1::ITensor **newRayOptionIndices, nvinfer1::ITensor **newVocabularyIndices)=0
Add the calculation of likelihood and TopK indices to the network.
polygraphy.backend.tf.register_callback
def register_callback()
Definition: tools/Polygraphy/polygraphy/backend/tf/__init__.py:5
backend.trt.test_calibrator.TestCalibrator.test_calibrator_outside_polygraphy
def test_calibrator_outside_polygraphy(self, identity_builder_network)
Definition: test_calibrator.py:68
export_tacotron2_onnx.Encoder.forward
def forward(self, sequence, sequence_lengths)
Definition: export_tacotron2_onnx.py:70
SampleBuffer::~SampleBuffer
~SampleBuffer()
Definition: sampleReformatFreeIO.cpp:155
IBatchStream::next
virtual bool next()=0
Args::batchSize
std::vector< int > batchSize
Definition: demo/BERT/infer_c/common.h:30
sample::anonymous_namespace{sampleOptions.cpp}::printTacticSources
std::ostream & printTacticSources(std::ostream &os, nvinfer1::TacticSources enabledSources, nvinfer1::TacticSources disabledSources)
Definition: sampleOptions.cpp:340
nvinfer1::plugin::BatchTilePluginCreator::setPluginNamespace
void setPluginNamespace(const char *libNamespace) override
Set the namespace of the plugin creator based on the plugin library it belongs to.
Definition: batchTilePlugin.h:99
nvinfer1::IExecutionContext::getBindingDimensions
virtual Dims getBindingDimensions(int32_t bindingIndex) const noexcept=0
Get the dynamic dimensions of a binding.
polygraphy.tools.precision.precision.STCheckerBase.layer_indices
def layer_indices(self, num_layers)
Definition: precision.py:90
anonymous_namespace{serialize.hpp}::Serializer< const char * >::serialize
static void serialize(void **buffer, const char *value)
Definition: serialize.hpp:72
nmtSample::MultiplicativeAlignment::addToModel
void addToModel(nvinfer1::INetworkDefinition *network, nvinfer1::ITensor *attentionKeys, nvinfer1::ITensor *queryStates, nvinfer1::ITensor **alignmentScores) override
add the alignment scores calculation to the network
Definition: multiplicativeAlignment.cpp:38
polygraphy.comparator.data_loader.DataLoader.int_range
int_range
Definition: comparator/data_loader.py:89
SampleFasterRCNN::build
bool build()
Function builds the network engine.
Definition: sampleFasterRCNN.cpp:132
polygraphy.tools.args.data_loader.DataLoaderArgs.int_range
int_range
Definition: tools/args/data_loader.py:60
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer._quant_forward
def _quant_forward(self, inputs)
Definition: tensor_quantizer.py:299
sample::operator<<
std::ostream & operator<<(std::ostream &os, const BaseModelOptions &options)
Definition: sampleOptions.cpp:836
sample::InferenceTrace
Measurement points in milliseconds.
Definition: sampleReporting.h:69
trt_utils.run_trt_engine
def run_trt_engine(context, engine, tensors)
Definition: trt_utils.py:28
nvinfer1::plugin::GridAnchorBasePluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: gridAnchorPlugin.h:124
nvinfer1::plugin::FlattenConcat::isOutputBroadcastAcrossBatch
bool isOutputBroadcastAcrossBatch(int outputIndex, const bool *inputIsBroadcasted, int nbInputs) const override
Definition: flattenConcat.cpp:198
nmtSample::Likelihood::~Likelihood
~Likelihood() override=default
nvinfer1::plugin::MultilevelProposeROI::mTempScores
std::vector< std::shared_ptr< CudaBind< float > > > mTempScores
Definition: multilevelProposeROIPlugin.h:107
nvinfer1::PluginVersion::kV2_IOEXT
@ kV2_IOEXT
IPluginV2Ext.
nvinfer1::plugin::DetectionOutputParameters::varianceEncodedInTarget
bool varianceEncodedInTarget
Definition: NvInferPluginUtils.h:183
nmtSample::DebugUtil::DumpTensorPlugin::initialize
int initialize() override
Initialize the layer for execution.
Definition: debugUtil.cpp:70
nvinfer1::plugin::Reorg::destroy
void destroy() override
Destroy the plugin object.
Definition: reorgPlugin.cpp:115
backend.onnxrt.test_runner.TestOnnxrtRunner.test_can_name_runner
def test_can_name_runner(self)
Definition: onnxrt/test_runner.py:31
pytorch_quantization.nn.functional
Definition: functional.py:1
nmtSample::SLPProjection::mInputChannelCount
int mInputChannelCount
Definition: slpProjection.h:53
nvonnxparser::ErrorCode::kINVALID_NODE
@ kINVALID_NODE
polygraphy.util.format.DataFormat
Definition: format.py:22
tests.tensor_quantizer_test.TestTensorQuantizer.test_setters
def test_setters(self)
Definition: tensor_quantizer_test.py:238
TLTMaskRCNNConfig::MODEL_OUTPUTS
static const std::vector< std::string > MODEL_OUTPUTS
Definition: tlt_mrcnn_config.h:173
polygraphy.tools.surgeon.surgeon.Surgeon.add_parser_args
def add_parser_args(self, parser)
Definition: surgeon.py:29
nvinfer1::plugin::PyramidROIAlign::mNameSpace
std::string mNameSpace
Definition: pyramidROIAlignPlugin.h:101
nvinfer1::plugin::LReLU::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: lReluPlugin.cpp:46
nvinfer1::plugin::PriorBox::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: priorBoxPlugin.h:47
nvcaffeparser1::CaffeParser::mModel
std::shared_ptr< trtcaffe::NetParameter > mModel
Definition: caffeParser.h:83
SampleMLP::mEngine
std::shared_ptr< nvinfer1::ICudaEngine > mEngine
The TensorRT engine used to run the network.
Definition: sampleMLP.cpp:92
SampleErrorRecorder::getErrorCode
ErrorCode getErrorCode(int32_t errorIdx) const noexcept final
Returns the ErrorCode enumeration.
Definition: ErrorRecorder.h:50
nvinfer1::plugin::DetectionLayer::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: detectionLayerPlugin.cpp:237
polygraphy.tools.args.onnx.loader
Definition: tools/args/onnx/loader.py:1
nvinfer1::plugin::InstanceNormalizationPlugin::enqueue
int enqueue(const nvinfer1::PluginTensorDesc *inputDesc, const nvinfer1::PluginTensorDesc *outputDesc, const void *const *inputs, void *const *outputs, void *workspace, cudaStream_t stream) override
Execute the layer.
Definition: instanceNormalizationPlugin.cpp:172
nvinfer1::plugin::ProposalLayerPluginCreator
Definition: proposalLayerPlugin.h:116
PluginVersion
Definition of plugin versions.
nvinfer1::plugin::CoordConvACPlugin::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: coordConvACPlugin.cpp:155
nvinfer1::plugin::DetectionOutput::setPluginNamespace
void setPluginNamespace(const char *pluginNamespace) override
Set the namespace that this plugin object belongs to.
Definition: nmsPlugin.cpp:177
nvcaffeparser1::shutdownProtobufLibrary
void shutdownProtobufLibrary()
Shuts down protocol buffers library.
Definition: NvCaffeParser.cpp:22
OutputArgs
Definition: sampleMovieLensMPS.cpp:155
tools.common.run_polygraphy_precision
def run_polygraphy_precision(additional_opts=[], disable_verbose=False)
Definition: tools/common.py:52
nvonnxparser::anonymous_namespace{NvOnnxParser.h}::createParser
IParser * createParser(nvinfer1::INetworkDefinition &network, nvinfer1::ILogger &logger)
Create a new parser object.
Definition: NvOnnxParser.h:260
nvinfer1::plugin::PriorBoxParameters::imgW
int32_t imgW
Definition: NvInferPluginUtils.h:98
nvinfer1::ITensor::getDimensions
virtual Dims getDimensions() const =0
Get the dimensions of a tensor.
bert::packedMaskSize256
constexpr size_t packedMaskSize256
Definition: embLayerNormVarSeqlenPlugin.cpp:42
layers.TacotronSTFT.stft_fn
stft_fn
Definition: layers.py:62
tests.print_test.TestPrint.test_print_module
def test_print_module(self)
Definition: print_test.py:43
polygraphy.tools.inspect.subtool.data.Data.run
def run(self, args)
Definition: data.py:39
nmtSample
Definition: component.h:23
nvinfer1::plugin::SplitPlugin::_nz
int _nz
Definition: split.h:42
sample::anonymous_namespace{sampleInference.cpp}::Iteration::mEnqueueTimes
std::vector< EnqueueTimes > mEnqueueTimes
Definition: sampleInference.cpp:471
polygraphy.common.func.extend
def extend(extend_func)
Definition: func.py:25
nvuffparser::FieldType
FieldType
The possible field types for custom layer.
Definition: NvUffParser.h:57
nvinfer1::plugin::Normalize::acrossSpatial
bool acrossSpatial
Definition: normalizePlugin.h:98
nvinfer1::DimensionOperation::kMIN
@ kMIN
Minimum of the two operands.
polygraphy.tools.run.run.add_runner_args
def add_runner_args(parser)
Definition: run.py:36
sample::AllOptions::model
ModelOptions model
Definition: sampleOptions.h:188
tests.optim_helper_test.TestMatchParameters.test_single_key
def test_single_key(self, resnet18)
Definition: optim_helper_test.py:35
sample::BuildOptions::outputFormats
std::vector< IOFormat > outputFormats
Definition: sampleOptions.h:126
ParserOnnxConfig::setFullTextFileName
virtual void setFullTextFileName(const char *fullTextFilename)
Set the File Name of the Network Description as a Text File, including the weights.
Definition: parserOnnxConfig.h:124
nvonnxparser::IParserError::func
virtual const char * func() const =0
source function in which the error occurred
BertInference::mCuSeqlens
std::vector< int > mCuSeqlens
Definition: bert_infer.h:344
nmtSample::TextWriter::write
void write(const int *hOutputData, int actualOutputSequenceLength, int actualInputSequenceLength) override
write the generated sequence
Definition: textWriter.cpp:30
nvinfer1::plugin::MultilevelProposeROI::mType
nvinfer1::DataType mType
Definition: multilevelProposeROIPlugin.h:114
nvinfer1::PluginField::data
const void * data
Plugin field attribute data.
Definition: NvInferRuntimeCommon.h:856
nvinfer1::plugin::MultilevelCropAndResize::attachToContext
void attachToContext(cudnnContext *cudnnContext, cublasContext *cublasContext, IGpuAllocator *gpuAllocator) override
Attach the plugin object to an execution context and grant the plugin the access to some context reso...
Definition: multilevelCropAndResizePlugin.cpp:294
polygraphy.comparator.compare.CompareFunc.basic_compare_func
def basic_compare_func(check_shapes=None, rtol=None, atol=None, fail_fast=None, find_output_func=None)
Definition: compare.py:64
export_tacotron2_onnx.encoder_infer
def encoder_infer(self, x, input_lengths)
Definition: export_tacotron2_onnx.py:43
bert::QKVToContextInterleavedPluginCreator::createPlugin
nvinfer1::IPluginV2 * createPlugin(const char *name, const nvinfer1::PluginFieldCollection *fc) override
Return a plugin object.
Definition: qkvToContextInt8InterleavedPlugin.cpp:297
SamplePlugin::processInput
bool processInput(const samplesCommon::BufferManager &buffers, const std::string &inputTensorName, int inputFileIdx) const
Reads the input and mean data, preprocesses, and stores the result in a managed buffer.
Definition: samplePlugin.cpp:294
SampleConfig::getTextFileName
const char * getTextFileName() const
Returns the File Name of the Network Description as a Text File.
Definition: sampleConfig.h:137
nvuffparser::FieldType::kUNKNOWN
@ kUNKNOWN
nvinfer1::plugin::CoordConvACPluginCreator::mNamespace
std::string mNamespace
Definition: coordConvACPlugin.h:114
SampleCharRNNBindingNames
Definition: sampleCharRNN.cpp:85
half_float::detail::sin
expr sin(half arg)
Sine function.
Definition: ieee_half.h:2417
sample::broadcastIOFormats
bool broadcastIOFormats(const std::vector< IOFormat > &formats, size_t nbBindings, bool isInput=true)
Definition: sampleUtils.h:468
backend.onnx.test_runner.TestOnnxTfRunner
Definition: onnx/test_runner.py:21
nvcaffeparser1::IPluginFactoryV2::~IPluginFactoryV2
virtual ~IPluginFactoryV2()
Definition: NvCaffeParser.h:158
nvinfer1::plugin::RPROIPlugin::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: nvFasterRCNNPlugin.cpp:249
SampleINT8API::mParams
SampleINT8APIParams mParams
Stores Sample Parameter.
Definition: sampleINT8API.cpp:103
nvinfer1::plugin::FlattenConcatPluginCreator::mConcatAxisID
int mConcatAxisID
Definition: flattenConcat.h:128
tests.clip_test.TestClip.test_raise
def test_raise(self)
Definition: clip_test.py:46
nvinfer1::WeightsRole::kCONSTANT
@ kCONSTANT
weights for IConstantLayer
sample::anonymous_namespace{sampleOptions.cpp}::boolToEnabled
const char * boolToEnabled(bool enable)
Definition: sampleOptions.cpp:171
nvcaffeparser1::IBlobNameToTensor::find
virtual nvinfer1::ITensor * find(const char *name) const =0
Given a blob name, returns a pointer to an ITensor object.
nvinfer1::Permutation::order
int32_t order[Dims::MAX_DIMS]
The elements of the permutation.
Definition: NvInfer.h:3734
half_float::detail::functions::ceil
static half ceil(half arg)
Ceiling implementation.
Definition: samples/common/half.h:2086
nvinfer1::LayerType::kRNN_V2
@ kRNN_V2
RNNv2 layer.
nmtSample::BeamSearchPolicy
processes the results of one iteration of the generator with beam search and produces input for the n...
Definition: beamSearchPolicy.h:33
tests.quant_pooling_test.TestQuantAdaptiveAvgPool2d
Definition: quant_pooling_test.py:239
nvinfer1::plugin::GroupNormalizationPluginCreator::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin creator object.
Definition: groupNormalizationPlugin.cpp:275
polygraphy.util.misc.log_module_info
def log_module_info(module, name=None, severity=G_LOGGER.VERBOSE)
Definition: Polygraphy/polygraphy/util/misc.py:401
tools.test_run.TestTrtLegacy.test_trt_legacy_uff
def test_trt_legacy_uff(self)
Definition: test_run.py:51
sample::TrtCudaEvent::synchronize
void synchronize()
Definition: sampleDevice.h:139
nvinfer1::CalibrationAlgoType::kLEGACY_CALIBRATION
@ kLEGACY_CALIBRATION
loss_function.Tacotron2Loss.__init__
def __init__(self)
Definition: tacotron2/loss_function.py:21
nvinfer1::plugin::Region
Definition: regionPlugin.h:30
tests.quant_conv_test.TestQuantConv1D.test_no_quant
def test_no_quant(self)
Definition: quant_conv_test.py:269
tools.test_surgeon.TestSurgeonSanitize
Definition: test_surgeon.py:80
onnx_graphsurgeon.logger.logger.LogMode
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:55
nmtSample::BLEUScoreWriter::mReferenceInput
std::shared_ptr< std::istream > mReferenceInput
Definition: bleuScoreWriter.h:52
nvinfer1::plugin::CropAndResizePlugin::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: cropAndResizePlugin.cpp:194
pytorch_quantization.nn.modules.quant_conv.QuantConvTranspose3d
Definition: quant_conv.py:350
tools.test_run.TestTrt.test_trt_onnx_ext
def test_trt_onnx_ext(self)
Definition: test_run.py:93
nvinfer1::PluginField::name
const char * name
Plugin field attribute name.
Definition: NvInferRuntimeCommon.h:852
nvinfer1::plugin::LReLU::~LReLU
~LReLU() override=default
polygraphy.backend.onnxrt.loader.SessionFromOnnxBytes.__call__
def __call__(self)
Definition: backend/onnxrt/loader.py:33
polygraphy.util.cuda
Definition: cuda.py:1
IBatchStream::skip
virtual void skip(int skipCount)=0
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer.num_bits
def num_bits(self)
Definition: tensor_quantizer.py:110
stft.STFT.num_samples
num_samples
Definition: stft.py:81
polygraphy.tools.args.trt.loader.TrtLoaderArgs.check_registered
def check_registered(self)
Definition: tools/args/trt/loader.py:78
polygraphy.common.constants
Definition: constants.py:1
BatchStream::mDataDir
std::vector< std::string > mDataDir
Directories where the files can be found.
Definition: BatchStream.h:387
sample::roundUp
T roundUp(T m, T n)
Definition: sampleUtils.h:59
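The listing gives only the signature of roundUp, not its behavior. A minimal sketch, assuming the conventional meaning (round m up to the nearest multiple of n) and non-negative integer inputs; the Python name round_up is hypothetical and is not part of the samples:

```python
def round_up(m: int, n: int) -> int:
    """Round m up to the nearest multiple of n.

    Assumption: this mirrors the usual integer idiom ((m + n - 1) / n) * n;
    the listing above does not state what sample::roundUp actually does.
    Intended for m >= 0 and n > 0.
    """
    return ((m + n - 1) // n) * n


if __name__ == "__main__":
    # Example: padding a 10-element buffer to a multiple of 4.
    print(round_up(10, 4))
```

A helper of this kind is typically used to pad buffer sizes to an alignment boundary.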
test_ir.TestGraph.test_deep_copy
def test_deep_copy(self)
Definition: test_ir.py:682
builder.main
def main()
Definition: builder.py:646
bert::EmbLayerNormVarSeqlenPluginCreator
Definition: embLayerNormVarSeqlenPlugin.h:115
SampleUffFasterRcnn::batch_inverse_transform_classifier
void batch_inverse_transform_classifier(const float *roi_after_nms, int roi_num_per_img, const float *classifier_cls, const float *classifier_regr, std::vector< float > &pred_boxes, std::vector< int > &pred_cls_ids, std::vector< float > &pred_probs, std::vector< int > &box_num_per_img, int N)
Helper function to do post-processing (apply delta to ROIs).
Definition: sampleUffFasterRCNN.cpp:522
pytorch_quantization.nn.modules._utils.QuantMixin.set_default_quant_desc_weight
def set_default_quant_desc_weight(cls, value)
Definition: _utils.py:47
nvinfer1::CUDADriverWrapper::_cuLinkComplete
CUresult(* _cuLinkComplete)(CUlinkState, void **, size_t *)
Definition: cudaDriverWrapper.h:72
tests.calibrator_test.TestPercentileCalibrator
Definition: calibrator_test.py:249
nvinfer1::IPluginV2Ext::configurePlugin
virtual void configurePlugin(const Dims *inputDims, int32_t nbInputs, const Dims *outputDims, int32_t nbOutputs, const DataType *inputTypes, const DataType *outputTypes, const bool *inputIsBroadcast, const bool *outputIsBroadcast, PluginFormat floatFormat, int32_t maxBatchSize)=0
Configure the layer with input and output data types.
SampleOnnxMnistCoordConvAC::infer
bool infer()
Runs the TensorRT inference engine for this sample.
Definition: sampleOnnxMnistCoordConvAC.cpp:204
parserutils
Definition: parserUtils.h:35
nvinfer1::plugin::CropAndResizePlugin::supportsFormat
bool supportsFormat(DataType type, PluginFormat format) const override
Check format support.
Definition: cropAndResizePlugin.cpp:160
preprocess.preprocess
def preprocess(dynamic_graph)
Definition: preprocess.py:20
nvonnxparser::IOnnxConfig::setVerbosityLevel
virtual void setVerbosityLevel(Verbosity)=0
Set to specific verbosity Level.
ParserOnnxConfig::getFullTextFileName
virtual const char * getFullTextFileName() const
Get the File Name of the Network Description as a Text File, including the weights.
Definition: parserOnnxConfig.h:120
SampleMNIST::processInput
bool processInput(const samplesCommon::BufferManager &buffers, const std::string &inputTensorName, int inputFileIdx) const
Reads the input and mean data, preprocesses, and stores the result in a managed buffer.
Definition: sampleMNIST.cpp:174
polygraphy.util.cuda.DeviceBuffer.address
def address(self)
Definition: cuda.py:129
nvinfer1::plugin::GridAnchorBasePluginCreator::GridAnchorBasePluginCreator
GridAnchorBasePluginCreator()
Definition: gridAnchorPlugin.cpp:351
nvinfer1::plugin::RPROIPluginCreator::deserializePlugin
IPluginV2Ext * deserializePlugin(const char *name, const void *serialData, size_t serialLength) override
Called during deserialization of plugin layer.
Definition: nvFasterRCNNPlugin.cpp:443
backend.trt.test_loader.TestEngineFromNetwork
Definition: trt/test_loader.py:245
SampleMLP::loadShape
nvinfer1::Dims loadShape(std::ifstream &input)
Loads shape from weights file.
Definition: sampleMLP.cpp:396
nmtSample::read
int read(std::vector< Segment_t > &samples, std::shared_ptr< std::istream > input, int samplesToRead=1)
Definition: bleuScoreWriter.cpp:34
nvinfer1::plugin::PyramidROIAlign::setPluginNamespace
void setPluginNamespace(const char *libNamespace) override
Set the namespace that this plugin object belongs to.
Definition: pyramidROIAlignPlugin.cpp:134
nvinfer1::plugin::GenerateDetection::detachFromContext
void detachFromContext() override
Detach the plugin object from its execution context.
Definition: generateDetectionPlugin.cpp:335
nvuffparser::IUffParser::~IUffParser
virtual ~IUffParser()
Definition: NvUffParser.h:231
polygraphy.backend.trt.loader.EngineFromBytes.__init__
def __init__(self, serialized_engine)
Definition: backend/trt/loader.py:459
nmtSample::Vocabulary
String<->Id bijection storage.
Definition: vocabulary.h:34
backend.tf.test_loader.TestLoggerCallbacks.test_set_severity
def test_set_severity(self, sev)
Definition: tf/test_loader.py:30
samplesCommon::HalfMemory
TypedHostMemory< uint16_t, DataType::kHALF > HalfMemory
Definition: samples/common/common.h:355
LogStreamConsumer::severityOstream
static std::ostream & severityOstream(Severity severity)
Definition: pluginLogging.h:162
tests.quant_rnn_test.TestQuantLSTM.test_quant_different_prec
def test_quant_different_prec(self, verbose)
Definition: quant_rnn_test.py:479
backend.trt.test_loader.TestEngineFromNetwork.test_can_build_with_parser_owning
def test_can_build_with_parser_owning(self)
Definition: trt/test_loader.py:246
half_float::half::operator++
half operator++(int)
Postfix increment.
Definition: samples/common/half.h:1537
SampleUffSSD::mParams
SampleUffSSDParams mParams
The parameters for the sample.
Definition: sampleUffSSD.cpp:91
bert::Fused_multihead_attention_params_v2::scale_bmm2
uint32_t scale_bmm2
Definition: fused_multihead_attention_v2.h:58
samplesCommon::vPPM::magic
std::string magic
Definition: samples/common/common.h:662
nvinfer1::plugin::NMSParameters::iouThreshold
float iouThreshold
Definition: NvInferPluginUtils.h:251
polygraphy.util.cuda.Cuda.handle
handle
Definition: cuda.py:34
SharedMemory::open_ro
int open_ro()
Definition: sampleMovieLensMPS.cpp:131
nvinfer1::ICudaEngine::getBindingIndex
virtual int32_t getBindingIndex(const char *name) const noexcept=0
Retrieve the binding index for a named tensor.
pytorch_quantization.nn.modules.quant_rnn.QuantRNNCell.reset_parameters
def reset_parameters(self)
Definition: modules/quant_rnn.py:356
pytorch_quantization.nn.modules.quant_rnn.GRUCell.forward
def forward(self, input, hx=None)
Definition: modules/quant_rnn.py:446
sample::modelToNetwork
Parser modelToNetwork(const ModelOptions &model, nvinfer1::INetworkDefinition &network, std::ostream &err)
Generate a network definition for a given model.
Definition: sampleEngines.cpp:70
tests.quant_modules_test.TestQuantModuleReplace.test_simple_default_args
def test_simple_default_args(self)
Definition: quant_modules_test.py:34
nmtSample::LSTMEncoder::mNumLayers
int mNumLayers
Definition: lstmEncoder.h:54
comparator.test_data_loader.TestDataLoader.test_can_override_shape
def test_can_override_shape(self)
Definition: test_data_loader.py:29
SampleGoogleNet::mEngine
std::shared_ptr< nvinfer1::ICudaEngine > mEngine
The TensorRT engine used to run the network.
Definition: sampleGoogleNet.cpp:81
nvonnxparser::IOnnxConfig::getVerbosityLevel
virtual Verbosity getVerbosityLevel() const =0
Get the Verbosity Level.
pytorch_quantization.nn.modules.quant_rnn.QuantRNNBase.extra_repr
def extra_repr(self)
Definition: modules/quant_rnn.py:230
nvinfer1::plugin::DetectionLayer::mBackgroundLabel
int mBackgroundLabel
Definition: detectionLayerPlugin.h:93
sample::InferenceTrace::outEnd
float outEnd
Definition: sampleReporting.h:99
half_float::detail::nexttoward
half nexttoward(half from, long double to)
Next representable value.
Definition: ieee_half.h:2694
nvinfer1::plugin::MultilevelProposeROIPluginCreator::mKeepTopK
int mKeepTopK
Definition: multilevelProposeROIPlugin.h:140
tests.quant_conv_transposed_test.TestQuantConvTranspose2D.test_against_unquantized
def test_against_unquantized(self)
Definition: quant_conv_transposed_test.py:211
test_ir.TestConstant.input_node
input_node
Definition: test_ir.py:124
tests.quant_modules_test.TestQuantModuleReplace.test_with_no_replace_list
def test_with_no_replace_list(self)
Definition: quant_modules_test.py:45
tools.test_surgeon.TestSurgeonInsert.test_graph_output
def test_graph_output(self)
Definition: test_surgeon.py:65
nvinfer1::DeviceType
DeviceType
The device that this layer/network will execute on.
Definition: NvInferRuntime.h:706
polygraphy.util.misc.find_in_dict
def find_in_dict(name, mapping, index=None)
Definition: Polygraphy/polygraphy/util/misc.py:36
polygraphy.comparator.data_loader.DataLoader.__init__
def __init__(self, seed=None, iterations=None, input_metadata=None, int_range=None, float_range=None)
Definition: comparator/data_loader.py:48
sample::defaultPercentile
constexpr float defaultPercentile
Definition: sampleOptions.h:53
pytorch_quantization.nn.modules.quant_rnn.QuantRNNCell.bias_hh
bias_hh
Definition: modules/quant_rnn.py:350
samplesCommon::writePPMFileWithBBox
void writePPMFileWithBBox(const std::string &filename, PPM< C, H, W > &ppm, const BBox &bbox)
Definition: samples/common/common.h:699
nmtSample::TextReader
reads sequences of data from input stream
Definition: textReader.h:33
nvinfer1::ErrorCode
ErrorCode
Error codes that can be returned by TensorRT during execution.
Definition: NvInferRuntimeCommon.h:1116
test_examples.test_examples
def test_examples(example_dir, artifacts)
Definition: onnx-graphsurgeon/tests/test_examples.py:88
SampleOnnxMnistCoordConvAC::mOutputDims
nvinfer1::Dims mOutputDims
The dimensions of the output to the network.
Definition: sampleOnnxMnistCoordConvAC.cpp:78
SampleINT8APIParams::mPreproc
SampleINT8APIPreprocessing mPreproc
Definition: sampleINT8API.cpp:61
nvinfer1::RNNDirection::kBIDIRECTION
@ kBIDIRECTION
Network iterates from first to last and vice versa, and the outputs are concatenated.
nvcaffeparser1::ICaffeParser::setPluginFactoryExt
virtual void setPluginFactoryExt(IPluginFactoryExt *factory)=0
Set the IPluginFactoryExt used to create the user defined pluginExts.
nvinfer1::plugin::GenerateDetectionPluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: generateDetectionPlugin.h:135
onnx_graphsurgeon.ir.graph.NodeIDAdder.__init__
def __init__(self, graph)
Definition: graph.py:45
half_float::detail::operator+
enable< expr, T, U >::type operator+(T x, U y)
Add halfs.
Definition: ieee_half.h:2172
polygraphy.util.misc.Compressed.bytes
bytes
Definition: Polygraphy/polygraphy/util/misc.py:180
comparator.test_struct.TestRunResults.test_setitem_out_of_bounds
def test_setitem_out_of_bounds(self, run_results)
Definition: comparator/test_struct.py:79
nvinfer1::ITensor::getDynamicRangeMin
virtual float getDynamicRangeMin() const =0
Get minimum of dynamic range.
bert::EmbLayerNormVarSeqlenPlugin::mLayerName
const std::string mLayerName
Definition: embLayerNormVarSeqlenPlugin.h:85
nvinfer1::ITensor::setLocation
virtual void setLocation(TensorLocation location)=0
Set the storage location of a tensor.
nvinfer1::plugin::ReorgPluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: reorgPlugin.cpp:190
tests.calibrator_test.TestPercentileCalibrator.test_range
def test_range(self)
Definition: calibrator_test.py:298
tools.test_surgeon.TestSurgeonInsert.test_at_graph_input
def test_at_graph_input(self)
Definition: test_surgeon.py:73
export_tacotron2_onnx.Encoder.__init__
def __init__(self, tacotron2)
Definition: export_tacotron2_onnx.py:64
nvinfer1::plugin::ProposalLayer::mNameSpace
std::string mNameSpace
Definition: proposalLayerPlugin.h:113
nvinfer1::plugin::DetectionLayer::clone
IPluginV2Ext * clone() const override
Clone the plugin object.
Definition: detectionLayerPlugin.cpp:158
nmtSample::Vocabulary::mUnkId
int mUnkId
Definition: vocabulary.h:78
inference_trt.init_decoder_outputs
def init_decoder_outputs(memory, memory_lengths)
Definition: inference_trt.py:93
polygraphy.comparator.struct.RunResults.items
def items(self)
Definition: comparator/struct.py:60
polygraphy.logger.logger.Logger.severity
def severity(self)
Definition: Polygraphy/polygraphy/logger/logger.py:139
polygraphy.backend.trt.util.build_default_profile
def build_default_profile(builder, network, default_shape_value=None)
Definition: trt/util.py:280
nvinfer1::CUDADriverWrapper::cuModuleUnload
CUresult cuModuleUnload(CUmodule hmod) const
Definition: cudaDriverWrapper.cpp:86
util.test_format.test_format_deduction
def test_format_deduction(test_case)
Definition: test_format.py:36
Semaphore::post
void post()
Definition: sampleMovieLensMPS.cpp:99
half_float::detail::enable< T, half, expr, expr >::type
T type
Definition: ieee_half.h:382
nvinfer1::plugin::GridAnchorGenerator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: gridAnchorPlugin.cpp:284
bert::fused_multihead_attention_fp16_64_64_kernel_sm80_cu_o
unsigned char fused_multihead_attention_fp16_64_64_kernel_sm80_cu_o[]
Definition: fused_multihead_attention_fp16_64_64_kernel.sm80.cpp:19
AlgorithmCacheWriter
Writes the default algorithm choices made by TensorRT into a file.
Definition: sampleAlgorithmSelector.cpp:51
helpers.tokenization.convert_to_unicode
def convert_to_unicode(text)
Definition: tokenization.py:77
SampleSSDParams::visualThreshold
float visualThreshold
The minimum score threshold to consider a detection.
Definition: sampleSSD.cpp:52
bert::EmbLayerNormVarSeqlenPluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: embLayerNormVarSeqlenPlugin.cpp:425
polygraphy.backend.trt.runner.TrtRunner.get_input_metadata
def get_input_metadata(self)
Definition: backend/trt/runner.py:53
SampleMovieLens::parseMovieLensData
void parseMovieLensData()
Parses the MovieLens dataset and populates the SampleMovieLensParams data structure.
Definition: sampleMovieLens.cpp:423
nvinfer1::plugin::RPROIPluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: nvFasterRCNNPlugin.cpp:344
nvinfer1::plugin::Reorg::configurePlugin
void configurePlugin(const Dims *inputDims, int nbInputs, const Dims *outputDims, int nbOutputs, const DataType *inputTypes, const DataType *outputTypes, const bool *inputIsBroadcast, const bool *outputIsBroadcast, PluginFormat floatFormat, int maxBatchSize) override
Definition: reorgPlugin.cpp:154
nvinfer1::plugin::DetectionOutput::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: nmsPlugin.cpp:148
pytorch_quantization.nn.modules.quant_pooling.QuantMaxPool2d
Definition: quant_pooling.py:49
nvinfer1::PluginFieldType::kDIMS
@ kDIMS
nvinfer1::Dims field type.
polygraphy.backend.base.runner.BaseRunner.__init__
def __init__(self, name=None, prefix=None)
Definition: backend/base/runner.py:29
nvinfer1::plugin::BatchedNMSPlugin::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: batchedNMSPlugin.cpp:365
sample::MirroredBuffer::mHostBuffer
TrtHostBuffer mHostBuffer
Definition: sampleDevice.h:350
nvinfer1::INetworkDefinition::addShuffle
virtual IShuffleLayer * addShuffle(ITensor &input)=0
Add a shuffle layer to the network.
nvinfer1::ProfilingVerbosity
ProfilingVerbosity
List of verbosity levels of layer information exposed in NVTX annotations.
Definition: NvInfer.h:6415
samplesCommon::GpuTimer
Definition: samples/common/common.h:807
samplesCommon::vPPM::h
int h
Definition: samples/common/common.h:663
tests.quant_conv_transposed_test.TestQuantConvTranspose3D.test_no_quant
def test_no_quant(self)
Definition: quant_conv_transposed_test.py:240
polygraphy.tools.args.tf.loader.TfLoaderArgs.add_to_parser
def add_to_parser(self, parser)
Definition: tools/args/tf/loader.py:29
list
sample::anonymous_namespace{sampleInference.cpp}::MultiEvent
std::array< std::unique_ptr< TrtCudaEvent >, static_cast< int >(EventType::kNUM)> MultiEvent
Definition: sampleInference.cpp:273
SampleGoogleNet
The SampleGoogleNet class implements the GoogleNet sample.
Definition: sampleGoogleNet.cpp:46
nvinfer1::plugin::ResizeNearest::mScale
float mScale
Definition: resizeNearestPlugin.h:90
polygraphy.backend.onnx.loader.ModifyOnnx.__init__
def __init__(self, model, do_shape_inference=None, outputs=None, exclude_outputs=None)
Definition: backend/onnx/loader.py:108
nvinfer1::plugin::GridAnchorGenerator::serializeFromDevice
void serializeFromDevice(char *&hostBuffer, Weights deviceWeights) const
Definition: gridAnchorPlugin.cpp:262
sample::anonymous_namespace{sampleInference.cpp}::SyncStruct::mutex
std::mutex mutex
Definition: sampleInference.cpp:168
export_tacotron2_onnx.test_inference
def test_inference(encoder, decoder_iter, postnet)
Definition: export_tacotron2_onnx.py:208
model.Decoder.n_mel_channels
n_mel_channels
Definition: demo/Tacotron2/tacotron2/model.py:242
sample::LogStreamConsumerBuffer
Definition: samples/common/logging.h:34
polygraphy.backend.trt_legacy.TrtLegacyRunner.max_batch_size
max_batch_size
Definition: backend/trt_legacy.py:224
helpers.tokenization.BertTokenizer.wordpiece_tokenizer
wordpiece_tokenizer
Definition: tokenization.py:192
nvinfer1::plugin::FlattenConcat::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: flattenConcat.cpp:215
tests.tensor_quantizer_test.TestTensorQuantizer.test_properties
def test_properties(self)
Definition: tensor_quantizer_test.py:149
nvinfer1::plugin::ProposalLayer::ProposalLayer
ProposalLayer(int prenms_topk, int keep_topk, float iou_threshold, const nvinfer1::Dims &image_size)
Definition: proposalLayerPlugin.cpp:103
nvinfer1::ActivationType::kSELU
@ kSELU
Selu activation: x>0 ? beta * x : beta * (alpha*exp(x) - alpha)
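The kSELU formula above can be checked numerically with a small scalar sketch. The function name selu and the default alpha and beta values are assumptions for illustration (the defaults are the constants commonly used for SELU, not values stated in this listing); only the piecewise expression itself comes from the description:

```python
import math


def selu(x: float,
         alpha: float = 1.6732632423543772,  # assumed default, not from the listing
         beta: float = 1.0507009873554805    # assumed default, not from the listing
         ) -> float:
    """Scalar SELU as written in the listing:
    x > 0 ? beta * x : beta * (alpha * exp(x) - alpha)
    """
    if x > 0:
        return beta * x
    return beta * (alpha * math.exp(x) - alpha)


if __name__ == "__main__":
    for value in (-2.0, 0.0, 2.0):
        print(value, selu(value))
```

For large negative inputs the result approaches -alpha * beta, and for positive inputs it is a linear scaling by beta.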
nvinfer1::plugin::InstanceNormalizationPlugin::clone
nvinfer1::IPluginV2DynamicExt * clone() const override
Clone the plugin object.
Definition: instanceNormalizationPlugin.cpp:254
nvinfer1::plugin::FlattenConcatPluginCreator::mFC
static PluginFieldCollection mFC
Definition: flattenConcat.h:126
Args::failCount
std::atomic< int32_t > failCount
Definition: sampleMovieLensMPS.cpp:186
pytorch_quantization.nn.modules.quant_rnn.QuantLSTMCell.input_size
input_size
Definition: modules/quant_rnn.py:387
nvinfer1::ProfilingVerbosity::kVERBOSE
@ kVERBOSE
Register layer names in NVTX message field and register layer detail in NVTX JSON payload field.
test_importers.TestOnnxImporter.test_import_variable_tensor
def test_import_variable_tensor(self)
Definition: test_importers.py:33
tests.quant_rnn_test.TestQuantLSTM.test_quant_input_hidden
def test_quant_input_hidden(self, verbose)
Definition: quant_rnn_test.py:417
nvinfer1::plugin::DetectionLayerPluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: detectionLayerPlugin.cpp:52
nmtSample::LSTMDecoder::mGateKernelWeights
std::vector< nvinfer1::Weights > mGateKernelWeights
Definition: lstmDecoder.h:47
nvinfer1::plugin::DetectionLayer::mNameSpace
std::string mNameSpace
Definition: detectionLayerPlugin.h:105
SampleUffFasterRcnnParams::postNmsTopN
int postNmsTopN
Definition: sampleUffFasterRCNN.cpp:78
polygraphy.tools.args.tf.runner.TfRunnerArgs
Definition: tools/args/tf/runner.py:21
sample::UffInput::inputs
std::vector< std::pair< std::string, nvinfer1::Dims > > inputs
Definition: sampleOptions.h:86
Logger::report
Logger & report(Severity severity, const char *msg)
Definition: demo/BERT/infer_c/logging.h:47
nvinfer1::plugin::GenerateDetection::setPluginNamespace
void setPluginNamespace(const char *libNamespace) override
Set the namespace that this plugin object belongs to.
Definition: generateDetectionPlugin.cpp:180
model.Attention.query_layer
query_layer
Definition: demo/Tacotron2/tacotron2/model.py:51
sample::defaultSleep
constexpr int defaultSleep
Definition: sampleOptions.h:49
nvinfer1::plugin::DetectionLayer::~DetectionLayer
~DetectionLayer() override=default
backend.trt.test_loader.TestOnnxNetworkLoader.test_loader_explicit_precision
def test_loader_explicit_precision(self)
Definition: trt/test_loader.py:97
half_float::detail::expr::expr
expr(float f)
Conversion constructor.
Definition: ieee_half.h:357
text._clean_text
def _clean_text(text, cleaner_names)
Definition: demo/Tacotron2/tacotron2/text/__init__.py:56
test_ir.TestNodeIO.test_pop
def test_pop(self, field_names)
Definition: test_ir.py:261
bert::fused_multihead_attention_v2_fp16_96_64_kernel_sm75_cubin
unsigned char fused_multihead_attention_v2_fp16_96_64_kernel_sm75_cubin[]
Definition: fused_multihead_attention_v2_fp16_96_64_kernel.sm75.cpp:19
nvinfer1::plugin::MultilevelProposeROI::mMaxBatchSize
int mMaxBatchSize
Definition: multilevelProposeROIPlugin.h:101
polygraphy.util.misc.override_dynamic_shape
def override_dynamic_shape(shape)
Definition: Polygraphy/polygraphy/util/misc.py:153
nvcaffeparser1::CaffeWeightFactory::sizeOfCaffeType
static size_t sizeOfCaffeType(trtcaffe::Type type)
Definition: caffeWeightFactory.cpp:265
pytorch_quantization.calib.calibrator._Calibrator.collect
def collect(self, x)
Definition: tools/pytorch-quantization/pytorch_quantization/calib/calibrator.py:36
polygraphy.tools.args.comparator.ComparatorRunArgs.__init__
def __init__(self, iters=True, write=True)
Definition: tools/args/comparator.py:24
polygraphy.comparator.struct.RunResults.__getitem__
def __getitem__(self, key)
Definition: comparator/struct.py:98
samplesCommon::GenericBuffer::freeFn
FreeFunc freeFn
Definition: buffers.h:176
nvinfer1::IPluginV2IOExt::PluginFormat
const Dims PluginFormat
Definition: NvInferRuntimeCommon.h:792
nvcaffeparser1::CaffeWeightFactory::mInitialized
bool mInitialized
Definition: caffeWeightFactory.h:63
sample::anonymous_namespace{sampleEngines.cpp}::RndInt8Calibrator::mInputDeviceBuffers
std::map< std::string, void * > mInputDeviceBuffers
Definition: sampleEngines.cpp:186
tests.quant_instancenorm_test.TestQuantInstanceNorm3D.test_fake_quant_per_channel
def test_fake_quant_per_channel(self)
Definition: quant_instancenorm_test.py:182
tests.calibrator_test.TestHistogramCalibrator.test_skip_zeros
def test_skip_zeros(self, verbose)
Definition: calibrator_test.py:124
test_ir.TensorBaseTests.test_set_inputs_updates_old_inputs
def test_set_inputs_updates_old_inputs(self)
Definition: test_ir.py:55
nvinfer1::plugin::Normalize::copyToDevice
Weights copyToDevice(const void *hostData, size_t count)
Definition: normalizePlugin.cpp:144
sample::LogStreamConsumerBuffer::setShouldLog
void setShouldLog(bool shouldLog)
Definition: samples/common/logging.h:96
nvcaffeparser1::BinaryProtoBlob::BinaryProtoBlob
BinaryProtoBlob(void *memory, nvinfer1::DataType type, nvinfer1::DimsNCHW dimensions)
Definition: binaryProtoBlob.h:29
test_ir.TestNodeIO.node
node
Definition: test_ir.py:224
nvinfer1::IPluginRegistry::~IPluginRegistry
virtual ~IPluginRegistry() noexcept
Definition: NvInferRuntimeCommon.h:980
comparator.test_comparator.TestComparator.test_multiple_runners
def test_multiple_runners(self)
Definition: test_comparator.py:67
polygraphy.backend.tf.loader.ModifyGraph
Definition: backend/tf/loader.py:270
half_float::detail::functions::write
static std::basic_ostream< charT, traits > & write(std::basic_ostream< charT, traits > &out, float arg)
Output implementation.
Definition: samples/common/half.h:1626
pytorch_quantization.quant_modules.QuantModuleReplacementHelper.apply_quant_modules
def apply_quant_modules(self)
Definition: quant_modules.py:94
export_tacotron2_onnx.main
def main()
Definition: export_tacotron2_onnx.py:280
nvinfer1::plugin::InstanceNormalizationPlugin::_x_desc
cudnnTensorDescriptor_t _x_desc
Definition: instanceNormalizationPlugin.h:98
nvinfer1::plugin::SplitPlugin::_nx
int _nx
Definition: split.h:42
nvinfer1::plugin::DetectionLayerPluginCreator::mIOUThreshold
float mIOUThreshold
Definition: detectionLayerPlugin.h:130
nvinfer1::plugin::PriorBox::mW
int mW
Definition: priorBoxPlugin.h:93
pytorch_quantization.nn.modules.quant_bert.QuantBertSelfAttention.matmul_q_input_quantizer
matmul_q_input_quantizer
Definition: quant_bert.py:73
SampleErrorRecorder::~SampleErrorRecorder
virtual ~SampleErrorRecorder() noexcept
Definition: ErrorRecorder.h:45
FCPlugin::serialize
virtual void serialize(void *buffer) override
Serialize the layer.
Definition: samples/opensource/samplePlugin/fcPlugin.h:210
inference.MeasureTime
Definition: Tacotron2/inference.py:164
SampleBuffer::dims
nvinfer1::Dims dims
Definition: sampleReformatFreeIO.cpp:183
nvinfer1::plugin::ReorgPluginCreator::stride
int stride
Definition: reorgPlugin.h:111
nmtSample::LSTMEncoder::mNumUnits
int mNumUnits
Definition: lstmEncoder.h:55
anonymous_namespace{priorBoxPlugin.cpp}::PRIOR_BOX_PLUGIN_NAME
const char * PRIOR_BOX_PLUGIN_NAME
Definition: priorBoxPlugin.cpp:33
nvinfer1::plugin::NMSPluginCreator
Definition: nmsPlugin.h:94
polygraphy.common.exception.PolygraphyException
Definition: Polygraphy/polygraphy/common/exception.py:16
nvinfer1::plugin::ReorgPluginCreator::mFC
static PluginFieldCollection mFC
Definition: reorgPlugin.h:110
SampleCharRNNBase::convertRNNWeights
nvinfer1::Weights convertRNNWeights(nvinfer1::Weights input, int dataSize)
Converts RNN weights from TensorFlow's format to TensorRT's format.
Definition: sampleCharRNN.cpp:441
bert::DATA_TYPE_E8M10
@ DATA_TYPE_E8M10
Definition: fused_multihead_attention_common.h:25
polygraphy.tools.args.trt.loader.TrtLoaderArgs.calibration_cache
calibration_cache
Definition: tools/args/trt/loader.py:103
SampleUffFasterRcnn::teardown
bool teardown()
Cleans up any state created in the sample class.
Definition: sampleUffFasterRCNN.cpp:369
nmtSample::Embedder::addToModel
virtual void addToModel(nvinfer1::INetworkDefinition *network, nvinfer1::ITensor *input, nvinfer1::ITensor **output)=0
Add the embedding vector calculation to the network.
tests.optim_helper_test.TestGroupParameters
Definition: optim_helper_test.py:51
export_waveglow_onnx.export_onnx
def export_onnx(parser, args)
Definition: export_waveglow_onnx.py:156
sample::InferenceTime::~InferenceTime
~InferenceTime()=default
polygraphy.tools.args.trt.loader.TrtLoaderArgs.strict_types
strict_types
Definition: tools/args/trt/loader.py:104
bert::SkipLayerNormInterleavedPlugin::mGammaDev
bert::cuda_unique_ptr< void > mGammaDev
Definition: skipLayerNormInt8InterleavedPlugin.h:74
tests.quant_pooling_test.TestQuantAdaptiveAvgPool2d.test_input_variable_bits
def test_input_variable_bits(self)
Definition: quant_pooling_test.py:260
nvinfer1::RNNInputMode
RNNInputMode
Enumerates the RNN input modes that may occur with an RNN layer.
Definition: NvInfer.h:2841
BertInference::run
void run(const void *const *inputBuffers, int warmUps, int iterations)
Definition: bert_infer.h:219
comparator.test_compare.TestBasicCompareFunc.test_can_compare_bool
def test_can_compare_bool(self)
Definition: test_compare.py:23
sample::LayerProfile::timeMs
float timeMs
Definition: sampleReporting.h:161
tests.tensor_quant_test.TestTensorQuant.test_per_tensor_scale
def test_per_tensor_scale(self)
Definition: tensor_quant_test.py:46
SampleGoogleNet::SampleUniquePtr
std::unique_ptr< T, samplesCommon::InferDeleter > SampleUniquePtr
Definition: sampleGoogleNet.cpp:49
nvinfer1::utility::getTRTOptionIndex
int getTRTOptionIndex(const std::vector< TRTOption > &options, const TRTOption &opt)
getTRTOptionIndex returns the index of a TRTOption in a vector of TRTOptions, or -1 if not found.
Definition: getOptions.cpp:64
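The lookup contract described for getTRTOptionIndex (an index on a match, -1 otherwise) can be sketched in Python as follows. This is an illustration of the described behavior, not the C++ implementation: `Option` is a hypothetical stand-in for TRTOption that models only a long name, and matching by plain equality is an assumption.

```python
from typing import List, NamedTuple


class Option(NamedTuple):
    """Hypothetical stand-in for TRTOption; only the long name is modeled."""
    long_name: str


def get_option_index(options: List[Option], opt: Option) -> int:
    """Return the position of opt in options, or -1 if it is absent."""
    for index, candidate in enumerate(options):
        if candidate == opt:  # assumption: options match by plain equality
            return index
    return -1
```

Returning -1 as a sentinel means callers have to test the result before using it as an index.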
tests.quant_pooling_test.TestQuantAdaptiveAvgPool3d.test_raise
def test_raise(self)
Definition: quant_pooling_test.py:289
nvinfer1::plugin::GridAnchorBasePluginCreator::mFC
static PluginFieldCollection mFC
Definition: gridAnchorPlugin.h:123
nvinfer1::plugin::GenerateDetection::~GenerateDetection
~GenerateDetection() override=default
SampleAlgorithmSelector::mMeanBlob
SampleUniquePtr< nvcaffeparser1::IBinaryProtoBlob > mMeanBlob
Definition: sampleAlgorithmSelector.cpp:364
sample::anonymous_namespace{sampleEngines.cpp}::RndInt8Calibrator::mCurrentBatch
int mCurrentBatch
Definition: sampleEngines.cpp:184
builder.BertConfig.use_fc2_gemm
use_fc2_gemm
Definition: builder.py:99
nvinfer1::plugin::Region::coords
int coords
Definition: regionPlugin.h:94
nvinfer1::DataType::kBOOL
@ kBOOL
8-bit boolean. 0 = false, 1 = true, other values undefined.
SampleDynamicReshape::makeUnique
SampleUniquePtr< T > makeUnique(T *t)
Definition: sampleDynamicReshape.cpp:94
Logger::TestResult
TestResult
Represents the state of a given test.
Definition: pluginLogging.h:239
bert::fused_multihead_attention_v2_fp16_96_64_kernel_sm75_cubin_len
unsigned int fused_multihead_attention_v2_fp16_96_64_kernel_sm75_cubin_len
Definition: fused_multihead_attention_v2_fp16_96_64_kernel.sm75.cpp:2852
polygraphy.comparator.struct.RunResults.__contains__
def __contains__(self, val)
Definition: comparator/struct.py:122
half_float::detail::functions::asinh
static expr asinh(float arg)
Hyperbolic area sine implementation.
Definition: samples/common/half.h:1967
nvinfer1::plugin::GenerateDetectionPluginCreator::mNbClasses
int mNbClasses
Definition: generateDetectionPlugin.h:131
sample::Parser::onnxParser
TrtUniquePtr< nvonnxparser::IParser > onnxParser
Definition: sampleEngines.h:37
nvinfer1::plugin::RPROIParams::featureStride
int32_t featureStride
Definition: NvInferPluginUtils.h:120
nvinfer1::plugin::reduced_divisor::get
__host__ __device__ __forceinline__ int get() const
Definition: reducedMathPlugin.h:85
pytorch_quantization.nn.modules.quant_pooling.QuantAdaptiveAvgPool1d.__init__
def __init__(self, output_size, **kwargs)
Definition: quant_pooling.py:119
polygraphy.backend.trt.runner
Definition: backend/trt/runner.py:1
nvinfer1::plugin::MultilevelProposeROI::mDeviceBboxes
float ** mDeviceBboxes
Definition: multilevelProposeROIPlugin.h:110
nmtSample::LSTMDecoder::addToModel
void addToModel(nvinfer1::INetworkDefinition *network, nvinfer1::ITensor *inputEmbeddedData, nvinfer1::ITensor **inputStates, nvinfer1::ITensor **outputData, nvinfer1::ITensor **outputStates) override
Add the memory, cell, and hidden states to the network.
Definition: lstmDecoder.cpp:64
onnx_graphsurgeon.exporters.onnx_exporter
Definition: onnx_exporter.py:1
model.Postnet.n_convs
n_convs
Definition: demo/Tacotron2/tacotron2/model.py:159
model.Decoder.parse_decoder_outputs
def parse_decoder_outputs(self, mel_outputs, gate_outputs, alignments)
Definition: demo/Tacotron2/tacotron2/model.py:361
nvinfer1::plugin::MultilevelProposeROI::mAnchorBoxesHost
std::vector< std::vector< float > > mAnchorBoxesHost
Definition: multilevelProposeROIPlugin.h:106
samplesCommon::argMagnitudeSort
std::vector< size_t > argMagnitudeSort(Iter begin, Iter end)
Return a vector of indices that puts the magnitudes of the sequence in descending order.
Definition: samples/common/common.h:399
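The ordering described for argMagnitudeSort is an argsort on absolute value, largest first. A minimal Python sketch of that idea follows; the name `arg_magnitude_sort` is illustrative, and the handling of ties (original order kept) is a property of Python's stable sort, not something the entry above states.

```python
from typing import List, Sequence


def arg_magnitude_sort(values: Sequence[float]) -> List[int]:
    """Return indices that order values by descending absolute value."""
    return sorted(range(len(values)), key=lambda i: abs(values[i]), reverse=True)


# The element with the largest magnitude (-3.0) sits at index 1.
print(arg_magnitude_sort([1.0, -3.0, 2.0]))
```

Sorting indices rather than values leaves the input untouched and lets the caller reorder several parallel sequences with the same permutation.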
infer.DeviceBuffer.binding
def binding(self)
Definition: infer.py:36
TLTMaskRCNNConfig::MAX_PRE_NMS_RESULTS
static const int MAX_PRE_NMS_RESULTS
Definition: tlt_mrcnn_config.h:76
sample::Options
Definition: sampleOptions.h:69
comparator.test_struct.TestRunResults.test_items
def test_items(self, run_results)
Definition: comparator/test_struct.py:28
backend.trt.test_loader.TestModifyNetwork
Definition: trt/test_loader.py:120
polygraphy.backend.onnx.runner.OnnxTfRunner.tf_rep
tf_rep
Definition: backend/onnx/runner.py:43
TLTMaskRCNNConfig::NUM_CLASSES
static const int NUM_CLASSES
Definition: tlt_mrcnn_config.h:56
nvinfer1::plugin::RPROIPlugin::supportsFormat
bool supportsFormat(DataType type, PluginFormat format) const override
Check format support.
Definition: nvFasterRCNNPlugin.cpp:214
polygraphy.common.func.invoke
def invoke(func, *args, **kwargs)
Definition: func.py:101
test_exporters.TestOnnxExporter
Definition: test_exporters.py:33
BatchStream::update
bool update()
Definition: BatchStream.h:295
nvinfer1::plugin::GridAnchorBasePluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: gridAnchorPlugin.cpp:374
polygraphy.backend.base.runner.BaseRunner.__del__
def __del__(self)
Definition: backend/base/runner.py:156
SampleAlgorithmSelector::SampleAlgorithmSelector
SampleAlgorithmSelector(const samplesCommon::CaffeSampleParams &params)
Definition: sampleAlgorithmSelector.cpp:318
polygraphy.backend.trt_legacy.TrtLegacyRunner.HostDeviceMem
Definition: backend/trt_legacy.py:179
nmtSample::SLPEmbedder::SLPEmbedder
SLPEmbedder(ComponentWeights::ptr weights)
Definition: slpEmbedder.cpp:25
nvinfer1::UnaryOperation::kSQRT
@ kSQRT
Square root.
text.numbers._remove_commas
def _remove_commas(m)
Definition: numbers.py:31
backend.trt.test_loader.TestModifyNetwork.test_mark_shape_outputs
def test_mark_shape_outputs(self, reshape_network)
Definition: trt/test_loader.py:145
nvinfer1::plugin::Region::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: regionPlugin.cpp:419
nvinfer1::EnumMax< PaddingMode >
constexpr int32_t EnumMax< PaddingMode >()
Maximum number of elements in PaddingMode enum.
Definition: NvInfer.h:1182
nvinfer1::plugin::LReLU::enqueue
int enqueue(int batchSize, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: lReluPlugin.cpp:58
pytorch_quantization.tensor_quant.ScaledQuantDescriptor.dict
def dict(self)
Definition: tensor_quant.py:174
pytorch_quantization.nn.modules.quant_instancenorm.QuantInstanceNorm3d
Definition: quant_instancenorm.py:65
polygraphy.comparator.comparator
Definition: comparator/comparator.py:1
BufferDesc::getElememtSize
int getElememtSize()
Definition: sampleReformatFreeIO.cpp:123
nvinfer1::plugin::DetectionLayer::mScoreThreshold
float mScoreThreshold
Definition: detectionLayerPlugin.h:96
polygraphy.backend.trt.runner.TrtRunner.owns_engine
owns_engine
Definition: backend/trt/runner.py:80
nvinfer1::IExprBuilder::~IExprBuilder
virtual ~IExprBuilder()
Definition: NvInferRuntime.h:367
nvinfer1::plugin::InstanceNormalizationPlugin::_d_bytes
size_t _d_bytes
Definition: instanceNormalizationPlugin.h:96
comparator.test_compare.TestBasicCompareFunc.test_default_tol_in_map
def test_default_tol_in_map(self, mode)
Definition: test_compare.py:65
SampleDynamicReshape::mPredictionOutputDims
nvinfer1::Dims mPredictionOutputDims
The dimensions of the output of the MNIST model.
Definition: sampleDynamicReshape.cpp:81
nvinfer1::NetworkDefinitionCreationFlags
uint32_t NetworkDefinitionCreationFlags
This bitset is capable of representing one or more NetworkDefinitionCreationFlag flags constructed wi...
Definition: NvInfer.h:6880
polygraphy.comparator.struct.RunResults.values
def values(self)
Definition: comparator/struct.py:77
sample::volume
int volume(const nvinfer1::Dims &d)
Definition: sampleUtils.h:64
nvcaffeparser1::IBlobNameToTensor::~IBlobNameToTensor
virtual ~IBlobNameToTensor()
Definition: NvCaffeParser.h:59
TLTMaskRCNNConfig::MASK_THRESHOLD
static const float MASK_THRESHOLD
Definition: tlt_mrcnn_config.h:34
backend.onnxrt.test_runner.TestOnnxrtRunner.test_basic
def test_basic(self)
Definition: onnxrt/test_runner.py:37
sample::UffInput::help
static void help(std::ostream &out)
Definition: sampleOptions.cpp:1145
nvinfer1::IBuilderConfig::destroy
virtual void destroy()=0
De-allocates any internally allocated memory.
half_float::detail::lround
long lround(half arg)
Nearest integer.
Definition: ieee_half.h:2579
polygraphy.logger.logger.has_colors
def has_colors()
Definition: Polygraphy/polygraphy/logger/logger.py:29
tests.quant_pooling_test.TestQuantMaxPool1d.test_raise
def test_raise(self)
Definition: quant_pooling_test.py:40
polygraphy.backend.tf.runner.TfRunner.timeline_dir
timeline_dir
Definition: backend/tf/runner.py:53
nmtSample::BLEUScoreWriter::getScore
float getScore() const
Definition: bleuScoreWriter.cpp:190
nmtSample::LSTMDecoder::mWeights
ComponentWeights::ptr mWeights
Definition: lstmDecoder.h:46
ConcatTopKWorkSpace::sortedBBoxDims
nvinfer1::DimsHW sortedBBoxDims
Definition: maskRCNNKernels.h:198
backend.trt.test_loader.TestEngineFromNetwork.test_can_build_without_parser_non_owning
def test_can_build_without_parser_non_owning(self, identity_builder_network)
Definition: trt/test_loader.py:252
sample::ModelOptions::outputs
std::vector< std::string > outputs
Definition: sampleOptions.h:98
half_float::detail::expr
Temporary half-precision expression.
Definition: ieee_half.h:353
test_ir.TestNodeIO.test_iadd
def test_iadd(self, field_names)
Definition: test_ir.py:301
half_float::detail::isfinite
bool isfinite(half arg)
Check if finite number.
Definition: ieee_half.h:2728
test_ir.TestConstant.setup_method
def setup_method(self)
Definition: test_ir.py:122
nvinfer1::plugin::RPROIPlugin::detachFromContext
void detachFromContext() override
Detach the plugin object from its execution context.
Definition: nvFasterRCNNPlugin.cpp:309
samplesCommon::getH
int getH(const Dims &d)
Definition: samples/common/common.h:886
nvonnxparser::ErrorCode::kINVALID_VALUE
@ kINVALID_VALUE
tools.common.check_subprocess
def check_subprocess(status)
Definition: tools/common.py:27
nvinfer1::plugin::CodeTypeSSD::CORNER
@ CORNER
Use box corners.
nvinfer1::plugin::DetectionLayer::configurePlugin
void configurePlugin(const Dims *inputDims, int nbInputs, const Dims *outputDims, int nbOutputs, const DataType *inputTypes, const DataType *outputTypes, const bool *inputIsBroadcast, const bool *outputIsBroadcast, PluginFormat floatFormat, int maxBatchSize) override
Definition: detectionLayerPlugin.cpp:294
sample::BuildOptions::nvtxMode
nvinfer1::ProfilingVerbosity nvtxMode
Definition: sampleOptions.h:120
bert::fused_multihead_attention_v2_int8_192_64_kernel_cubin
unsigned char fused_multihead_attention_v2_int8_192_64_kernel_cubin[]
Definition: fused_multihead_attention_v2_int8_192_64_kernel.sm72.cpp:19
SampleMaskRCNN::mEngine
std::shared_ptr< nvinfer1::ICudaEngine > mEngine
Definition: sampleUffMaskRCNN.cpp:340
pytorch_quantization.nn.modules.quant_rnn.GRUCell.bias_hh
bias_hh
Definition: modules/quant_rnn.py:435
polygraphy.tools.precision.precision.Precision.add_parser_args
def add_parser_args(self, parser)
Definition: precision.py:300
nvuffparser::FieldMap
An array of field params used as a layer parameter for plugin layers.
Definition: NvUffParser.h:76
model.Postnet
Definition: demo/Tacotron2/tacotron2/model.py:122
nmtSample::Likelihood
Calculates likelihood and TopK indices for the raw input logits.
Definition: likelihood.h:33
nvinfer1::IRefitter::setWeights
virtual bool setWeights(const char *layerName, WeightsRole role, Weights weights)=0
Specify new weights for a layer of given name.
onnx_models.initializer_is_output_model
def initializer_is_output_model()
Definition: onnx_models.py:175
test_ir.TestNode.test_o_multiple_outputs
def test_o_multiple_outputs(self)
Definition: test_ir.py:211
samplesCommon::BufferManager::copyInputToDeviceAsync
void copyInputToDeviceAsync(const cudaStream_t &stream=0)
Copy the contents of input host buffers to input device buffers asynchronously.
Definition: buffers.h:411
UffPoolPluginV2::mOutputDims
Dims mOutputDims
Definition: sampleUffPluginV2Ext.cpp:639
polygraphy.tools.base.tool.Tool.name
name
Definition: tool.py:25
sample::anonymous_namespace{sampleInference.cpp}::TimePoint
std::chrono::time_point< std::chrono::high_resolution_clock > TimePoint
Definition: sampleInference.cpp:160
SampleAlgorithmSelector::infer
bool infer()
Runs the TensorRT inference engine for this sample.
Definition: sampleAlgorithmSelector.cpp:561
half_float::half
Half-precision floating point type.
Definition: ieee_half.h:1084
sample::AllOptions::reporting
ReportingOptions reporting
Definition: sampleOptions.h:192
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer._num_bits
_num_bits
Definition: tensor_quantizer.py:74
model.Decoder.encoder_embedding_dim
encoder_embedding_dim
Definition: demo/Tacotron2/tacotron2/model.py:244
layers.LinearNorm.linear_layer
linear_layer
Definition: layers.py:26
half_float::detail::isgreaterequal
bool isgreaterequal(half x, half y)
Comparison for greater equal.
Definition: ieee_half.h:2784
nmtSample::Attention::addToModel
virtual void addToModel(nvinfer1::INetworkDefinition *network, nvinfer1::ITensor *inputFromDecoder, nvinfer1::ITensor *context, nvinfer1::ITensor **attentionOutput)=0
Add the attention vector calculation to the network.
CudaBind
Definition: maskRCNNKernels.h:63
sample::anonymous_namespace{sampleInference.cpp}::Iteration::Iteration
Iteration(int id, const InferenceOptions &inference, nvinfer1::IExecutionContext &context, Bindings &bindings)
Definition: sampleInference.cpp:285
nvinfer1::plugin::PriorBoxParameters::maxSize
float * maxSize
Definition: NvInferPluginUtils.h:93
nvcaffeparser1::CaffeWeightFactory::mOK
bool mOK
Definition: caffeWeightFactory.h:65
nvinfer1::plugin::DetectionLayer::setPluginNamespace
void setPluginNamespace(const char *libNamespace) override
Set the namespace that this plugin object belongs to.
Definition: detectionLayerPlugin.cpp:165
pytorch_quantization.quant_modules
Definition: quant_modules.py:1
sample::modelToEngine
ICudaEngine * modelToEngine(const ModelOptions &model, const BuildOptions &build, const SystemOptions &sys, std::ostream &err)
Create an engine for a given model.
Definition: sampleEngines.cpp:590
IPluginLayer
Layer type for plugins.
Definition: NvInfer.h:3421
nvinfer1::plugin::SplitPluginCreator::getPluginVersion
const char * getPluginVersion() const
Return the plugin version.
Definition: split.h:147
nmtSample::BeamSearchPolicy::mTimestepId
int mTimestepId
Definition: beamSearchPolicy.h:74
nvinfer1::plugin::FlattenConcat::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: flattenConcat.cpp:274
polygraphy.util.misc.is_dimension_dynamic
def is_dimension_dynamic(dim)
Shapes.
Definition: Polygraphy/polygraphy/util/misc.py:134
polygraphy.backend.onnx.runner
Definition: backend/onnx/runner.py:1
half_float::detail::binary_specialized< half, half >::fmax
static half fmax(half x, half y)
Definition: samples/common/half.h:2594
nmtSample::LSTMDecoder
Encodes a single input into output states with an LSTM.
Definition: lstmDecoder.h:31
nvinfer1::plugin::ProposalLayer::mAnchorsCnt
int mAnchorsCnt
Definition: proposalLayerPlugin.h:103
half_float::detail::is_float
Type traits for floating point types.
Definition: ieee_half.h:305
bert::Fused_multihead_attention_params::scale_bmm1
uint32_t scale_bmm1
Definition: fused_multihead_attention.h:81
nvinfer1::CUDADriverWrapper::_cuLinkCreate
CUresult(* _cuLinkCreate)(unsigned int, CUjit_option *, void **, CUlinkState *)
Definition: cudaDriverWrapper.h:75
nmtSample::SoftmaxLikelihood::addToModel
void addToModel(nvinfer1::INetworkDefinition *network, int beamWidth, nvinfer1::ITensor *inputLogits, nvinfer1::ITensor *inputLikelihoods, nvinfer1::ITensor **newCombinedLikelihoods, nvinfer1::ITensor **newRayOptionIndices, nvinfer1::ITensor **newVocabularyIndices) override
Add calculation of likelihood and TopK indices to the network.
Definition: softmaxLikelihood.cpp:25
nvinfer1::ILoopBoundaryLayer
Definition: NvInfer.h:4523
nvinfer1::IPlugin::enqueue
virtual int32_t enqueue(int32_t batchSize, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream)=0
Execute the layer.
layers.TacotronSTFT.spectral_de_normalize
def spectral_de_normalize(self, magnitudes)
Definition: layers.py:74
nvinfer1::plugin::SplitPluginCreator::SplitPluginCreator
SplitPluginCreator()
Definition: split.h:138
backend.trt.test_loader.TestEngineFromNetwork.test_can_build_with_calibrator
def test_can_build_with_calibrator(self, identity_builder_network)
Definition: trt/test_loader.py:259
polygraphy.backend.tf.runner.TfRunner.sess
sess
Definition: backend/tf/runner.py:75
polygraphy.tools.args.onnx.loader.OnnxLoaderArgs.check_registered
def check_registered(self)
Definition: tools/args/onnx/loader.py:57
backend.trt.test_calibrator.TestCalibrator.test_calibrator_rechecks_cache_on_reset
def test_calibrator_rechecks_cache_on_reset(self, identity_builder_network)
Definition: test_calibrator.py:128
tests.functional_test.TestClip.test_simple_run
def test_simple_run(self)
Definition: functional_test.py:37
export_waveglow_onnx.convert_1d_to_2d_
def convert_1d_to_2d_(glow)
Definition: export_waveglow_onnx.py:83
nvinfer1::UnaryOperation::kRECIP
@ kRECIP
Reciprocal.
bert::Fused_multihead_attention_params::s
int s
Definition: fused_multihead_attention.h:79
LogStreamConsumer::getMutex
std::mutex & getMutex()
Definition: pluginLogging.h:156
bert::fused_multihead_attention_fp16_128_64_kernel_sm80_cu_o_len
unsigned int fused_multihead_attention_fp16_128_64_kernel_sm80_cu_o_len
Definition: fused_multihead_attention_fp16_128_64_kernel.sm80.cpp:2283
generate.volume
def volume
Definition: 09_shape_operations_with_the_layer_api/generate.py:59
sample::Logger::TestResult::kWAIVED
@ kWAIVED
The test was waived.
polygraphy.common.struct
Definition: common/struct.py:1
nvinfer1::plugin::SplitPlugin::deserialize
void deserialize(void const *serialData, size_t serialLength)
Definition: split.h:53
nvinfer1::plugin::gLogError
LogStream< ILogger::Severity::kERROR > gLogError
Definition: checkMacrosPlugin.cpp:48
PoolParameters
Definition: sampleUffPluginV2Ext.cpp:172
nvinfer1::plugin::BatchedNMSPlugin::configurePlugin
void configurePlugin(const Dims *inputDims, int nbInputs, const Dims *outputDims, int nbOutputs, const DataType *inputTypes, const DataType *outputTypes, const bool *inputIsBroadcast, const bool *outputIsBroadcast, PluginFormat floatFormat, int maxBatchSize) override
Definition: batchedNMSPlugin.cpp:284
nvinfer1::plugin::BatchedNMSPluginCreator::createPlugin
IPluginV2Ext * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: batchedNMSPlugin.cpp:511
backend.trt.test_runner.TestTrtRunner.test_basic
def test_basic(self)
Definition: trt/test_runner.py:42
test_examples.Example.__exit__
def __exit__(self, exc_type, exc_value, traceback)
Definition: Polygraphy/tests/test_examples.py:77
pytorch_quantization.tensor_quant.ScaledQuantDescriptor.scale_amax
def scale_amax(self)
Definition: tensor_quant.py:132
nvinfer1::plugin::CoordConvACPlugin::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: coordConvACPlugin.cpp:134
polygraphy.backend.tf.loader.SessionFromGraph.graph
graph
Definition: backend/tf/loader.py:405
util.test_format.FormatTestCase.shape
shape
Definition: test_format.py:25
nvinfer1::plugin::InstanceNormalizationPlugin::InstanceNormalizationPlugin
InstanceNormalizationPlugin()=delete
polygraphy.backend.onnxrt.runner.OnnxrtRunner
Definition: backend/onnxrt/runner.py:25
data_function.batch_to_gpu
def batch_to_gpu(batch)
Definition: tacotron2/data_function.py:132
MultilevelProposeROIWorkSpace::argMaxBboxDims
nvinfer1::DimsHW argMaxBboxDims
Definition: maskRCNNKernels.h:165
samplesCommon::CpuTimer::stop
void stop()
Definition: samples/common/common.h:849
MaskRCNNConfig::MODEL_MASK_SHAPE
static const Dims4 MODEL_MASK_SHAPE
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:175
tests.tensor_quant_test.TestQuantDescriptor.test_from_to_dict
def test_from_to_dict(self)
Definition: tensor_quant_test.py:325
test_func.TestExtend.test_extend_can_modify_rv_objects
def test_extend_can_modify_rv_objects(self)
Definition: test_func.py:118
SampleOnnxMnistCoordConvAC::mInputDims
nvinfer1::Dims mInputDims
The dimensions of the input to the network.
Definition: sampleOnnxMnistCoordConvAC.cpp:77
nvinfer1::LayerType::kRAGGED_SOFTMAX
@ kRAGGED_SOFTMAX
Ragged softmax layer.
polygraphy.backend.tf.loader.CreateConfig.use_xla
use_xla
Definition: backend/tf/loader.py:371
nvinfer1::ActivationType::kCLIP
@ kCLIP
Clip activation: max(alpha, min(beta, x))
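The kCLIP expression above bounds x below by alpha and above by beta. A small Python sketch of that expression, with an illustrative function name and sample bounds that are assumptions rather than TensorRT defaults:

```python
def clip(x: float, alpha: float, beta: float) -> float:
    """Evaluate max(alpha, min(beta, x)): alpha is the floor, beta the ceiling."""
    return max(alpha, min(beta, x))


# Sample bounds chosen for illustration only.
for x in (-2.0, 0.5, 5.0):
    print(x, clip(x, alpha=0.0, beta=1.0))
```

Because `max` is applied last, the expression as written returns alpha whenever alpha exceeds beta.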
bert::TFusedMultiHeadAttentionXMMAKernel::FusedMultiHeadAttentionKernelInfo::mDeviceFunction
CUfunction mDeviceFunction
Definition: fused_multihead_attention.h:285
bert::EmbLayerNormVarSeqlenPlugin::mWordVocabSize
size_t mWordVocabSize
Definition: embLayerNormVarSeqlenPlugin.h:94
sample::operator+
InferenceTime operator+(const InferenceTime &a, const InferenceTime &b)
Definition: sampleReporting.h:102
RefineDetectionWorkSpace::argMaxScoreDims
nvinfer1::DimsHW argMaxScoreDims
Definition: maskRCNNKernels.h:97
polygraphy.tools.args.tf.loader.TfLoaderArgs._enable_artifacts
_enable_artifacts
Definition: tools/args/tf/loader.py:25
polygraphy.tools.inspect.subtool.data
Definition: data.py:1
backend.trt.test_loader.TestConfigLoader.test_int8
def test_int8(self, identity_builder_network, flag)
Definition: trt/test_loader.py:218
nvuffparser::FieldMap::data
const void * data
Definition: NvUffParser.h:80
SampleUffFasterRcnnParams::classNames
std::vector< std::string > classNames
Definition: sampleUffFasterRCNN.cpp:76
test_ir.TestNode.test_i
def test_i(self)
Definition: test_ir.py:190
nvinfer1::utility::TRTOption::longName
std::string longName
Option name in long (double hyphen) form (e.g. --foo, --bar)
Definition: getOptions.h:37
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer._fake_quant
_fake_quant
Definition: tensor_quantizer.py:75
nvinfer1::LayerType::kSCALE
@ kSCALE
Scale layer.
onnx_graphsurgeon.logger.logger.Logger.debug
def debug(self, message, mode=LogMode.EACH)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:227
nvinfer1::IAlgorithmIOInfo::getDataType
virtual DataType getDataType() const =0
Return DataType of the input/output of algorithm.
polygraphy.tools.args.trt.loader.TrtLoaderArgs.ext
ext
Definition: tools/args/trt/loader.py:88
nvinfer1::plugin::InstanceNormalizationPluginCreator::mFC
static PluginFieldCollection mFC
Definition: instanceNormalizationPlugin.h:120
tests.quant_conv_test.TestQuantConv2D.test_no_quant
def test_no_quant(self)
Definition: quant_conv_test.py:51
BatchStream::mSuffix
std::string mSuffix
Batch file name suffix.
Definition: BatchStream.h:384
onnx_graphsurgeon.logger.logger.LoggerSuppress.logger
logger
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:43
nvinfer1::plugin::CropAndResizePlugin
Definition: cropAndResizePlugin.h:36
SampleUffSSDParams::outputClsSize
int32_t outputClsSize
The number of output classes.
Definition: sampleUffSSD.cpp:52
nvinfer1::plugin::GenerateDetection::attachToContext
void attachToContext(cudnnContext *cudnnContext, cublasContext *cublasContext, IGpuAllocator *gpuAllocator) override
Attach the plugin object to an execution context and grant the plugin the access to some context reso...
Definition: generateDetectionPlugin.cpp:329
CUmodule
BboxInfo
Definition: bboxUtils.h:40
polygraphy.backend.trt.util.check_profile
def check_profile(profile)
Definition: trt/util.py:274
test_ir.TestNode.output_tensor
output_tensor
Definition: test_ir.py:137
nvonnxparser::IOnnxConfig::getModelFileName
virtual const char * getModelFileName() const =0
Get the Model FileName.
bert::kSM_XAVIER
static constexpr int32_t kSM_XAVIER
Definition: qkvToContextInt8InterleavedPlugin.h:29
nvcaffeparser1::CaffeWeightFactory::getTmpAllocs
std::vector< void * > & getTmpAllocs()
Definition: caffeWeightFactory.cpp:88
pytorch_quantization.nn.modules._utils.QuantInputMixin._fake_quant
_fake_quant
Definition: _utils.py:122
onnx_graphsurgeon.importers.onnx_importer.OnnxImporter.get_opset
def get_opset(onnx.ModelProto model)
Definition: onnx_importer.py:71
sample::anonymous_namespace{sampleOptions.cpp}::getShapesInference
bool getShapesInference(Arguments &arguments, std::unordered_map< std::string, std::vector< int >> &shapes, const char *argument)
Definition: sampleOptions.cpp:258
nvinfer1::IPluginFactory
Plugin factory for deserialization.
Definition: NvInferRuntime.h:966
nvinfer1::plugin::PriorBox::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: priorBoxPlugin.cpp:234
nmtSample::DebugUtil::DumpTensorPlugin::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: debugUtil.cpp:75
half_float::detail::functions::isnormal
static bool isnormal(half arg)
Classification implementation.
Definition: samples/common/half.h:2369
pytorch_quantization.nn.modules.quant_rnn.QuantLSTMCell.reset_parameters
def reset_parameters(self)
Definition: modules/quant_rnn.py:402
nvinfer1::ITensor::setAllowedFormats
virtual void setAllowedFormats(TensorFormats formats)=0
Set allowed formats for this tensor.
UffPoolPluginV2::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: sampleUffPluginV2Ext.cpp:475
tests.quant_rnn_test.TestQuantLSTM.test_no_quant_input_hidden
def test_no_quant_input_hidden(self, verbose)
Definition: quant_rnn_test.py:310
nvinfer1::plugin::LReLU::LReLU
LReLU(float negSlope)
Definition: lReluPlugin.cpp:32
train.main
def main()
Definition: train.py:326
sample::InferenceOptions::skipTransfers
bool skipTransfers
Definition: sampleOptions.h:155
tests.calibrator_test.TestMaxCalibrator.test_simple_run
def test_simple_run(self)
Definition: calibrator_test.py:36
nmtSample::inferTypeToBytes
int inferTypeToBytes(nvinfer1::DataType t)
Definition: trtUtil.cpp:25
nvinfer1::plugin::ProposalLayer::mImageSize
nvinfer1::Dims mImageSize
Definition: proposalLayerPlugin.h:109
nvcaffeparser1::parseELU
nvinfer1::ILayer * parseELU(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &, BlobNameToTensor &tensors)
Definition: parseELU.cpp:23
text.cleaners.english_cleaners
def english_cleaners(text)
Definition: cleaners.py:98
sample::TrtCudaEvent
Managed CUDA event.
Definition: sampleDevice.h:107
sample::Profiler::mLayers
std::vector< LayerProfile > mLayers
Definition: sampleReporting.h:188
sample::ModelOptions::help
static void help(std::ostream &out)
Definition: sampleOptions.cpp:1155
backend.onnx.test_loader.TestOnnxFileLoader.test_basic
def test_basic(self)
Definition: onnx/test_loader.py:36
nvinfer1::PluginType
PluginType
The type values for the various plugins.
Definition: NvInferPluginUtils.h:37
half_float::detail::functions::trunc
static half trunc(half arg)
Truncation implementation.
Definition: samples/common/half.h:2094
nmtSample::ComponentWeights::ptr
std::shared_ptr< ComponentWeights > ptr
Definition: componentWeights.h:34
bert::Fused_multihead_attention_params_v2::s
int s
Definition: fused_multihead_attention_v2.h:56
polygraphy.common
Definition: tools/Polygraphy/polygraphy/common/__init__.py:1
bert::QKVToContextInterleavedPlugin::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: qkvToContextInt8InterleavedPlugin.cpp:167
bert::Fused_multihead_attention_params
Definition: fused_multihead_attention.h:48
BertInferenceRunner
Definition: infer_c.cpp:23
half_float::detail::bits::type
unsigned char type
Definition: samples/common/half.h:375
nvinfer1::plugin::GridAnchorBasePluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: gridAnchorPlugin.cpp:369
nvinfer1::plugin::BatchTilePlugin::mCopySize
size_t mCopySize
Definition: batchTilePlugin.h:80
nvinfer1::plugin::BatchedNMSDynamicPlugin::param
NMSParameters param
Definition: batchedNMSPlugin.h:112
polygraphy.backend.tf.runner.TfRunner.get_input_metadata
def get_input_metadata(self)
Definition: backend/tf/runner.py:69
nvinfer1::plugin::Normalize::eps
float eps
Definition: normalizePlugin.h:100
nvinfer1::INetworkDefinition::getNbInputs
virtual int32_t getNbInputs() const =0
Get the number of inputs in the network.
nvinfer1::plugin::CropAndResizePluginCreator::createPlugin
IPluginV2Ext * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: cropAndResizePlugin.cpp:268
model.Decoder.prenet_dim
prenet_dim
Definition: demo/Tacotron2/tacotron2/model.py:247
bert::QKVToContextInterleavedPlugin::mDqProbs
float mDqProbs
Definition: qkvToContextInt8InterleavedPlugin.h:90
nvinfer1::IDimensionExpr::isConstant
virtual bool isConstant() const =0
Return true if expression is a build-time constant.
tests.license_test.TestLicense.test_license
def test_license(self)
Definition: license_test.py:28
tools.test_run.TestOnnxrt.test_tf2onnxrt
def test_tf2onnxrt(self)
Definition: test_run.py:182
sample::InferenceOptions
Definition: sampleOptions.h:146
tests.quant_conv_transposed_test.TestQuantConvTranspose2D.test_weight_fake_quant_per_channel
def test_weight_fake_quant_per_channel(self)
Definition: quant_conv_transposed_test.py:88
anonymous_namespace{nvFasterRCNNPlugin.cpp}::RPROI_PLUGIN_NAME
const char * RPROI_PLUGIN_NAME
Definition: nvFasterRCNNPlugin.cpp:31
SampleMNIST::mEngine
std::shared_ptr< nvinfer1::ICudaEngine > mEngine
The TensorRT engine used to run the network.
Definition: sampleMNIST.cpp:94
nvinfer1::TacticSource
TacticSource
List of tactic sources for TensorRT.
Definition: NvInfer.h:6437
bert::fused_multihead_attention_v2_fp16_96_64_kernel_sm80_cubin
unsigned char fused_multihead_attention_v2_fp16_96_64_kernel_sm80_cubin[]
Definition: fused_multihead_attention_v2_fp16_96_64_kernel.sm80.cpp:19
nvinfer1::plugin::MultilevelCropAndResizePluginCreator::deserializePlugin
IPluginV2Ext * deserializePlugin(const char *name, const void *data, size_t length) override
Called during deserialization of plugin layer.
Definition: multilevelCropAndResizePlugin.cpp:84
bert::Fused_multihead_attention_params::qkv_stride_in_bytes
int64_t qkv_stride_in_bytes
Definition: fused_multihead_attention.h:58
tests.tensor_quant_test.TestTensorQuant.test_per_channel_scale
def test_per_channel_scale(self)
Definition: tensor_quant_test.py:57
tests.quant_conv_test.TestQuantConv1D.test_fake_quant_input
def test_fake_quant_input(self)
Definition: quant_conv_test.py:327
MaskRCNNConfig::POST_NMS_ROIS_INFERENCE
static const int POST_NMS_ROIS_INFERENCE
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:83
tools.test_run.TestTrt
Definition: test_run.py:60
backend.onnx.test_loader.TestModifyOnnx.test_layerwise
def test_layerwise(self)
Definition: onnx/test_loader.py:54
parserutils::divUp
A divUp(A m, B n)
Definition: parserUtils.h:135
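The name divUp suggests a rounding-up integer division, which is commonly used to compute how many blocks of size n are needed to cover m items. The listing shows only the signature, so the following is a minimal sketch under that assumption and not the parser's actual code; the Python name `div_up` and the input check are illustrative.

```python
def div_up(m: int, n: int) -> int:
    """Ceiling division for non-negative m and positive n (assumed semantics)."""
    if n <= 0:
        raise ValueError("n must be positive")
    # Adding n - 1 before floor division rounds any non-zero remainder up.
    return (m + n - 1) // n


if __name__ == "__main__":
    # Ten items in blocks of four need three blocks.
    print(div_up(10, 4))
```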
SampleUffSSDParams::nbCalBatches
int32_t nbCalBatches
The number of batches for calibration.
Definition: sampleUffSSD.cpp:54
SampleUffMNIST::constructNetwork
void constructNetwork(SampleUniquePtr< nvuffparser::IUffParser > &parser, SampleUniquePtr< nvinfer1::INetworkDefinition > &network)
Parses a Uff model for MNIST and creates a TensorRT network.
Definition: sampleUffMNIST.cpp:173
nvinfer1::plugin::ProposalLayerPluginCreator::mFC
static PluginFieldCollection mFC
Definition: proposalLayerPlugin.h:134
nmtSample::Attention::getAttentionSize
virtual int getAttentionSize()=0
Get the size of the attention vector.
nvinfer1::plugin::ProposalPlugin::isOutputBroadcastAcrossBatch
bool isOutputBroadcastAcrossBatch(int outputIndex, const bool *inputIsBroadcasted, int nbInputs) const override
Definition: proposalPlugin.cpp:283
bert::EmbLayerNormVarSeqlenPlugin::mBetaDev
bert::cuda_unique_ptr< float > mBetaDev
Definition: embLayerNormVarSeqlenPlugin.h:89
sample::InferenceTrace::operator=
InferenceTrace & operator=(const InferenceTrace &)=default
nmtSample::MultiplicativeAlignment::mInputChannelCount
int mInputChannelCount
Definition: multiplicativeAlignment.h:53
data_function.TextMelLoader
Definition: tacotron2/data_function.py:26
sample::Bindings::dumpBindingValues
void dumpBindingValues(int binding, std::ostream &os, const std::string &separator=" ") const
Definition: sampleUtils.h:378
nvinfer1::IIteratorLayer
Definition: NvInfer.h:4622
comparator.test_comparator.TestComparator.test_list_as_data_loader
def test_list_as_data_loader(self)
Definition: test_comparator.py:40
nmtSample::SLPAttention::mWeights
ComponentWeights::ptr mWeights
Definition: slpAttention.h:47
nvinfer1::OptProfileSelector::kMAX
@ kMAX
This is used to set or get the maximum permitted value for dynamic dimensions etc.
SampleFasterRCNN::mPPMs
std::vector< samplesCommon::PPM< kIMG_CHANNELS, kIMG_H, kIMG_W > > mPPMs
PPMs of test images.
Definition: sampleFasterRCNN.cpp:90
BatchStream::reset
void reset(int firstBatch)
Definition: frcnnUtils.h:244
SampleCharRNNBindingNames::HIDDEN_OUT_BLOB_NAME
const char * HIDDEN_OUT_BLOB_NAME
Definition: sampleCharRNN.cpp:90
samplesCommon::volume
int64_t volume(const nvinfer1::Dims &d)
Definition: samples/common/common.h:627
nvinfer1::plugin::ResizeNearest::isOutputBroadcastAcrossBatch
bool isOutputBroadcastAcrossBatch(int outputIndex, const bool *inputIsBroadcasted, int nbInputs) const override
Definition: resizeNearestPlugin.cpp:228
nvinfer1::plugin::InstanceNormalizationPlugin::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: instanceNormalizationPlugin.cpp:267
half_float::detail::functions::erf
static double erf(double arg)
Definition: samples/common/half.h:2476
nvinfer1::ActivationType::kSCALED_TANH
@ kSCALED_TANH
Scaled tanh activation: alpha*tanh(beta*x)
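The formula alpha*tanh(beta*x) can be checked numerically with a scalar sketch. This is not TensorRT code; the function name `scaled_tanh` is hypothetical, and `alpha` and `beta` simply stand for the two coefficients in the formula.

```python
import math


def scaled_tanh(x: float, alpha: float, beta: float) -> float:
    """Evaluate alpha * tanh(beta * x) for a single value."""
    return alpha * math.tanh(beta * x)


if __name__ == "__main__":
    # The output is bounded by +/- alpha; beta controls how fast it saturates.
    for x in (-2.0, 0.0, 2.0):
        print(x, scaled_tanh(x, alpha=2.0, beta=0.5))
```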
nvinfer1::plugin::GenerateDetection::getWorkspaceSize
size_t getWorkspaceSize(int maxBatchSize) const override
Definition: generateDetectionPlugin.cpp:249
samplesCommon::BBox::x1
float x1
Definition: samples/common/common.h:669
nmtSample::SLPAttention
Linear attention calculation.
Definition: slpAttention.h:34
sample::SystemOptions::help
static void help(std::ostream &out)
Definition: sampleOptions.cpp:1229
half_float::detail::functions::remquo
static expr remquo(float x, float y, int *quo)
Remainder implementation.
Definition: samples/common/half.h:1688
bert::fused_multihead_attention_v2_int8_384_64_kernel_sm80_cubin
unsigned char fused_multihead_attention_v2_int8_384_64_kernel_sm80_cubin[]
Definition: fused_multihead_attention_v2_int8_384_64_kernel.sm80.cpp:19
nvinfer1::plugin::BaseCreator
Definition: plugin.h:60
nvcaffeparser1::WeightType::kBIAS
@ kBIAS
nvinfer1::IResizeLayer
A resize layer in a network definition.
Definition: NvInfer.h:4364
polygraphy.backend.trt_legacy.TrtLegacyRunner.deactivate_impl
def deactivate_impl(self)
Definition: backend/trt_legacy.py:325
onnx_graphsurgeon.logger.logger.LoggerIndent.__exit__
def __exit__(self, exc_type, exc_value, traceback)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:36
nvinfer1::plugin::RPROIPluginCreator::anchorsScales
std::vector< float > anchorsScales
Definition: nvFasterRCNNPlugin.h:124
half_float::detail::binary_t
Tag type for binary construction.
Definition: ieee_half.h:346
bert::SkipLayerNormInterleavedPlugin::setPluginNamespace
void setPluginNamespace(const char *pluginNamespace) override
Set the namespace that this plugin object belongs to.
Definition: skipLayerNormInt8InterleavedPlugin.cpp:257
nmtSample::DeviceBuffer::DeviceBuffer
DeviceBuffer(size_t elementCount)
Definition: deviceBuffer.h:32
samplesCommon::setAllTensorScales
void setAllTensorScales(INetworkDefinition *network, float inScales=2.0f, float outScales=4.0f)
Definition: samples/common/common.h:520
tests.fixtures.models.resnet18
def resnet18()
Definition: tools/pytorch-quantization/tests/fixtures/models.py:62
nvinfer1::IPluginV2::serialize
virtual void serialize(void *buffer) const =0
Serialize the layer.
half_float::detail::functions::log1p
static expr log1p(float arg)
Logarithm implementation.
Definition: samples/common/half.h:1815
sample::anonymous_namespace{sampleInference.cpp}::EventType::kOUTPUT_S
@ kOUTPUT_S
bert::fused_multihead_attention_v2_fp16_384_64_kernel_sm80_cubin_len
unsigned int fused_multihead_attention_v2_fp16_384_64_kernel_sm80_cubin_len
Definition: fused_multihead_attention_v2_fp16_384_64_kernel.sm80.cpp:6003
polygraphy.backend.trt_legacy.TrtLegacyRunner.max_workspace_size
max_workspace_size
Definition: backend/trt_legacy.py:216
helpers.tokenization.BertTokenizer.vocab
vocab
Definition: tokenization.py:188
polygraphy.backend.cntk.runner.CNTKRunner.__init__
def __init__(self, model, name=None)
Definition: backend/cntk/runner.py:26
polygraphy.util.misc.str_from_layer
def str_from_layer(prefix, index, name, op, input_info, output_info)
Definition: Polygraphy/polygraphy/util/misc.py:405
tests.clip_test.TestClip.test_simple_run
def test_simple_run(self)
Definition: clip_test.py:38
nvinfer1::plugin::DetectionLayer::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: detectionLayerPlugin.cpp:180
nvinfer1::plugin::BatchedNMSPlugin::setClipParam
void setClipParam(bool clip)
Definition: batchedNMSPlugin.cpp:449
nvinfer1::plugin::MultilevelProposeROI::check_valid_inputs
void check_valid_inputs(const nvinfer1::Dims *inputs, int nbInputDims)
Definition: multilevelProposeROIPlugin.cpp:292
half_float::detail::functions::round
static half round(half arg)
Nearest integer implementation.
Definition: samples/common/half.h:2102
pytorch_quantization.nn.modules.clip.Clip.clip_value_min
clip_value_min
Definition: clip.py:45
FCPlugin
Definition: samples/opensource/samplePlugin/fcPlugin.h:36
polygraphy.backend.tf.loader.GraphFromKeras.__call__
def __call__(self)
Definition: backend/tf/loader.py:122
model.Decoder.initialize_decoder_states
def initialize_decoder_states(self, memory)
Definition: demo/Tacotron2/tacotron2/model.py:304
nvinfer1::plugin::DetectionLayer::mType
nvinfer1::DataType mType
Definition: detectionLayerPlugin.h:102
nvinfer1::plugin::BatchedNMSDynamicPlugin::configurePlugin
void configurePlugin(const DynamicPluginTensorDesc *in, int nbInputs, const DynamicPluginTensorDesc *out, int nbOutputs) override
Definition: batchedNMSPlugin.cpp:305
nvinfer1::IAlgorithm::~IAlgorithm
virtual ~IAlgorithm()
Definition: NvInfer.h:6294
polygraphy.util.misc.FreeOnException.__enter__
def __enter__(self)
Definition: Polygraphy/polygraphy/util/misc.py:445
backend.trt.test_loader.TestNetworkFromOnnxPath
Definition: trt/test_loader.py:105
tools.test_run.TestTrt.test_trt_explicit_profile_opt_runtime_shapes_differ
def test_trt_explicit_profile_opt_runtime_shapes_differ(self)
Definition: test_run.py:122
tests.quant_conv_test.TestQuantConv2D.test_fake_quant_per_channel_other_prec
def test_fake_quant_per_channel_other_prec(self)
Definition: quant_conv_test.py:164
nvinfer1::TensorFormats
uint32_t TensorFormats
It is capable of representing one or more TensorFormat by binary OR operations, e....
Definition: NvInferRuntimeCommon.h:210
bert::SkipLayerNormInterleavedPlugin::mLd
size_t mLd
Definition: skipLayerNormInt8InterleavedPlugin.h:76
text.cleaners.expand_numbers
def expand_numbers(text)
Definition: cleaners.py:67
nvinfer1::plugin::scaleShiftChannelsInplace
void scaleShiftChannelsInplace(T *inOut, const int B, const int C, const int channelVolume, const float *beta, const float *gamma, cudaStream_t stream)
nmtSample::BeamSearchPolicy::ptr
std::shared_ptr< BeamSearchPolicy > ptr
Definition: beamSearchPolicy.h:36
polygraphy.logger.logger.LoggerIndent.old_indent
old_indent
Definition: Polygraphy/polygraphy/logger/logger.py:46
nvinfer1::plugin::RPROIPluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: nvFasterRCNNPlugin.h:125
SampleFasterRCNN::infer
bool infer()
Runs the TensorRT inference engine for this sample.
Definition: sampleFasterRCNN.cpp:205
tests.quant_linear_test.TestQuantLinear.test_test_input_fake_per_tensor
def test_test_input_fake_per_tensor(self)
Definition: quant_linear_test.py:89
polygraphy.tools.run.run.Run.__init__
def __init__(self)
Definition: run.py:99
SampleMaskRCNNParams::uffFileName
std::string uffFileName
Definition: sampleUffMaskRCNN.cpp:306
test_ir.TestGraph.test_io_cannot_be_sync_list_on_assign
def test_io_cannot_be_sync_list_on_assign(self)
Definition: test_ir.py:789
onnx_models.Model.assert_equal
def assert_equal(self, Graph graph)
Definition: onnx_models.py:45
loss_functions.get_loss_function
def get_loss_function(loss_function, sigma=1.0)
Definition: loss_functions.py:23
nvinfer1::CUDADriverWrapper::cuLaunchCooperativeKernel
CUresult cuLaunchCooperativeKernel(CUfunction f, unsigned int gridDimX, unsigned int gridDimY, unsigned int gridDimZ, unsigned int blockDimX, unsigned int blockDimY, unsigned int blockDimZ, unsigned int sharedMemBytes, CUstream hStream, void **kernelParams) const
Definition: cudaDriverWrapper.cpp:124
bert::TFusedMultiHeadAttentionXMMAKernel
Definition: fused_multihead_attention.h:189
SampleINT8API::teardown
sample::Logger::TestResult teardown()
Used to clean up any state created in the sample class.
Definition: sampleINT8API.cpp:638
samplesCommon::Args
Struct to maintain command-line arguments.
Definition: argsParser.h:79
test_examples.load_code_blocks_from_readme
def load_code_blocks_from_readme(readme)
Definition: Polygraphy/tests/test_examples.py:33
util.test_cuda.TestStream
Definition: test_cuda.py:101
model.WaveGlow
Definition: demo/Tacotron2/waveglow/model.py:157
half_float::detail::isinf
bool isinf(half arg)
Check for infinity.
Definition: ieee_half.h:2736
onnx_graphsurgeon.logger.logger.Logger.line_info
line_info
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:107
nvinfer1::plugin::DetectionOutput::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: nmsPlugin.cpp:78
nvinfer1::plugin::ProposalPlugin::mBboxMinSize
float mBboxMinSize
Definition: proposalPlugin.h:115
half_float::half::operator++
half & operator++()
Prefix increment.
Definition: samples/common/half.h:1523
bert::getParamWordType
static DataType getParamWordType(DataType cfgType)
Definition: skipLayerNormInt8InterleavedPlugin.cpp:53
bert::DATA_TYPE_BOOL
@ DATA_TYPE_BOOL
Definition: fused_multihead_attention_common.h:24
nvinfer1::plugin::RegionPluginCreator::mFC
static PluginFieldCollection mFC
Definition: regionPlugin.h:120
nmtSample::Vocabulary::ptr
std::shared_ptr< Vocabulary > ptr
Definition: vocabulary.h:37
test_ir.TestGraph.test_tensors_includes_non_node_tensors
def test_tensors_includes_non_node_tensors(self)
Definition: test_ir.py:579
nvinfer1::plugin::BatchedNMSDynamicPlugin::supportsFormatCombination
bool supportsFormatCombination(int pos, const PluginTensorDesc *inOut, int nbInputs, int nbOutputs) override
Definition: batchedNMSPlugin.cpp:336
util.test_misc.FindInDictCase.expected
expected
Definition: test_misc.py:39
sample::BuildOptions::inputFormats
std::vector< IOFormat > inputFormats
Definition: sampleOptions.h:125
nvinfer1::plugin::BatchedNMSPlugin::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: batchedNMSPlugin.cpp:355
bert::sMhaKernelMetaInfos
static const struct bert::FusedMultiHeadAttentionKernelMetaInfoV1 sMhaKernelMetaInfos[]
Semaphore
Definition: sampleMovieLensMPS.cpp:80
onnx_graphsurgeon.ir.tensor.Variable.outputs
outputs
Definition: tensor.py:166
polygraphy.backend.trt.util.mark_layerwise
def mark_layerwise(network)
Definition: trt/util.py:225
polygraphy.tools.util.script.Script.get_runners
def get_runners(self)
Definition: script.py:187
tools.test_run.TestOther
Definition: test_run.py:221
onnx_graphsurgeon.ir.graph.Graph.level
level
Definition: graph.py:265
models.meta.Model.path
path
Definition: meta.py:32
bert::fused_multihead_attention_v2_fp16_384_64_kernel_sm80_cubin
unsigned char fused_multihead_attention_v2_fp16_384_64_kernel_sm80_cubin[]
Definition: fused_multihead_attention_v2_fp16_384_64_kernel.sm80.cpp:19
pytorch_quantization.nn.modules.quant_conv.QuantConv2d.forward
def forward(self, input)
Definition: quant_conv.py:117
nvinfer1::plugin::ProposalPlugin::enqueue
int enqueue(int batchSize, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: proposalPlugin.cpp:188
nvinfer1::plugin::GenerateDetection::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: generateDetectionPlugin.cpp:190
pytorch_quantization.calib.histogram.HistogramCalibrator._compute_amax_percentile
def _compute_amax_percentile(self, percentile)
Definition: histogram.py:196
bert::SkipLayerNormInterleavedPlugin::destroy
void destroy() override
Destroy the plugin object.
Definition: skipLayerNormInt8InterleavedPlugin.cpp:248
nvinfer1::IGatherLayer
Definition: NvInfer.h:2674
SampleUffFasterRcnnParams::outputClsName
std::string outputClsName
Definition: sampleUffFasterRCNN.cpp:63
nvinfer1::plugin::PyramidROIAlign::isOutputBroadcastAcrossBatch
bool isOutputBroadcastAcrossBatch(int outputIndex, const bool *inputIsBroadcasted, int nbInputs) const override
Definition: pyramidROIAlignPlugin.cpp:254
polygraphy.logger.logger.Logger.error
def error(self, message, mode=LogMode.EACH)
Definition: Polygraphy/polygraphy/logger/logger.py:282
sample::networkToEngine
ICudaEngine * networkToEngine(const BuildOptions &build, const SystemOptions &sys, IBuilder &builder, INetworkDefinition &network, std::ostream &err)
Create an engine for a network definition.
Definition: sampleEngines.cpp:301
nvinfer1::plugin::DetectionOutputParameters::numClasses
int32_t numClasses
Definition: NvInferPluginUtils.h:184
data_function.TextMelLoader.load_mel_from_disk
load_mel_from_disk
Definition: tacotron2/data_function.py:37
bert::QKVToContextInterleavedPlugin::destroy
void destroy() override
Destroy the plugin object.
Definition: qkvToContextInt8InterleavedPlugin.cpp:206
SampleMNISTAPI::verifyOutput
bool verifyOutput(const samplesCommon::BufferManager &buffers)
Classifies digits and verifies the result.
Definition: sampleMNISTAPI.cpp:355
polygraphy.tools.args.tf2onnx.loader.Tf2OnnxLoaderArgs.tf_loader_args
tf_loader_args
Definition: tools/args/tf2onnx/loader.py:33
polygraphy.util.cuda.DeviceBuffer.free
def free(self)
Definition: cuda.py:139
BatchStream::next
bool next()
Definition: frcnnUtils.h:253
builder.BertConfig.__init__
def __init__(self, bert_config_path, use_fp16, use_int8, use_strict, use_fc2_gemm, use_int8_skipln, use_int8_multihead, use_qat)
Definition: builder.py:89
bert::FusedMultiHeadAttentionKernelMetaInfoV2::mCubinSize
unsigned int mCubinSize
Definition: fused_multihead_attention_v2.h:165
SampleMNISTAPI::constructNetwork
bool constructNetwork(SampleUniquePtr< nvinfer1::IBuilder > &builder, SampleUniquePtr< nvinfer1::INetworkDefinition > &network, SampleUniquePtr< nvinfer1::IBuilderConfig > &config)
Uses the API to create the MNIST Network.
Definition: sampleMNISTAPI.cpp:173
bert::fused_multihead_attention_v2_int8_384_64_kernel_sm75_cubin_len
unsigned int fused_multihead_attention_v2_int8_384_64_kernel_sm75_cubin_len
Definition: fused_multihead_attention_v2_int8_384_64_kernel.sm75.cpp:18693
nvinfer1::plugin::PriorBoxParameters::offset
float offset
Definition: NvInferPluginUtils.h:100
SampleUffSSDParams::visualThreshold
float visualThreshold
The minimum score threshold to consider a detection.
Definition: sampleUffSSD.cpp:56
RefineNMSParameters::keepTopK
int keepTopK
Definition: maskRCNNKernels.h:86
nvcaffeparser1::CaffeWeightFactory::allocateWeights
nvinfer1::Weights allocateWeights(int64_t elems, std::uniform_real_distribution< float > distribution=std::uniform_real_distribution< float >(-0.01f, 0.01F))
Definition: caffeWeightFactory.cpp:199
model.Tacotron2.embedding
embedding
Definition: demo/Tacotron2/tacotron2/model.py:593
tests.quant_instancenorm_test.TestQuantInstanceNorm1D
Definition: quant_instancenorm_test.py:46
half_float::detail::functions::nanh
static half nanh()
Get NaN.
Definition: samples/common/half.h:1759
nvcaffeparser1::CaffeWeightFactory::getBlobProtoDataType
static trtcaffe::Type getBlobProtoDataType(const trtcaffe::BlobProto &blobMsg)
Definition: caffeWeightFactory.cpp:251
nmtSample::resizeWeights
std::vector< float > resizeWeights(int rows, int cols, int rowsNew, int colsNew, const float *memory)
Definition: trtUtil.cpp:41
SampleBuffer
Definition: sampleReformatFreeIO.cpp:133
nvinfer1::plugin::MultilevelCropAndResize::getWorkspaceSize
size_t getWorkspaceSize(int) const override
Definition: multilevelCropAndResizePlugin.cpp:120
nvinfer1::ISoftMaxLayer
A Softmax layer in a network definition.
Definition: NvInfer.h:2193
nvinfer1::plugin::RPROIPlugin::anchorsDev
float * anchorsDev
Definition: nvFasterRCNNPlugin.h:94
nvinfer1::plugin::InstanceNormalizationPlugin::_h_bias
std::vector< float > _h_bias
Definition: instanceNormalizationPlugin.h:93
onnx_models.dim_param_model
def dim_param_model()
Definition: onnx_models.py:103
nvinfer1::plugin::reduced_divisor::div
__host__ __device__ __forceinline__ int div(int x) const
Definition: reducedMathPlugin.h:69
nvcaffeparser1::BlobNameToTensor::mMap
std::map< std::string, nvinfer1::ITensor * > mMap
Definition: blobNameToTensor.h:67
nvinfer1::plugin::CoordConvACPlugin::~CoordConvACPlugin
~CoordConvACPlugin() override=default
nvinfer1::plugin::Reorg::supportsFormat
bool supportsFormat(DataType type, PluginFormat format) const override
Check format support.
Definition: reorgPlugin.cpp:88
nvinfer1::plugin::Normalize::clone
IPluginV2Ext * clone() const override
Clone the plugin object.
Definition: normalizePlugin.cpp:245
samplesCommon::BufferManager::getDeviceBuffer
void * getDeviceBuffer(const std::string &tensorName) const
Returns the device buffer corresponding to tensorName.
Definition: buffers.h:295
nmtSample::DataWriter
Writer of sequences of data.
Definition: dataWriter.h:33
SampleCharRNNBindingNames::SEQ_LEN_IN_BLOB_NAME
const char * SEQ_LEN_IN_BLOB_NAME
Definition: sampleCharRNN.cpp:93
SampleMLP::mNumber
int mNumber
The number to classify.
Definition: sampleMLP.cpp:87
pytorch_quantization.tensor_quant.TensorQuantFunction.backward
def backward(ctx, grad_outputs, grad_scale)
Definition: tensor_quant.py:281
nvinfer1::DimsHW::w
int32_t w() const
Get the width.
Definition: NvInfer.h:154
pytorch_quantization.nn.modules.quant_pooling.QuantMaxPool3d.__init__
def __init__(self, kernel_size, stride=None, padding=0, dilation=1, return_indices=False, ceil_mode=False, **kwargs)
Definition: quant_pooling.py:64
nvinfer1::WeightsRole::kKERNEL
@ kKERNEL
Kernel for IConvolutionLayer, IDeconvolutionLayer, or IFullyConnectedLayer.
half_float::half::functions
friend struct detail::functions
Definition: ieee_half.h:1086
tests.tensor_quant_test.TestTensorQuant.test_full_range
def test_full_range(self)
Definition: tensor_quant_test.py:125
polygraphy.tools.args.tf.runner.TfRunnerArgs.tf_config_args
tf_config_args
Definition: tools/args/tf/runner.py:34
nvinfer1::ActivationType::kSOFTSIGN
@ kSOFTSIGN
Softsign activation: x / (1+|x|)
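The softsign formula x / (1+|x|) maps any real input into the open interval (-1, 1). The scalar sketch below only illustrates the formula and is not TensorRT code; the function name `softsign` is chosen for illustration.

```python
def softsign(x: float) -> float:
    """Evaluate x / (1 + |x|) for a single value."""
    return x / (1.0 + abs(x))


if __name__ == "__main__":
    # The output approaches +/- 1 as |x| grows but never reaches it.
    for x in (-3.0, 0.0, 1.0, 1000.0):
        print(x, softsign(x))
```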
backend.tf.test_runner.TestTfRunner.test_basic
def test_basic(self)
Definition: tf/test_runner.py:33
onnx_graphsurgeon.logger.logger.LoggerSuppress.old_severity
old_severity
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:44
bert::FusedMultiHeadAttentionKernelMetaInfoV1
Definition: fused_multihead_attention.h:115
polygraphy.tools.inspect.subtool.model
Definition: tools/Polygraphy/polygraphy/tools/inspect/subtool/model.py:1
pytorch_quantization.nn.modules.quant_conv.QuantConv1d
Definition: quant_conv.py:178
nvuffparser::IPluginFactoryExt::getVersion
virtual int32_t getVersion() const
Definition: NvUffParser.h:132
tests.utils.quantize_by_range_fused
def quantize_by_range_fused(x_tuple, num_bits)
Definition: tools/pytorch-quantization/tests/utils.py:32
samplesCommon::GenericBuffer::~GenericBuffer
~GenericBuffer()
Definition: buffers.h:166
bert::anonymous_namespace{skipLayerNormInt8InterleavedPlugin.cpp}::SKIP_LAYER_NORM_INTERLEAVED_NAME
static const char * SKIP_LAYER_NORM_INTERLEAVED_NAME
Definition: skipLayerNormInt8InterleavedPlugin.cpp:42
polygraphy.tools.args.onnxrt.runner.OnnxrtRunnerArgs
Definition: tools/args/onnxrt/runner.py:20
nvinfer1::ISelectLayer
Definition: NvInfer.h:4718
nvinfer1::plugin::MultilevelProposeROIPluginCreator::mFGThreshold
float mFGThreshold
Definition: multilevelProposeROIPlugin.h:141
bert::EmbLayerNormVarSeqlenPlugin::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: embLayerNormVarSeqlenPlugin.cpp:347
sample::dumpOutputs
void dumpOutputs(const nvinfer1::IExecutionContext &context, const Bindings &bindings, std::ostream &os)
Print output tensors to stream.
Definition: sampleReporting.cpp:329
stft.STFT.forward_transform
forward_transform
Definition: stft.py:50
nvinfer1::plugin::PyramidROIAlignPluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: pyramidROIAlignPlugin.cpp:53
inference_varseqlen.inference
def inference(features, tokens)
Definition: inference_varseqlen.py:143
half_float::detail::enable< T, expr, expr, void >::type
T type
Definition: ieee_half.h:378
polygraphy.backend.trt_legacy.TrtLegacyRunner.context
context
Definition: backend/trt_legacy.py:311
nvinfer1::plugin::Normalize::mNbWeights
int mNbWeights
Definition: normalizePlugin.h:97
nvinfer1::plugin::NMSPluginCreator::params
DetectionOutputParameters params
Definition: nmsPlugin.h:115
nvinfer1::plugin::PriorBox::configurePlugin
void configurePlugin(const Dims *inputDims, int nbInputs, const Dims *outputDims, int nbOutputs, const DataType *inputTypes, const DataType *outputTypes, const bool *inputIsBroadcast, const bool *outputIsBroadcast, PluginFormat floatFormat, int maxBatchSize) override
Definition: priorBoxPlugin.cpp:314
nmtSample::Vocabulary::getEndSequenceId
int getEndSequenceId() override
Definition: vocabulary.cpp:98
bert::SkipLayerNormInterleavedPlugin::configurePlugin
void configurePlugin(const nvinfer1::DynamicPluginTensorDesc *in, int nbInputs, const nvinfer1::DynamicPluginTensorDesc *out, int nbOutputs) override
Definition: skipLayerNormInt8InterleavedPlugin.cpp:129
SampleMovieLensArgs::strict
bool strict
Definition: sampleMovieLens.cpp:528
tests.quant_instancenorm_test.TestQuantInstanceNorm3D.test_fake_quant_per_tensor
def test_fake_quant_per_tensor(self)
Definition: quant_instancenorm_test.py:166
nvinfer1::plugin::PyramidROIAlign::attachToContext
void attachToContext(cudnnContext *cudnnContext, cublasContext *cublasContext, IGpuAllocator *gpuAllocator) override
Attach the plugin object to an execution context and grant the plugin the access to some context reso...
Definition: pyramidROIAlignPlugin.cpp:286
nmtSample::DebugUtil::DumpTensorPlugin::mTensorVolume
int mTensorVolume
Definition: debugUtil.h:70
nvinfer1::ILoopOutputLayer::setInput
void setInput(int32_t index, ITensor &tensor)=0
Append or replace an input of this layer with a specific tensor.
test_ir.TestNodeIO.test_remove
def test_remove(self, field_names)
Definition: test_ir.py:253
nvinfer1::DeviceType::kDLA
@ kDLA
DLA Core.
sample::anonymous_namespace{sampleOptions.cpp}::splitToStringVec
std::vector< std::string > splitToStringVec(const std::string &option, char separator)
Definition: sampleOptions.cpp:36
nvinfer1::BuilderFlag::kGPU_FALLBACK
@ kGPU_FALLBACK
Enable layers marked to execute on GPU if layer cannot execute on DLA.
samplesCommon::Args::batch
int32_t batch
Definition: argsParser.h:85
nvinfer1::SliceMode::kDEFAULT
@ kDEFAULT
Fail with error when the coordinates are out of bounds. This is the default.
pytorch_quantization.nn.modules.quant_rnn.QuantLSTM
Definition: modules/quant_rnn.py:290
nvonnxparser::IParser::supportsOperator
virtual bool supportsOperator(const char *op_name) const =0
Returns whether the specified operator may be supported by the parser.
samplesCommon::BufferManager::copyOutputToHostAsync
void copyOutputToHostAsync(const cudaStream_t &stream=0)
Copy the contents of output device buffers to output host buffers asynchronously.
Definition: buffers.h:419
half_float::detail::int2half_impl
uint16 int2half_impl(T value)
Convert integer to half-precision floating point.
Definition: ieee_half.h:692
EntropyCalibratorImpl::getBatch
bool getBatch(void *bindings[], const char *names[], int nbBindings)
Definition: EntropyCalibrator.h:54
nmtSample::SLPProjection::~SLPProjection
~SLPProjection() override=default
samplesCommon::HostAllocator::operator()
bool operator()(void **ptr, size_t size) const
Definition: buffers.h:200
polygraphy.backend.trt.runner.TrtRunner.context
context
Definition: backend/trt/runner.py:86
pytorch_quantization.utils.amp_wrapper.half_function
def half_function(fn)
Definition: amp_wrapper.py:24
polygraphy.tools.args.tf.runner.TfRunnerArgs.parse
def parse(self, args)
Definition: tools/args/tf/runner.py:42
nvinfer1::plugin::InstanceNormalizationPluginCreator
Definition: instanceNormalizationPlugin.h:102
tests.print_test.TestPrint
Definition: print_test.py:33
polygraphy.backend.pyt.runner.PytRunner.input_metadata
input_metadata
Definition: backend/pyt/runner.py:39
nvinfer1::plugin::RPROIPlugin::enqueue
int enqueue(int batchSize, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: nvFasterRCNNPlugin.cpp:148
nvcaffeparser1::CaffeParser::setErrorRecorder
void setErrorRecorder(nvinfer1::IErrorRecorder *recorder) override
Set the ErrorRecorder for this interface.
Definition: caffeParser.h:59
nvinfer1::Dims
Structure to define the dimensions of a tensor.
Definition: NvInferRuntimeCommon.h:194
builder_varseqlen.bert_model
def bert_model(config, init_dict, network, input_tensor, mask_idx, cu_seqlens, max_seqlen)
Definition: builder_varseqlen.py:295
test_ir.TestNodeIO
Definition: test_ir.py:221
nvinfer1::plugin::Region::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: regionPlugin.cpp:381
bert::TFusedMultiHeadAttentionXMMAKernel::mKernelMetaCount
unsigned int mKernelMetaCount
Definition: fused_multihead_attention.h:279
ParserOnnxConfig::mVerbosity
nvonnxparser::IOnnxConfig::Verbosity mVerbosity
Definition: parserOnnxConfig.h:48
bert::FusedMultiHeadAttentionKernelMetaInfoV1::mD
unsigned int mD
Definition: fused_multihead_attention.h:119
nvinfer1::plugin::SpecialSlice::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: specialSlicePlugin.cpp:77
tests.quant_conv_transposed_test.TestQuantConvTranspose1D.test_fake_quant_per_channel_bias
def test_fake_quant_per_channel_bias(self)
Definition: quant_conv_transposed_test.py:476
samplesCommon::HostMemory
Definition: samples/common/common.h:306
nvinfer1::plugin::ProposalPlugin::mInputWidth
size_t mInputWidth
Definition: proposalPlugin.h:110
nvinfer1::IPluginCreator::getFieldNames
virtual const PluginFieldCollection * getFieldNames()=0
Return a list of fields that needs to be passed to createPlugin.
nvinfer1::plugin::ResizeNearestPluginCreator::ResizeNearestPluginCreator
ResizeNearestPluginCreator()
Definition: resizeNearestPlugin.cpp:39
nvcaffeparser1::CaffeParser::parseBuffers
const IBlobNameToTensor * parseBuffers(const char *deployBuffer, size_t deployLength, const char *modelBuffer, size_t modelLength, nvinfer1::INetworkDefinition &network, nvinfer1::DataType weightType) override
Definition: caffeParser.cpp:311
nvinfer1::plugin::Region::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: regionPlugin.cpp:304
samplesCommon::setDummyInt8Scales
void setDummyInt8Scales(const IBuilderConfig *c, INetworkDefinition *n)
Definition: samples/common/common.h:563
nvinfer1::plugin::ProposalLayer::mMaxBatchSize
int mMaxBatchSize
Definition: proposalLayerPlugin.h:102
builder.squad_output
def squad_output(prefix, config, init_dict, network, input_tensor)
Definition: builder.py:338
comparator.test_comparator.TestComparator.test_validate_nan
def test_validate_nan(self)
Definition: test_comparator.py:133
half_float::detail::operator<=
enable< bool, T, U >::type operator<=(T x, U y)
Comparison for less equal.
Definition: ieee_half.h:2155
MNISTBatchStream::getBatchSize
int getBatchSize() const override
Definition: BatchStream.h:88
nvinfer1::plugin::CoordConvACPluginCreator::~CoordConvACPluginCreator
~CoordConvACPluginCreator() override=default
nvinfer1::plugin::Normalize::mCublas
cublasHandle_t mCublas
Definition: normalizePlugin.h:94
FCPlugin::convertAndCopyToDevice
void convertAndCopyToDevice(void *&deviceWeights, const nvinfer1::Weights &weights)
Definition: samples/opensource/samplePlugin/fcPlugin.h:251
polygraphy.backend.onnx.util.get_input_metadata
def get_input_metadata(graph)
Definition: onnx/util.py:157
nvinfer1::plugin::SpecialSlice::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: specialSlicePlugin.cpp:82
SampleINT8APIArgs::verbose
bool verbose
Definition: sampleINT8API.cpp:649
nvinfer1::plugin::GridAnchorGenerator::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: gridAnchorPlugin.cpp:212
polygraphy.tools.surgeon.subtool.extract.Extract.run
def run(self, args)
Definition: extract.py:50
backend.trt.test_loader.TestConfigLoader.test_calibrator_metadata_set
def test_calibrator_metadata_set(self, identity_builder_network)
Definition: trt/test_loader.py:225
SampleReformatFreeIO::verifyOutput
bool verifyOutput(SampleBuffer &outputBuf, int groundTruthDigit) const
Verifies that the output is correct and prints it.
Definition: sampleReformatFreeIO.cpp:476
prn_utils.filterData
def filterData(data, allFeatures, selection)
Definition: prn_utils.py:82
pytorch_quantization.tensor_quant.ScaledQuantDescriptor.__dict__
__dict__
Definition: tensor_quant.py:172
pytorch_quantization.nn.modules.quant_rnn.QuantLSTM.__init__
def __init__(self, *args, **kwargs)
Definition: modules/quant_rnn.py:295
fp16::__float2half
__half __float2half(float f)
Definition: fp16.h:32
sample::InferenceOptions::warmup
int warmup
Definition: sampleOptions.h:150
nvcaffeparser1::parseLRN
nvinfer1::ILayer * parseLRN(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &, BlobNameToTensor &tensors)
Definition: parseLRN.cpp:23
model
Definition: demo/Tacotron2/tacotron2/model.py:1
sample::LogStreamConsumerBuffer::sync
virtual int sync()
Definition: samples/common/logging.h:66
half_float::detail::enable< T, half, void, void >::type
T type
Definition: ieee_half.h:373
nvinfer1::plugin::BatchTilePlugin::initialize
int initialize() override
Initialize the layer for execution.
Definition: batchTilePlugin.cpp:81
half_float::detail::round_half_up
uint16 round_half_up(uint16 value)
Round half-precision number to nearest integer value using round-to-nearest-away-from-zero.
Definition: ieee_half.h:1046
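The rounding rule named in this entry (round to nearest, ties away from zero) can be illustrated with a minimal Python sketch. It works on ordinary floats rather than on the 16-bit pattern that round_half_up takes, and the helper name round_half_away_from_zero is hypothetical; it is not part of ieee_half.h.

```python
import math


def round_half_away_from_zero(x: float) -> float:
    """Round x to the nearest integer value; exact ties move away from zero.

    Hypothetical helper for illustration only; it mirrors the rule described
    for round_half_up, not its bit-level implementation.
    """
    if math.isnan(x) or math.isinf(x):
        return x  # NaN and infinities pass through unchanged
    magnitude = abs(x)
    whole = math.floor(magnitude)
    # The fractional part is computed exactly, so a tie is detected reliably.
    if magnitude - whole >= 0.5:
        whole += 1
    return math.copysign(float(whole), x)


if __name__ == "__main__":
    for value in (2.5, -2.5, 0.5, 1.4, -1.6):
        print(value, "->", round_half_away_from_zero(value))
```

Python's built-in round() uses ties-to-even instead, so round(2.5) gives 2 while the sketch above gives 3.0.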
nmtSample::BeamSearchPolicy::mMaxOutputSequenceLengths
std::vector< int > mMaxOutputSequenceLengths
Definition: beamSearchPolicy.h:73
SampleMNISTAPI::SampleUniquePtr
std::unique_ptr< T, samplesCommon::InferDeleter > SampleUniquePtr
Definition: sampleMNISTAPI.cpp:62
layers.ConvNorm.__init__
def __init__(self, in_channels, out_channels, kernel_size=1, stride=1, padding=None, dilation=1, bias=True, w_init_gain='linear')
Definition: layers.py:37
nvinfer1::plugin::CropAndResizePlugin::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: cropAndResizePlugin.cpp:98
nvinfer1::plugin::ProposalPlugin::mAnchorRatios
std::vector< float > mAnchorRatios
Definition: proposalPlugin.h:121
util.test_misc.FindInDictCase.name
name
Definition: test_misc.py:36
backend.trt.test_loader.TestConfigLoader
Definition: trt/test_loader.py:171
nmtSample::SLPEmbedder::mNumOutputs
int mNumOutputs
Definition: slpEmbedder.h:54
sample::ModelOptions
Definition: sampleOptions.h:94
nvinfer1::LayerType::kPADDING
@ kPADDING
Padding layer.
nvinfer1::plugin::NormalizePluginCreator::NormalizePluginCreator
NormalizePluginCreator()
Definition: normalizePlugin.cpp:255
bert::Fused_multihead_attention_params_v2::enable_i2f_trick
bool enable_i2f_trick
Definition: fused_multihead_attention_v2.h:62
nmtSample::DebugUtil::DumpTensorPlugin::enqueue
int enqueue(int batchSize, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: debugUtil.cpp:86
nvinfer1::MatrixOperation::kTRANSPOSE
@ kTRANSPOSE
Like kNONE, but transpose the matrix dimensions.
half_float::detail::builtin_signbit
bool builtin_signbit(T arg)
Check sign.
Definition: ieee_half.h:436
nvcaffeparser1::parseReshape
nvinfer1::ILayer * parseReshape(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &, BlobNameToTensor &tensors)
Definition: parseReshape.cpp:23
nvinfer1::EnumMax< ProfilingVerbosity >
constexpr int32_t EnumMax< ProfilingVerbosity >()
Maximum number of profile verbosity levels in ProfilingVerbosity enum.
Definition: NvInfer.h:6424
nvinfer1::CUDADriverWrapper::cuLinkDestroy
CUresult cuLinkDestroy(CUlinkState state) const
Definition: cudaDriverWrapper.cpp:91
nvinfer1::plugin::SplitPlugin::_y_stride
int _y_stride
Definition: split.h:43
nvinfer1::IOptimizationProfile
Optimization profile for dynamic input dimensions and shape tensors.
Definition: NvInferRuntime.h:1034
bert::Fused_multihead_attention_params_v2::o_ptr
void * o_ptr
Definition: fused_multihead_attention_v2.h:32
pytorch_quantization.tensor_quant.ScaledQuantDescriptor._name
_name
Definition: tensor_quant.py:82
onnx_graphsurgeon.util.misc
Definition: onnx-graphsurgeon/onnx_graphsurgeon/util/misc.py:1
polygraphy.tools.util.misc.get
def get(args, attr)
Definition: Polygraphy/polygraphy/tools/util/misc.py:24
nvinfer1::BuilderFlag::kREFIT
@ kREFIT
Enable building a refittable engine.
nvcaffeparser1::CaffeWeightFactory::getDataTypeSize
size_t getDataTypeSize() const
Definition: caffeWeightFactory.cpp:73
MaskRCNNConfig::DETECTION_MIN_CONFIDENCE
static const float DETECTION_MIN_CONFIDENCE
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:44
samplesCommon::HostMemory::mType
DataType mType
Definition: samples/common/common.h:331
Logger::TestAtom::mName
std::string mName
Definition: pluginLogging.h:303
sample::Profiler::print
void print(std::ostream &os) const
Definition: sampleReporting.cpp:265
nvonnxparser::IParserError::node
virtual int node() const =0
index of the ONNX model node in which the error occurred
polygraphy.util.misc.FreeOnException.__exit__
def __exit__(self, exc_type, exc_value, traceback)
Definition: Polygraphy/polygraphy/util/misc.py:452
nvinfer1::plugin::FlattenConcatPluginCreator::FlattenConcatPluginCreator
FlattenConcatPluginCreator()
Definition: flattenConcat.cpp:292
nvinfer1::ITensor::setType
virtual void setType(DataType type)=0
Set the data type of a tensor.
EntropyCalibratorImpl::writeCalibrationCache
void writeCalibrationCache(const void *cache, size_t length)
Definition: EntropyCalibrator.h:80
tests.calibrator_test.TestMaxCalibrator.test_raises
def test_raises(self)
Definition: calibrator_test.py:68
nvinfer1::plugin::CoordConvACPluginCreator::mFC
static PluginFieldCollection mFC
Definition: coordConvACPlugin.h:110
polygraphy.util.misc.try_receive_on_queue
def try_receive_on_queue(queue, timeout=None)
Definition: Polygraphy/polygraphy/util/misc.py:253
sample::parseHelp
bool parseHelp(Arguments &arguments)
Definition: sampleOptions.cpp:725
polygraphy.backend.onnx.loader.OnnxFromTfGraph.fold_constant
fold_constant
Definition: backend/onnx/loader.py:69
bert::DATA_TYPE_INT32
@ DATA_TYPE_INT32
Definition: fused_multihead_attention_common.h:31
builder.BertConfig.num_hidden_layers
num_hidden_layers
Definition: builder.py:95
inference.f
f
Definition: BERT/inference.py:92
polygraphy.backend.trt.loader.Profile.add
def add(self, name, min, opt, max)
Definition: backend/trt/loader.py:266
Batch::mEngine
ICudaEngine * mEngine
Definition: sampleMovieLensMPS.cpp:245
nvinfer1::plugin::RPROIPlugin::~RPROIPlugin
~RPROIPlugin() override
Definition: nvFasterRCNNPlugin.cpp:98
nvinfer1::plugin::MultilevelCropAndResizePluginCreator::MultilevelCropAndResizePluginCreator
MultilevelCropAndResizePluginCreator()
Definition: multilevelCropAndResizePlugin.cpp:38
samplesCommon::BufferManager::mManagedBuffers
std::vector< std::unique_ptr< ManagedBuffer > > mManagedBuffers
The vector of pointers to managed buffers.
Definition: buffers.h:457
pytorch_quantization.nn.modules.quant_rnn.GRUCell.reset_parameters
def reset_parameters(self)
Definition: modules/quant_rnn.py:441
nvinfer1::plugin::PriorBox::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: priorBoxPlugin.cpp:150
SampleINT8::SampleINT8
SampleINT8(const SampleINT8Params &params)
Definition: sampleINT8.cpp:64
tests.tensor_quant_test.TestTensorQuant.test_overflow_fp16
def test_overflow_fp16(self)
Definition: tensor_quant_test.py:110
tests.optim_helper_test.TestMatchParameters.test_regex
def test_regex(self, resnet18)
Definition: optim_helper_test.py:43
half_float::detail::remainder
expr remainder(half x, half y)
Remainder of division.
Definition: ieee_half.h:2253
Args::iterations
int iterations
Definition: demo/BERT/infer_c/common.h:32
nvonnxparser::IOnnxConfig::getPrintLayerInfo
virtual bool getPrintLayerInfo() const =0
Get whether the layer information will be printed.
nvinfer1::plugin::MultilevelProposeROI::mRegWeightDevice
std::shared_ptr< CudaBind< float > > mRegWeightDevice
Definition: multilevelProposeROIPlugin.h:111
half_float::detail::half_caster< T, half, R >::cast_impl
static T cast_impl(half arg, true_type)
Definition: samples/common/half.h:2650
nvinfer1::plugin::PriorBox::~PriorBox
~PriorBox() override=default
Logger::mOstream
std::ostream * mOstream
Definition: demo/BERT/infer_c/logging.h:74
sample::Logger::reportTestResult
static void reportTestResult(const TestAtom &testAtom, TestResult result)
method that implements logging test results
Definition: samples/common/logging.h:418
comparator.test_struct.TestRunResults
Definition: comparator/test_struct.py:27
half_float::detail::functions::tan
static expr tan(float arg)
Tan implementation.
Definition: samples/common/half.h:1902
SampleErrorRecorder::operator[]
const errorPair & operator[](size_t index) const noexcept
Definition: ErrorRecorder.h:112
tests.integration_test.TestNetwork.test_calibration
def test_calibration(self)
Definition: integration_test.py:145
polygraphy.tools.precision.precision.Precision.__init__
def __init__(self)
Definition: precision.py:296
sample::TrtCudaGraph::~TrtCudaGraph
~TrtCudaGraph()
Definition: sampleDevice.h:178
anonymous_namespace{groupNormalizationPlugin.cpp}::GROUP_NORM_VERSION
constexpr const char * GROUP_NORM_VERSION
Definition: groupNormalizationPlugin.cpp:37
data_functions.get_data_loader
def get_data_loader(model_name, dataset_path, audiopaths_and_text, args)
Definition: data_functions.py:37
nmtSample::DataWriter::finalize
virtual void finalize()=0
it is called right after inference ends
polygraphy.tools.args.trt.loader.TrtLoaderArgs.data_loader_args
data_loader_args
Definition: tools/args/trt/loader.py:75
nvinfer1::plugin::BatchedNMSDynamicPlugin::BatchedNMSDynamicPlugin
BatchedNMSDynamicPlugin(NMSParameters param)
Definition: batchedNMSPlugin.cpp:58
nvinfer1::plugin::RegionPluginCreator::params
RegionParameters params
Definition: regionPlugin.h:121
polygraphy.comparator.postprocess.PostprocessFunc.topk_func
def topk_func(k=10, axis=-1, outputs=None, exclude=None)
Definition: postprocess.py:27
nvinfer1::DataType::kHALF
@ kHALF
IEEE 16-bit floating-point format.
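To make the 16-bit layout concrete, the sketch below uses Python's standard struct module, whose "e" format code is IEEE 754 binary16, to convert between a float and its half-precision bit pattern. The two helper names are hypothetical and are not part of the TensorRT API; this only illustrates the storage format that kHALF refers to.

```python
import struct


def float_to_half_bits(value: float) -> int:
    """Return the IEEE 754 binary16 bit pattern of value (hypothetical helper)."""
    return struct.unpack("<H", struct.pack("<e", value))[0]


def half_bits_to_float(bits: int) -> float:
    """Decode a 16-bit IEEE 754 binary16 pattern into a Python float."""
    return struct.unpack("<e", struct.pack("<H", bits))[0]


if __name__ == "__main__":
    for value in (1.0, -2.0, 65504.0):
        bits = float_to_half_bits(value)
        # 1 sign bit, 5 exponent bits, 10 mantissa bits
        print(f"{value!r}: 0x{bits:04X} -> {half_bits_to_float(bits)!r}")
```

65504.0 is the largest finite half-precision value; struct.pack raises OverflowError for inputs that round beyond it.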
SampleConfig::SampleConfig
SampleConfig()
Definition: sampleConfig.h:62
nvinfer1::plugin::SpecialSlice::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: specialSlicePlugin.cpp:99
nvinfer1::plugin::LReLU::mNegSlope
float mNegSlope
Definition: lReluPlugin.h:72
bert::EmbLayerNormVarSeqlenPlugin::mNamespace
std::string mNamespace
Definition: embLayerNormVarSeqlenPlugin.h:86
bert::fused_multihead_attention_v2_fp16_256_64_kernel_sm75_cubin
unsigned char fused_multihead_attention_v2_fp16_256_64_kernel_sm75_cubin[]
Definition: fused_multihead_attention_v2_fp16_256_64_kernel.sm75.cpp:19
polygraphy.comparator.data_loader.BaseDataLoader.__getitem__
def __getitem__(self, index)
Definition: comparator/data_loader.py:29
bert::QKVToContextInterleavedPlugin::supportsFormatCombination
bool supportsFormatCombination(int pos, const nvinfer1::PluginTensorDesc *inOut, int nbInputs, int nbOutputs) override
Definition: qkvToContextInt8InterleavedPlugin.cpp:118
nvinfer1::plugin::GenerateDetection::configurePlugin
void configurePlugin(const Dims *inputDims, int nbInputs, const Dims *outputDims, int nbOutputs, const DataType *inputTypes, const DataType *outputTypes, const bool *inputIsBroadcast, const bool *outputIsBroadcast, PluginFormat floatFormat, int maxBatchSize) override
Definition: generateDetectionPlugin.cpp:316
half_float::detail::tgamma
expr tgamma(half arg)
Gamma function.
Definition: ieee_half.h:2540
nvinfer1::IExecutionContext::setProfiler
virtual void setProfiler(IProfiler *) noexcept=0
Set the profiler.
test_ir.TensorBaseTests.test_o_multiple_outputs
def test_o_multiple_outputs(self)
Definition: test_ir.py:100
nvinfer1::IRaggedSoftMaxLayer
A RaggedSoftmax layer in a network definition.
Definition: NvInfer.h:4241
bert::fused_multihead_attention_v2_fp16_96_64_kernel_sm80_cubin_len
unsigned int fused_multihead_attention_v2_fp16_96_64_kernel_sm80_cubin_len
Definition: fused_multihead_attention_v2_fp16_96_64_kernel.sm80.cpp:2788
nvinfer1::plugin::Region::~Region
~Region() override=default
half_float::half::half
half(detail::expr rhs)
Copy constructor.
Definition: samples/common/half.h:1408
SampleCharRNNMaps::idToChar
const std::vector< char > idToChar
Definition: sampleCharRNN.cpp:108
tests.quant_rnn_test.TestQuantLSTM.test_no_quant
def test_no_quant(self, verbose)
Definition: quant_rnn_test.py:283
stft.STFT.transform
def transform(self, input_data)
Definition: stft.py:77
nvinfer1::ITensor::isExecutionTensor
virtual bool isExecutionTensor() const =0
Whether the tensor is an execution tensor.
model.Decoder.forward
def forward(self, memory, decoder_inputs, memory_lengths)
Definition: demo/Tacotron2/tacotron2/model.py:439
generate.relu
def relu(self, a)
Definition: 07_creating_a_model_with_the_layer_api/generate.py:59
text.cmudict.CMUDict._entries
_entries
Definition: cmudict.py:44
onnx_graphsurgeon.ir.tensor.Tensor.__repr__
def __repr__(self)
Definition: tensor.py:136
sample::anonymous_namespace{sampleOptions.cpp}::removeSingleQuotationMarks
std::string removeSingleQuotationMarks(std::string &str)
Definition: sampleOptions.cpp:229
SampleMovieLens::mParams
SampleMovieLensParams mParams
Definition: sampleMovieLens.cpp:149
half_float::detail::operator>=
enable< bool, T, U >::type operator>=(T x, U y)
Comparison for greater equal.
Definition: ieee_half.h:2162
data_function.MelAudioLoader.__init__
def __init__(self, dataset_path, audiopaths_and_text, args)
Definition: waveglow/data_function.py:29
model.Attention.location_layer
location_layer
Definition: demo/Tacotron2/tacotron2/model.py:56
onnx_graphsurgeon.ir.graph.UnseenTensor.__init__
def __init__(self, initial_tensors=None)
Definition: graph.py:30
onnx_graphsurgeon.util
Definition: tools/onnx-graphsurgeon/onnx_graphsurgeon/util/__init__.py:1
nvinfer1::plugin::BatchTilePlugin::configurePlugin
void configurePlugin(const Dims *inputDims, int nbInputs, const Dims *outputDims, int nbOutputs, const DataType *inputTypes, const DataType *outputTypes, const bool *inputIsBroadcast, const bool *outputIsBroadcast, PluginFormat floatFormat, int maxBatchSize) override
Definition: batchTilePlugin.cpp:137
pytorch_quantization.calib.histogram.HistogramCalibrator._calib_hist
_calib_hist
Definition: histogram.py:50
polygraphy.comparator.struct.IterationResult
Definition: comparator/struct.py:22
sample::Logger::TestResult::kFAILED
@ kFAILED
The test failed.
bert::fused_multihead_attention_v2_int8_192_64_kernel_sm80_cubin_len
unsigned int fused_multihead_attention_v2_int8_192_64_kernel_sm80_cubin_len
Definition: fused_multihead_attention_v2_int8_192_64_kernel.sm80.cpp:15004
polygraphy.backend.trt
Definition: tools/Polygraphy/polygraphy/backend/trt/__init__.py:1
BatchStream::reset
void reset(int firstBatch) override
Definition: BatchStream.h:208
half_float::detail::functions::isfinite
static bool isfinite(half arg)
Classification implementation.
Definition: samples/common/half.h:2342
BatchStream::getBatchSize
int getBatchSize() const
Definition: frcnnUtils.h:314
tests.quant_conv_transposed_test.TestQuantConvTranspose1D.test_weight_fake_quant_per_channel
def test_weight_fake_quant_per_channel(self)
Definition: quant_conv_transposed_test.py:374
nvinfer1::IPluginV2Ext::detachFromContext
virtual void detachFromContext()
Detach the plugin object from its execution context.
Definition: NvInferRuntimeCommon.h:675
nvinfer1::plugin::GridAnchorGenerator::mDeviceWidths
Weights * mDeviceWidths
Definition: gridAnchorPlugin.h:98
SampleCharRNNLoop::LstmParams
Definition: sampleCharRNN.cpp:248
nvinfer1::ITopKLayer::~ITopKLayer
virtual ~ITopKLayer()
Definition: NvInfer.h:4123
SampleErrorRecorder::incRefCount
IErrorRecorder::RefCount incRefCount() noexcept final
Increments the refcount for the current ErrorRecorder.
Definition: ErrorRecorder.h:101
nvinfer1::plugin::MultilevelProposeROI::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: multilevelProposeROIPlugin.cpp:328
nvinfer1::ILogger::Severity::kINFO
@ kINFO
Informational messages with instructional information.
polygraphy.util.cuda.DeviceBuffer.__str__
def __str__(self)
Definition: cuda.py:199
sample::InferenceOptions::graph
bool graph
Definition: sampleOptions.h:158
nvinfer1::plugin::RPROIPlugin::configurePlugin
void configurePlugin(const Dims *inputDims, int nbInputs, const Dims *outputDims, int nbOutputs, const DataType *inputTypes, const DataType *outputTypes, const bool *inputIsBroadcast, const bool *outputIsBroadcast, PluginFormat floatFormat, int maxBatchSize) override
Definition: nvFasterRCNNPlugin.cpp:281
BboxInfo::kept
bool kept
Definition: bboxUtils.h:45
SampleCharRNNBase::mEngine
std::shared_ptr< nvinfer1::ICudaEngine > mEngine
The TensorRT engine used to run the network.
Definition: sampleCharRNN.cpp:220
nmtSample::BenchmarkWriter::mSampleCount
int mSampleCount
Definition: benchmarkWriter.h:48
nmtSample::BeamSearchPolicy::mCandidateLikelihoods
std::vector< float > mCandidateLikelihoods
Definition: beamSearchPolicy.h:77
half_float::half::operator=
half & operator=(float rhs)
Assignment operator.
Definition: samples/common/half.h:1479
std::numeric_limits< half_float::half >::signaling_NaN
static half_float::half signaling_NaN()
Signalling NaN.
Definition: samples/common/half.h:4258
nvinfer1::plugin::FlattenConcat::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: flattenConcat.cpp:126
LogStreamConsumer::LogStreamConsumer
LogStreamConsumer(Severity reportableSeverity, Severity severity)
Creates a LogStreamConsumer which logs messages with level severity.
Definition: pluginLogging.h:134
polygraphy.backend.onnx.loader.ModifyOnnx.outputs
outputs
Definition: backend/onnx/loader.py:125
samplesCommon::parseArgs
bool parseArgs(Args &args, int32_t argc, char *argv[])
Populates the Args struct with the provided command-line parameters.
Definition: argsParser.h:99
nvinfer1::INetworkDefinition::addSlice
virtual ISliceLayer * addSlice(ITensor &input, Dims start, Dims size, Dims stride)=0
Add a slice layer to the network.
export_waveglow_onnx.infer_onnx
def infer_onnx(self, spect, z, sigma=0.9)
Definition: export_waveglow_onnx.py:109
helpers.calibrator.BertCalibrator.get_batch_size
def get_batch_size(self)
Definition: demo/BERT/helpers/calibrator.py:52
MaskRCNNUtils::maskPPM
void maskPPM(PPM< uint8_t > &image, const PPM< uint8_t > &mask, const int start_x, const int start_y, const std::vector< int > &color)
Definition: sampleUffMaskRCNN.cpp:230
nvinfer1::PluginVersion::kV2_DYNAMICEXT
@ kV2_DYNAMICEXT
IPluginV2DynamicExt.
sample::InferenceTrace::outStart
float outStart
Definition: sampleReporting.h:98
test_exporters.TestOnnxExporter.test_export_constant_tensor_to_value_info_proto
def test_export_constant_tensor_to_value_info_proto(self)
Definition: test_exporters.py:47
pytorch_quantization.nn.modules.quant_rnn.QuantLSTMCell
Definition: modules/quant_rnn.py:381
nvonnxparser::ErrorCode::kUNSUPPORTED_NODE
@ kUNSUPPORTED_NODE
nmtSample::SLPProjection::getOutputSize
int getOutputSize() override
get the size of raw logits vector
Definition: slpProjection.cpp:59
pytorch_quantization.nn.modules.quant_conv.QuantConv2d
Definition: quant_conv.py:90
export_tacotron2_onnx.Postnet.tacotron2
tacotron2
Definition: export_tacotron2_onnx.py:79
nvuffparser::FieldCollection::fields
const FieldMap * fields
Definition: NvUffParser.h:90
nmtSample::operator>>
std::istream & operator>>(std::istream &input, Vocabulary &value)
Definition: vocabulary.cpp:61
CudaBind::mPtr
void * mPtr
Definition: maskRCNNKernels.h:66
polygraphy.backend.trt.loader.CreateConfig.profiles
profiles
Definition: backend/trt/loader.py:325
nvinfer1::plugin::BatchedNMSDynamicPlugin::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: batchedNMSPlugin.cpp:272
pytorch_quantization.calib.max.MaxCalibrator._track_amax
_track_amax
Definition: max.py:39
builder.set_output_range
def set_output_range(layer, maxval, out_idx=0)
Definition: builder.py:112
helpers.tokenization.WordpieceTokenizer
Definition: tokenization.py:330
polygraphy.backend.onnx.util.mark_layerwise
def mark_layerwise(model)
Definition: onnx/util.py:91
tests.quant_linear_test.TestQuantLinear.test_fake_quant_per_tensor_with_bias
def test_fake_quant_per_tensor_with_bias(self)
Definition: quant_linear_test.py:120
nmtSample::Context::getInfo
std::string getInfo() override
get the textual description of the component
Definition: contextNMT.cpp:40
tests.quant_conv_transposed_test.TestQuantConvTranspose2D.test_fake_quant_per_channel_other_prec
def test_fake_quant_per_channel_other_prec(self)
Definition: quant_conv_transposed_test.py:163
polygraphy.tools.args.comparator.ComparatorRunArgs.use_subprocess
use_subprocess
Definition: tools/args/comparator.py:44
nmtSample::LimitedSamplesDataReader::read
int read(int samplesToRead, int maxInputSequenceLength, int *hInputData, int *hActualInputSequenceLengths) override
reads the batch of samples/sequences
Definition: limitedSamplesDataReader.cpp:31
samplesCommon::classify
std::vector< std::string > classify(const std::vector< std::string > &refVector, const std::vector< T > &output, const size_t topK)
Definition: samples/common/common.h:428
bert::fused_multihead_attention_v2_int8_256_64_kernel_cubin
unsigned char fused_multihead_attention_v2_int8_256_64_kernel_cubin[]
Definition: fused_multihead_attention_v2_int8_256_64_kernel.sm72.cpp:19
evaluate-v1.parser
parser
Definition: evaluate-v1.1.py:99
nvinfer1::plugin::Normalize::H
int H
Definition: normalizePlugin.h:102
polygraphy.tools.args.trt_legacy.TrtLegacyArgs.trt_loader_args
trt_loader_args
Definition: tools/args/trt_legacy.py:46
nvinfer1::plugin::GenerateDetection::mNameSpace
std::string mNameSpace
Definition: generateDetectionPlugin.h:109
arg_parser.parse_tacotron2_args
def parse_tacotron2_args(parent, add_help=False)
Definition: tacotron2/arg_parser.py:22
sample::Parser::uffParser
TrtUniquePtr< nvuffparser::IUffParser > uffParser
Definition: sampleEngines.h:36
SampleSSD
The SampleSSD class implements the SSD sample.
Definition: sampleSSD.cpp:60
onnx_graphsurgeon.ir.node.Node.copy
def copy(self, List["Tensor"] inputs=None, List["Tensor"] outputs=None)
Definition: node.py:96
sample::Bindings::mNames
std::unordered_map< std::string, int > mNames
Definition: sampleUtils.h:451
nvinfer1::plugin::BatchedNMSDynamicPlugin::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: batchedNMSPlugin.cpp:360
pytorch_quantization.nn.modules._utils.QuantMixin
Definition: _utils.py:30
bert::fused_multihead_attention_fp16_128_64_kernel_sm75_cu_o
unsigned char fused_multihead_attention_fp16_128_64_kernel_sm75_cu_o[]
Definition: fused_multihead_attention_fp16_128_64_kernel.sm75.cpp:20
nvinfer1::UnaryOperation::kCEIL
@ kCEIL
Ceiling.
nvinfer1::ElementWiseOperation::kAND
@ kAND
Logical AND of two elements.
nvcaffeparser1::ICaffeParser
Class used for parsing Caffe models.
Definition: NvCaffeParser.h:169
polygraphy.tools.surgeon.subtool.base.Config.from_graph
def from_graph(graph)
Definition: surgeon/subtool/base.py:62
half_float::detail::enable< T, expr, half, half >::type
T type
Definition: ieee_half.h:383
tests.tensor_quantizer_test.TestTensorQuantizer.test_learn_amax
def test_learn_amax(self)
Definition: tensor_quantizer_test.py:93
sample::InferenceOptions::threads
bool threads
Definition: sampleOptions.h:157
PluginFactory::destroyPlugin
void destroyPlugin()
Definition: samples/opensource/samplePlugin/fcPlugin.h:376
SampleDynamicReshape::mPredictionEngine
SampleUniquePtr< nvinfer1::ICudaEngine > mPredictionEngine
Definition: sampleDynamicReshape.cpp:84
util.test_misc.FindInDictCase
Definition: test_misc.py:34
polygraphy.tools.registry.MissingTool.__init__
def __init__(self, name, err)
Definition: registry.py:31
polygraphy.util.cuda.Cuda.__init__
def __init__(self)
Definition: cuda.py:33
nvinfer1::plugin::BatchTilePluginCreator::BatchTilePluginCreator
BatchTilePluginCreator()
Definition: batchTilePlugin.cpp:183
half_float::detail::functions::lrint
static long lrint(half arg)
Nearest integer implementation.
Definition: samples/common/half.h:2126
nvinfer1::BuilderFlag::kINT8
@ kINT8
Enable Int8 layer selection, with FP32 fallback, and with FP16 fallback if kFP16 is also specified.
sample::Options::parse
virtual void parse(Arguments &arguments)=0
builder_varseqlen.BertConfig.get_trt_dtype
def get_trt_dtype(self)
Definition: builder_varseqlen.py:103
polygraphy.backend.tf.loader.GraphFromFrozen.__call__
def __call__(self)
Definition: backend/tf/loader.py:150
pytorch_quantization.nn.modules.clip
Definition: clip.py:1
nvinfer1::plugin::RPROIPluginCreator::mFC
static PluginFieldCollection mFC
Definition: nvFasterRCNNPlugin.h:121
tests.classification_flow_test.TestClassificationFlow
Definition: classification_flow_test.py:27
polygraphy.backend.trt_legacy.ConvertToUff.__init__
def __init__(self, tf_loader, save_uff=None, preprocessor=None)
Definition: backend/trt_legacy.py:49
helpers.tokenization.BertTokenizer.ids_to_tokens
ids_to_tokens
Definition: tokenization.py:189
sample::LogStreamConsumer::mSeverity
Severity mSeverity
Definition: samples/common/logging.h:179
half_float::detail::bits< double >::type
unsigned long type
Definition: ieee_half.h:341
polygraphy.backend.trt_legacy.LoadUffFile.__call__
def __call__(self)
Definition: backend/trt_legacy.py:41
nvinfer1::plugin::CropAndResizePlugin::~CropAndResizePlugin
~CropAndResizePlugin() override
Definition: cropAndResizePlugin.cpp:96
pytorch_quantization.calib.max.MaxCalibrator.__repr__
def __repr__(self)
Definition: max.py:99
sample::TrtDestroyer::operator()
void operator()(T *t)
Definition: sampleUtils.h:459
helpers.tokenization.FullTokenizer.convert_tokens_to_ids
def convert_tokens_to_ids(self, tokens)
Definition: tokenization.py:177
nvinfer1::PluginFieldType::kCHAR
@ kCHAR
char field type.
test_func.TestExtend.test_extend_2_args_2_rv
def test_extend_2_args_2_rv(self)
Definition: test_func.py:92
half_float::detail::atanh
expr atanh(half arg)
Hyperbolic area tangent.
Definition: ieee_half.h:2508
SampleAlgorithmSelector
The SampleAlgorithmSelector class implements the SampleAlgorithmSelector sample.
Definition: sampleAlgorithmSelector.cpp:312
AlgorithmCacheWriter::mCacheFileName
std::string mCacheFileName
Definition: sampleAlgorithmSelector.cpp:114
sample::anonymous_namespace{sampleEngines.cpp}::UffBufferShutter::~UffBufferShutter
~UffBufferShutter()
Definition: sampleEngines.cpp:53
half_float::half::half
half(float rhs)
Conversion constructor.
Definition: samples/common/half.h:1415
FCPlugin::getOutputDimensions
nvinfer1::Dims getOutputDimensions(int index, const nvinfer1::Dims *inputs, int nbInputDims) override
Definition: samples/opensource/samplePlugin/fcPlugin.h:100
samplesCommon::GpuTimer::~GpuTimer
~GpuTimer()
Definition: samples/common/common.h:816
backend.trt.test_loader.TestConfigLoader.test_fp16
def test_fp16(self, identity_builder_network, flag)
Definition: trt/test_loader.py:210
nvinfer1::plugin::LReluPluginCreator
Definition: lReluPlugin.h:76
nvinfer1::plugin::GenerateDetection::mIOUThreshold
float mIOUThreshold
Definition: generateDetectionPlugin.h:98
SampleMNISTAPI::mWeightMap
std::map< std::string, nvinfer1::Weights > mWeightMap
The weight name to weight value map.
Definition: sampleMNISTAPI.cpp:91
SampleMNIST::build
bool build()
Builds the network engine.
Definition: sampleMNIST.cpp:112
nvinfer1::plugin::MultilevelProposeROIPluginCreator::~MultilevelProposeROIPluginCreator
~MultilevelProposeROIPluginCreator()
Definition: multilevelProposeROIPlugin.h:125
nvinfer1::IRuntime::setGpuAllocator
virtual void setGpuAllocator(IGpuAllocator *allocator) noexcept=0
Set the GPU allocator.
train.save_checkpoint
def save_checkpoint(model, optimizer, epoch, config, amp_run, output_dir, model_name, local_rank, world_size)
Definition: train.py:173
nvuffparser::FieldMap::name
const char * name
Definition: NvUffParser.h:79
polygraphy.tools.args.onnxrt.runner.OnnxrtRunnerArgs.register
def register(self, maker)
Definition: tools/args/onnxrt/runner.py:21
bert::fused_multihead_attention_int8_384_64_kernel_sm75_cu_o
unsigned char fused_multihead_attention_int8_384_64_kernel_sm75_cu_o[]
Definition: fused_multihead_attention_int8_384_64_kernel.sm75.cpp:20
nvcaffeparser1::IPluginFactoryExt::getVersion
virtual int32_t getVersion() const
Definition: NvCaffeParser.h:119
util.test_cuda.TestDeviceBuffer
Definition: test_cuda.py:39
helpers.calibrator.BertCalibrator.data
data
Definition: demo/BERT/helpers/calibrator.py:36
tests.fixtures.models.LeNet
Definition: tools/pytorch-quantization/tests/fixtures/models.py:26
half_float::detail::ldexp
half ldexp(half arg, int exp)
Multiply by power of two.
Definition: ieee_half.h:2635
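To illustrate what "multiply by power of two" means here, the following minimal sketch uses Python's standard `math.ldexp` on ordinary floats rather than the `half` type. The helper name `ldexp_reference` is hypothetical and is not part of the half library.

```python
import math


def ldexp_reference(arg: float, exp: int) -> float:
    """Return arg * 2**exp, the operation that ldexp performs."""
    return math.ldexp(arg, exp)


# 0.75 scaled by 2**4 (that is, multiplied by 16).
scaled = ldexp_reference(0.75, 4)
```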
nvinfer1::plugin::GenerateDetectionPluginCreator::createPlugin
IPluginV2Ext * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: generateDetectionPlugin.cpp:66
builder.attention_layer_opt
def attention_layer_opt(prefix, config, init_dict, network, input_tensor, imask)
Definition: builder.py:124
OutputArgs::userId
int32_t userId
Definition: sampleMovieLensMPS.cpp:157
nvinfer1::plugin::GroupNormalizationPlugin::bnDesc
cudnnTensorDescriptor_t bnDesc
Definition: groupNormalizationPlugin.h:103
nvinfer1::plugin::ProposalPlugin::mInputHeight
size_t mInputHeight
Definition: proposalPlugin.h:109
pytorch_quantization.quant_modules.QuantModuleReplacementHelper.restore_float_modules
def restore_float_modules(self)
Definition: quant_modules.py:102
nvinfer1::plugin::NMSPluginCreator::deserializePlugin
IPluginV2Ext * deserializePlugin(const char *name, const void *serialData, size_t serialLength) override
Called during deserialization of plugin layer.
Definition: nmsPlugin.cpp:383
backend.trt.test_loader.reshape_network
def reshape_network()
Definition: trt/test_loader.py:58
samplesCommon::divUp
A divUp(A x, B n)
Definition: samples/common/common.h:646
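A helper named divUp conventionally performs integer division rounded up, for example to compute how many fixed-size blocks cover a buffer. The sketch below assumes that meaning and non-negative `x` with positive `n`; it is a Python illustration, not the code from common.h.

```python
def div_up(x: int, n: int) -> int:
    """Integer division of x by n, rounded up (x >= 0, n > 0 assumed)."""
    return (x + n - 1) // n


# Number of 256-element blocks needed to cover 1000 elements.
blocks = div_up(1000, 256)
```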
half_float::detail::functions::islessequal
static bool islessequal(half x, half y)
Comparison implementation.
Definition: samples/common/half.h:2444
tests.quant_conv_test.TestQuantConv2D.test_weight_fake_quant_per_channel
def test_weight_fake_quant_per_channel(self)
Definition: quant_conv_test.py:90
nvinfer1::plugin::BatchedNMSDynamicPluginCreator::deserializePlugin
IPluginV2DynamicExt * deserializePlugin(const char *name, const void *serialData, size_t serialLength) override
Called during deserialization of plugin layer.
Definition: batchedNMSPlugin.cpp:636
SampleReformatFreeIO::reference
bool reference()
Used to run CPU reference and get result.
SampleUffSSD::build
bool build()
Function builds the network engine.
Definition: sampleUffSSD.cpp:125
generate.identity
def identity(self, inp)
Definition: 08_replacing_a_subgraph/generate.py:31
nvinfer1::plugin::ProposalLayerPluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: proposalLayerPlugin.cpp:56
nvinfer1::plugin::CoordConvACPlugin::oW
int oW
Definition: coordConvACPlugin.h:87
pytorch_quantization.nn.modules.quant_conv.QuantConv3d.__init__
def __init__(self, in_channels, out_channels, kernel_size, stride=1, padding=0, dilation=1, groups=1, bias=True, padding_mode='zeros', **kwargs)
Definition: quant_conv.py:139
pytorch_quantization.nn.modules.quant_rnn.QuantRNNCell.weight_ih
weight_ih
Definition: modules/quant_rnn.py:346
bert::QKVToContextInterleavedPlugin::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: qkvToContextInt8InterleavedPlugin.cpp:172
sample::Profiler::mIterator
std::vector< LayerProfile >::iterator mIterator
Definition: sampleReporting.h:189
tools.test_run.TestTrt.test_trt_explicit_profile
def test_trt_explicit_profile(self)
Definition: test_run.py:112
samplesCommon::DeviceBuffer
GenericBuffer< DeviceAllocator, DeviceFree > DeviceBuffer
Definition: buffers.h:216
numeric_limits
polygraphy.tools.registry.try_register_tool
def try_register_tool(module, tool_class)
Definition: registry.py:41
nvinfer1::EnumMax< QuantizationFlag >
constexpr int32_t EnumMax< QuantizationFlag >()
Maximum number of quantization flags in QuantizationFlag enum.
Definition: NvInfer.h:6364
tests.optim_helper_test.TestFreezeParameters
Definition: optim_helper_test.py:95
nmtSample::MultiplicativeAlignment::MultiplicativeAlignment
MultiplicativeAlignment(ComponentWeights::ptr weights)
Definition: multiplicativeAlignment.cpp:24
onnx_graphsurgeon.util.misc.default_value
def default_value(value, default)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/util/misc.py:49
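The signature suggests a fallback helper for optional arguments. A minimal sketch, assuming that only `None` triggers the fallback (so falsy values such as `0` or an empty list are kept), could look like this; the real function in misc.py may differ.

```python
def default_value(value, default):
    """Return value unless it is None, in which case return default."""
    return value if value is not None else default


opset = default_value(None, 11)     # falls back to the default
explicit = default_value(0, 11)     # 0 is not None, so it is kept
```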
SampleMovieLens::teardown
bool teardown()
Used to clean up any state created in the sample class.
Definition: sampleMovieLens.cpp:448
tools.test_run.TestOther.test_runner_coexistence
def test_runner_coexistence(self)
Definition: test_run.py:273
nmtSample::Vocabulary::Vocabulary
Vocabulary()
Definition: vocabulary.cpp:29
data_function.MelAudioLoader.sampling_rate
sampling_rate
Definition: waveglow/data_function.py:32
tests.quant_instancenorm_test.TestQuantInstanceNorm2D.test_no_quant
def test_no_quant(self)
Definition: quant_instancenorm_test.py:99
onnx_models.scan_model
def scan_model()
Definition: onnx_models.py:142
onnx_graphsurgeon.ir.tensor.Constant.__repr__
def __repr__(self)
Definition: tensor.py:229
nvinfer1::plugin::DetectionLayer::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: detectionLayerPlugin.cpp:118
test_ir.TestNode
Definition: test_ir.py:134
nvinfer1::INetworkDefinition::addLoop
virtual ILoop * addLoop() noexcept=0
Add a loop to the network.
text.cleaners.collapse_whitespace
def collapse_whitespace(text)
Definition: cleaners.py:75
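Going by its name, this cleaner replaces every run of whitespace with a single space. The sketch below shows one way to do that with the standard `re` module; it is an assumption about the behavior, not a copy of cleaners.py.

```python
import re

# Matches one or more consecutive whitespace characters (spaces, tabs, newlines).
_WHITESPACE_RE = re.compile(r"\s+")


def collapse_whitespace(text: str) -> str:
    """Replace each run of whitespace in text with a single space."""
    return _WHITESPACE_RE.sub(" ", text)


cleaned = collapse_whitespace("hello   world\n\tagain")
```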
tools.test_inspect.TestInspectModel.test_model_tf_sanity
def test_model_tf_sanity(self, run_inspect_model)
Definition: test_inspect.py:287
nvinfer1::plugin::ProposalLayer::initialize
int initialize() override
Initialize the layer for execution.
Definition: proposalLayerPlugin.cpp:130
polygraphy.comparator.postprocess
Definition: postprocess.py:1
onnx_graphsurgeon.exporters.onnx_exporter.export_onnx
"onnx.ModelProto" export_onnx(Graph graph, do_type_check=True, **kwargs)
Definition: onnx_exporter.py:93
polygraphy.tools.args.model.ModelArgs.model_file
model_file
Definition: tools/Polygraphy/polygraphy/tools/args/model.py:93
polygraphy.tools.args.data_loader.DataLoaderArgs.add_to_parser
def add_to_parser(self, parser)
Definition: tools/args/data_loader.py:27
pytorch_quantization.tensor_quant.ScaledQuantDescriptor._fake_quant
_fake_quant
Definition: tensor_quant.py:84
nvinfer1::plugin::LReLU::initialize
int initialize() override
Initialize the layer for execution.
Definition: lReluPlugin.cpp:98
polygraphy.tools.base.tool.Tool
Definition: tool.py:21
tests.tensor_quantizer_test.TestTensorQuantizer.test_entropy_and_percentile_calib
def test_entropy_and_percentile_calib(self)
Definition: tensor_quantizer_test.py:217
SampleMNIST::SampleMNIST
SampleMNIST(const samplesCommon::CaffeSampleParams &params)
Definition: sampleMNIST.cpp:54
nvinfer1::ILayer::getOutput
virtual ITensor * getOutput(int32_t index) const =0
Get the layer output corresponding to the given index.
nmtSample::TextWriter
Definition: textWriter.h:33
sample::InferenceOptions::shapes
std::unordered_map< std::string, std::vector< int > > shapes
Definition: sampleOptions.h:162
nvinfer1::plugin::ProposalLayer::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: proposalLayerPlugin.cpp:197
test_ir.TensorBaseTests.test_can_convert_in_place_to_constant
def test_can_convert_in_place_to_constant(self)
Definition: test_ir.py:31
polygraphy.comparator.struct.RunResults.__setitem__
def __setitem__(self, key, value)
Definition: comparator/struct.py:110
tests.quant_pooling_test.TestQuantMaxPool2d
Definition: quant_pooling_test.py:59
nvinfer1::plugin::NMSParameters::isNormalized
bool isNormalized
Definition: NvInferPluginUtils.h:252
SampleConfig::setTolerance
void setTolerance(float t)
Definition: sampleConfig.h:301
nvinfer1::EnumMax< CalibrationAlgoType >
constexpr int32_t EnumMax< CalibrationAlgoType >()
Maximum number of elements in CalibrationAlgoType enum.
Definition: NvInfer.h:5982
SampleCharRNNLoop::addLSTMLayers
nvinfer1::ILayer * addLSTMLayers(SampleCharRNNBase::SampleUniquePtr< nvinfer1::INetworkDefinition > &network) final
Add inputs to the TensorRT network and configure LSTM layers using network definition API.
Definition: sampleCharRNN.cpp:566
AlgorithmCacheReader::AlgorithmCacheReader
AlgorithmCacheReader(const std::string &cacheFileName)
Definition: sampleAlgorithmSelector.cpp:191
nvinfer1::plugin::PriorBox::mPluginNamespace
std::string mPluginNamespace
Definition: priorBoxPlugin.h:95
nvinfer1::UnaryOperation::kASIN
@ kASIN
Inverse sine.
SampleErrorRecorder::hasOverflowed
bool hasOverflowed() const noexcept final
Determine if the error stack has overflowed.
Definition: ErrorRecorder.h:59
tests.tensor_quant_test.TestFakeTensorQuant.test_backward
def test_backward(self)
Definition: tensor_quant_test.py:178
nvinfer1::plugin::PriorBox::mNumPriors
int mNumPriors
Definition: priorBoxPlugin.h:93
nvinfer1::LayerType::kFILL
@ kFILL
Fill layer.
nvinfer1::IPluginV2::getTensorRTVersion
virtual int32_t getTensorRTVersion() const
Return the API version with which this plugin was built.
Definition: NvInferRuntimeCommon.h:409
models.get_model
def get_model(model_name, model_config, to_cuda, uniform_initialize_bn_weight=False, forward_is_infer=False)
Definition: demo/Tacotron2/models.py:54
pytorch_quantization.nn.modules.quant_rnn.QuantRNNBase._param_buf_size
_param_buf_size
Definition: modules/quant_rnn.py:139
helpers.data_processing.read_squad_json
def read_squad_json(input_file)
Definition: data_processing.py:186
SampleMLPParams::inputW
int inputW
The input width.
Definition: sampleMLP.cpp:48
helpers.tokenization.WordpieceTokenizer.tokenize
def tokenize(self, text)
Definition: tokenization.py:338
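WordPiece tokenizers for BERT are commonly described as greedy longest-match-first: each word is split into the longest vocabulary prefix, then the longest `##`-prefixed continuation pieces. The sketch below illustrates that general technique under the assumption that this method follows it; the function name, the toy vocabulary, and the `[UNK]` fallback are illustrative and not taken from tokenization.py.

```python
def wordpiece_tokenize(text, vocab, unk_token="[UNK]"):
    """Greedy longest-match-first split of whitespace-separated words."""
    output = []
    for word in text.split():
        pieces = []
        start = 0
        while start < len(word):
            end = len(word)
            match = None
            # Shrink the candidate from the right until it is in the vocabulary.
            while start < end:
                candidate = word[start:end]
                if start > 0:
                    candidate = "##" + candidate  # continuation piece marker
                if candidate in vocab:
                    match = candidate
                    break
                end -= 1
            if match is None:
                # No piece matched: the whole word maps to the unknown token.
                pieces = [unk_token]
                break
            pieces.append(match)
            start = end
        output.extend(pieces)
    return output


toy_vocab = {"un", "##aff", "##able", "runs"}  # hypothetical vocabulary
tokens = wordpiece_tokenize("unaffable runs xyz", toy_vocab)
```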
SampleDynamicReshape::mPredictionInput
samplesCommon::DeviceBuffer mPredictionInput
Device buffer for the output of the preprocessor.
Definition: sampleDynamicReshape.cpp:89
nvinfer1::ITensor::getBroadcastAcrossBatch
virtual bool getBroadcastAcrossBatch() const =0
Check if tensor is broadcast across the batch.
half_float::detail::functions::read
static std::basic_istream< charT, traits > & read(std::basic_istream< charT, traits > &in, half &arg)
Input implementation.
Definition: ieee_half.h:1251
SampleCharRNNv2
Definition: sampleCharRNN.cpp:223
Semaphore::~Semaphore
~Semaphore()
Definition: sampleMovieLensMPS.cpp:88
nvinfer1::IShuffleLayer::~IShuffleLayer
virtual ~IShuffleLayer()
Definition: NvInfer.h:3849
nvinfer1::plugin::MultilevelProposeROIPluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: multilevelProposeROIPlugin.h:143
nvinfer1::plugin::Normalize::deserializeToDevice
Weights deserializeToDevice(const char *&hostBuffer, size_t count)
Definition: normalizePlugin.cpp:158
polygraphy.backend.tf.loader.GraphFromCkpt.name
name
Definition: backend/tf/loader.py:176
bert::QKVToContextInterleavedPlugin::mLayerName
const std::string mLayerName
Definition: qkvToContextInt8InterleavedPlugin.h:78
bert::EmbLayerNormVarSeqlenPlugin::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: embLayerNormVarSeqlenPlugin.cpp:412
tests.optim_helper_test.TestGroupParameters.test_optimizer_feed
def test_optimizer_feed(self, resnet18)
Definition: optim_helper_test.py:71
polygraphy.util.misc.version
def version(version_str)
Definition: Polygraphy/polygraphy/util/misc.py:32
nvinfer1::plugin::SpecialSlice::attachToContext
void attachToContext(cudnnContext *cudnnContext, cublasContext *cublasContext, IGpuAllocator *gpuAllocator) override
Attach the plugin object to an execution context and grant the plugin the access to some context reso...
Definition: specialSlicePlugin.cpp:206
test_importers.TestOnnxImporter.test_import_graph_with_initializer
def test_import_graph_with_initializer(self)
Definition: test_importers.py:113
nvinfer1::plugin::Reorg::getOutputDataType
DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: reorgPlugin.cpp:132
ProposalWorkSpace::argMaxLabelDims
nvinfer1::DimsHW argMaxLabelDims
Definition: maskRCNNKernels.h:131
nvinfer1::plugin::TRTException::TRTException
TRTException(const char *fl, const char *fn, int ln, int st, const char *msg, const char *nm)
Definition: checkMacrosPlugin.h:77
Batch::mMemSizes
std::vector< size_t > mMemSizes
Definition: sampleMovieLensMPS.cpp:250
MaskRCNNConfig::MODEL_INPUT_SHAPE
static const Dims3 MODEL_INPUT_SHAPE
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:172
polygraphy.backend.trt.loader.EngineFromNetwork
Definition: backend/trt/loader.py:385
nvinfer1::plugin::DetectionOutputParameters::isNormalized
bool isNormalized
Definition: NvInferPluginUtils.h:189
nvinfer1::plugin::Normalize::serializeFromDevice
void serializeFromDevice(char *&hostBuffer, Weights deviceWeights) const
Definition: normalizePlugin.cpp:152
pytorch_quantization.nn.modules.quant_bert.QuantBertSelfAttention.key
key
Definition: quant_bert.py:65
nvinfer1::IPlugin::serialize
virtual void serialize(void *buffer)=0
Serialize the layer.
half_float::detail::functions::cbrt
static expr cbrt(float arg)
Cubic root implementation.
Definition: samples/common/half.h:1847
sample::SystemOptions::parse
void parse(Arguments &arguments) override
Definition: sampleOptions.cpp:650
nvinfer1::IRefitter::getTensorsWithDynamicRange
virtual int32_t getTensorsWithDynamicRange(int32_t size, const char **tensorNames) const =0
Get names of all tensors that have refittable dynamic ranges.
tests.fixtures.models.QuantLeNet
Definition: tools/pytorch-quantization/tests/fixtures/models.py:44
samplesCommon::vPPM::max
int max
Definition: samples/common/common.h:663
nvinfer1::plugin::SpecialSlice::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: specialSlicePlugin.cpp:104
nmtSample::Embedder::ptr
std::shared_ptr< Embedder > ptr
Definition: embedder.h:35
MaskRCNNUtils::RawDetection::y2
float y2
Definition: sampleUffMaskRCNN.cpp:54
polygraphy.util.cuda.DeviceBuffer.dtype
dtype
Definition: cuda.py:123
samplesCommon::BufferManager::copyInputToDevice
void copyInputToDevice()
Copy the contents of input host buffers to input device buffers synchronously.
Definition: buffers.h:395
polygraphy.backend.onnx.util.infer_shapes
def infer_shapes(model)
Definition: onnx/util.py:40
TLTMaskRCNNConfig::MODEL_INPUT_SHAPE
static const Dims3 MODEL_INPUT_SHAPE
Definition: tlt_mrcnn_config.h:172
tests.optim_helper_test.TestGroupParameters.test_raises
def test_raises(self)
Definition: optim_helper_test.py:81
nvinfer1::plugin::MultilevelCropAndResize::MultilevelCropAndResize
MultilevelCropAndResize(int pooled_size, const nvinfer1::Dims &image_size)
Definition: multilevelCropAndResizePlugin.cpp:89
nvinfer1::IPluginCreator::setPluginNamespace
virtual void setPluginNamespace(const char *pluginNamespace)=0
Set the namespace of the plugin creator based on the plugin library it belongs to.
nvinfer1::ResizeMode
ResizeMode
Enumerates various modes of resize in the resize layer. The resize mode is set using setResizeMode().
Definition: NvInfer.h:4331
nmtSample::ComponentWeights::mWeights
std::vector< char > mWeights
Definition: componentWeights.h:42
samplesCommon::safeCudaMalloc
void * safeCudaMalloc(size_t memSize)
Definition: samples/common/common.h:358
backend.trt.test_runner.TestTrtRunner.test_shape_output
def test_shape_output(self)
Definition: trt/test_runner.py:59
ParserOnnxConfig::getVerbosityLevel
virtual nvonnxparser::IOnnxConfig::Verbosity getVerbosityLevel() const
Get the Verbosity Level.
Definition: parserOnnxConfig.h:95
nvinfer1::TensorFormat
TensorFormat
Format of the input/output tensors.
Definition: NvInferRuntimeCommon.h:225
tests.print_test.TestPrint.quant_conv
quant_conv
Definition: print_test.py:48
nmtSample::DebugUtil::DumpTensorPlugin::getOutputDimensions
nvinfer1::Dims getOutputDimensions(int index, const nvinfer1::Dims *inputs, int nbInputDims) override
Definition: debugUtil.cpp:38
nvinfer1::plugin::BatchedNMSDynamicPlugin
Definition: batchedNMSPlugin.h:76
model.WN.start
start
Definition: demo/Tacotron2/waveglow/model.py:101
nvinfer1::plugin::DetectionLayerPluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: detectionLayerPlugin.h:131
tests.tensor_quant_test.TestFakeTensorQuant.test_cuda_ext_with_axis
def test_cuda_ext_with_axis(self)
Definition: tensor_quant_test.py:217
generate.e
e
Definition: 05_folding_constants/generate.py:33
tests.quant_conv_test.TestQuantConv2D
Definition: quant_conv_test.py:48
tests.quant_conv_transposed_test.TestQuantConvTranspose2D.test_no_quant
def test_no_quant(self)
Definition: quant_conv_transposed_test.py:50
Logger::defineTest
static TestAtom defineTest(const std::string &name, int argc, char const *const *argv)
A convenience overloaded version of defineTest() that accepts an array of command-line arguments as i...
Definition: pluginLogging.h:332
MaskRCNNConfig::RPN_ANCHOR_STRIDE
static const int RPN_ANCHOR_STRIDE
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:72
nvinfer1::plugin::GroupNormalizationPlugin::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: groupNormalizationPlugin.cpp:162
sample::Logger::reportTest
static int reportTest(const TestAtom &testAtom, bool pass)
Definition: samples/common/logging.h:365
sample::dumpBuffer
void dumpBuffer(const void *buffer, int volume, const std::string &separator, std::ostream &os)
Definition: sampleUtils.h:204
helpers.calibrator
Definition: demo/BERT/helpers/calibrator.py:1
SampleFasterRCNN::mEngine
std::shared_ptr< nvinfer1::ICudaEngine > mEngine
The TensorRT engine used to run the network.
Definition: sampleFasterRCNN.cpp:92
sample::Profiler::exportJSONProfile
void exportJSONProfile(const std::string &fileName) const
Export a profile to JSON file.
Definition: sampleReporting.cpp:302
nvinfer1::plugin::PriorBox::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: priorBoxPlugin.cpp:288
nvinfer1::ILogger::Severity::kWARNING
@ kWARNING
Application error has been discovered. TensorRT has recovered or fallen back to a default.
bert::fused_multihead_attention_v2_int8_128_64_kernel_sm75_cubin
unsigned char fused_multihead_attention_v2_int8_128_64_kernel_sm75_cubin[]
Definition: fused_multihead_attention_v2_int8_128_64_kernel.sm75.cpp:19
sample::TrtCudaBuffer::reset
void reset(void *ptr=nullptr)
Definition: sampleDevice.h:255
nvinfer1::Dims3::Dims3
Dims3(int32_t d0, int32_t d1, int32_t d2)
Construct a Dims3 from 3 elements.
Definition: NvInfer.h:183
onnx_graphsurgeon.ir.tensor.Constant.outputs
outputs
Definition: tensor.py:199
onnx_graphsurgeon.ir.graph.Graph.nodes
nodes
Definition: graph.py:114
nvinfer1::plugin::ProposalPlugin::mRpnHeight
size_t mRpnHeight
Definition: proposalPlugin.h:111
nvonnxparser::IOnnxConfig::~IOnnxConfig
virtual ~IOnnxConfig()
Definition: NvOnnxConfig.h:48
polygraphy.backend.trt.loader.SaveEngine
Definition: backend/trt/loader.py:489
polygraphy.backend.trt.loader.BaseNetworkFromOnnx.explicit_batch
explicit_batch
Definition: backend/trt/loader.py:101
bert::QKVToContextInterleavedPlugin::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: qkvToContextInt8InterleavedPlugin.cpp:195
nvinfer1::plugin::Region::initialize
int initialize() override
Initialize the layer for execution.
Definition: regionPlugin.cpp:376
polygraphy.backend.trt_legacy.TrtLegacyRunner.activate_impl
def activate_impl(self)
Definition: backend/trt_legacy.py:228
onnx_graphsurgeon.ir.graph.Graph.inputs
inputs
Definition: graph.py:115
polygraphy.tools.args.trt_legacy.TrtLegacyArgs.trt_outputs
trt_outputs
Definition: tools/args/trt_legacy.py:59
export_tacotron2_onnx.DecoderIter.attention_layer
attention_layer
Definition: export_tacotron2_onnx.py:121
comparator.test_data_loader.TestDataLoader.test_no_shape_tensor_false_positive_float
def test_no_shape_tensor_false_positive_float(self)
Definition: test_data_loader.py:75
SampleMNIST::mMeanBlob
SampleUniquePtr< nvcaffeparser1::IBinaryProtoBlob > mMeanBlob
Definition: sampleMNIST.cpp:101
nvinfer1::plugin::CoordConvACPlugin::iType
DataType iType
Definition: coordConvACPlugin.h:85
polygraphy.backend.trt_legacy.TrtLegacyRunner.network_loader
network_loader
Definition: backend/trt_legacy.py:215
polygraphy.common.struct.TensorMetadata
Definition: common/struct.py:22
test_ir.TestNode.test_i_multiple_inputs
def test_i_multiple_inputs(self)
Definition: test_ir.py:196
sample::InferenceOptions::duration
int duration
Definition: sampleOptions.h:151
nvinfer1::plugin::PluginCreatorRegistry::getInstance
static PluginCreatorRegistry & getInstance()
Definition: InferPlugin.cpp:72
nvinfer1::plugin::ResizeNearest::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: resizeNearestPlugin.cpp:170
SampleMNISTAPI
The SampleMNISTAPI class implements the MNIST API sample.
Definition: sampleMNISTAPI.cpp:59
SampleConfig::destroy
void destroy()
Destroy IOnnxConfig object.
Definition: sampleConfig.h:315
half_float::detail::float2half
uint16 float2half(T value)
Convert floating point to half-precision.
Definition: ieee_half.h:681
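For a feel of what a float-to-half conversion produces, the sketch below uses Python's `struct` module, whose `e` format code packs a value as an IEEE 754 binary16 number, and returns the resulting 16-bit pattern. The helper name is hypothetical, and the rounding behavior of `struct` is not guaranteed to match the rounding mode configured in the half library.

```python
import struct


def float_to_half_bits(value: float) -> int:
    """Return the 16-bit IEEE 754 half-precision pattern for value."""
    # Pack as little-endian binary16, then reinterpret the bytes as uint16.
    (bits,) = struct.unpack("<H", struct.pack("<e", value))
    return bits


one = float_to_half_bits(1.0)          # sign 0, exponent 15, mantissa 0
largest = float_to_half_bits(65504.0)  # largest finite half value
```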
MultilevelProposeROIWorkSpace
Definition: maskRCNNKernels.h:155
polygraphy.tools.inspect.inspect.Inspect.add_parser_args
def add_parser_args(self, parser)
Definition: inspect.py:28
polygraphy.tools.args.tf.loader.TfLoaderArgs._enable_tftrt
_enable_tftrt
Definition: tools/args/tf/loader.py:24
UffPoolPluginV2::mPoolingParams
PoolParameters mPoolingParams
Definition: sampleUffPluginV2Ext.cpp:634
nvinfer1::plugin::DetectionOutput::C2
int C2
Definition: nmsPlugin.h:90
nvinfer1::UnaryOperation::kSINH
@ kSINH
Hyperbolic sine.
nvinfer1::plugin::PyramidROIAlign::mInputSize
int mInputSize
Definition: pyramidROIAlignPlugin.h:98
nvinfer1::plugin::GenerateDetection
Definition: generateDetectionPlugin.h:36
SampleUffFasterRcnnParams::visualizeThreshold
float visualizeThreshold
Definition: sampleUffFasterRCNN.cpp:74
nvinfer1::plugin::ResizeNearestPluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: resizeNearestPlugin.cpp:52
test_ir.TestNodeIO.test_pop_index
def test_pop_index(self, field_names)
Definition: test_ir.py:269
nvinfer1::plugin::SplitPlugin::SplitPlugin
SplitPlugin(void const *serialData, size_t serialLength)
Definition: split.h:81
pytorch_quantization.tensor_quant.ScaledQuantDescriptor.axis
def axis(self)
Definition: tensor_quant.py:120
tests.quant_linear_test.TestQuantLinear.test_raise
def test_raise(self)
Definition: quant_linear_test.py:46
nvinfer1::TensorFormat::kDHWC8
@ kDHWC8
Eight channel format where C is padded to a multiple of 8.
tools.test_run.TestTf.test_tf
def test_tf(self)
Definition: test_run.py:145
nvinfer1::IBuilder::setMaxBatchSize
virtual void setMaxBatchSize(int32_t batchSize)=0
Set the maximum batch size.
OutputArgs::allItems
std::vector< int32_t > allItems
Definition: sampleMovieLensMPS.cpp:160
pytorch_quantization.calib.calibrator._Calibrator.compute_amax
def compute_amax(self, *args, **kwargs)
Definition: tools/pytorch-quantization/pytorch_quantization/calib/calibrator.py:48
polygraphy.backend.pyt.loader.BaseLoadPyt.__call__
def __call__(self)
Definition: backend/pyt/loader.py:19
inference.pad_sequences
def pad_sequences(batch)
Definition: Tacotron2/inference.py:130
model.remove
def remove(conv_list)
Definition: demo/Tacotron2/waveglow/model.py:283
polygraphy.logger.logger.Logger.info
def info(self, message, mode=LogMode.EACH)
Definition: Polygraphy/polygraphy/logger/logger.py:266
nvcaffeparser1::BlobNameToTensor::~BlobNameToTensor
~BlobNameToTensor() override=default
polygraphy.tools.args.data_loader.DataLoaderArgs.model_args
model_args
Definition: tools/args/data_loader.py:46
polygraphy.tools.util.misc.get_outputs
def get_outputs(args, name)
Definition: Polygraphy/polygraphy/tools/util/misc.py:37
nvinfer1::plugin::DetectionOutput::clone
IPluginV2Ext * clone() const override
Clone the plugin object.
Definition: nmsPlugin.cpp:166
polygraphy.backend.trt.loader.CreateNetwork.explicit_precision
explicit_precision
Definition: backend/trt/loader.py:71
comparator.test_comparator.TestComparator.test_warmup_runs
def test_warmup_runs(self)
Definition: test_comparator.py:33
nmtSample::SoftmaxLikelihood::getInfo
std::string getInfo() override
Get the textual description of the component.
Definition: softmaxLikelihood.cpp:103
Logger::TestResult::kRUNNING
@ kRUNNING
The test is running.
pytorch_quantization.quant_modules.QuantModuleReplacementHelper.prepare_state
def prepare_state(self, float_module_list=None, custom_map=None)
Definition: quant_modules.py:65
test_ir.TestNode.test_equals_op_mismatch
def test_equals_op_mismatch(self)
Definition: test_ir.py:147
OutputParams
Definition: sampleMovieLens.cpp:49
nvinfer1::plugin::BatchedNMSDynamicPlugin::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: batchedNMSPlugin.cpp:80
nvinfer1::plugin::PriorBoxPluginCreator
Definition: priorBoxPlugin.h:98
generate.add
def add(self, a, b)
Definition: 07_creating_a_model_with_the_layer_api/generate.py:34
SampleUffFasterRcnn::nms_classifier
std::vector< int > nms_classifier(std::vector< float > &boxes_per_cls, std::vector< float > &probs_per_cls, float NMS_OVERLAP_THRESHOLD, int NMS_MAX_BOXES)
NMS helper function in post-processing.
Definition: sampleUffFasterRCNN.cpp:617
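Non-maximum suppression (NMS) in post-processing is typically a greedy loop: take boxes in descending score order and keep a box only if its overlap with every box already kept stays at or below a threshold. The sketch below shows that generic technique in Python. The `(x1, y1, x2, y2)` box layout, the function names, and the return of kept indices are assumptions for illustration, not the sample's implementation.

```python
def iou(a, b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms(boxes, scores, overlap_threshold, max_boxes):
    """Greedy NMS; returns indices of kept boxes, highest score first."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        if len(keep) >= max_boxes:
            break
        # Keep the box only if it does not overlap too much with any kept box.
        if all(iou(boxes[i], boxes[j]) <= overlap_threshold for j in keep):
            keep.append(i)
    return keep


boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
kept = nms(boxes, scores, overlap_threshold=0.5, max_boxes=10)
```

Here the second box overlaps the first with an IoU of about 0.68, so it is suppressed, while the third box does not overlap and is kept.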
SampleDynamicReshape::mParams
samplesCommon::OnnxSampleParams mParams
The parameters for the sample.
Definition: sampleDynamicReshape.cpp:78
polygraphy.comparator
Definition: tools/Polygraphy/polygraphy/comparator/__init__.py:1
nvinfer1::ILogger::log
virtual void log(Severity severity, const char *msg)=0
A callback implemented by the application to handle logging messages.
nvinfer1::DimsExprs::d
const IDimensionExpr * d[Dims::MAX_DIMS]
The extent of each dimension.
Definition: NvInferRuntime.h:379
nvinfer1::plugin::GroupNormalizationPlugin::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: groupNormalizationPlugin.cpp:72
tests.quant_pooling_test.TestQuantAvgPool2d.test_raise
def test_raise(self)
Definition: quant_pooling_test.py:151
polygraphy.backend.onnx.loader.OnnxFromTfGraph.__init__
def __init__(self, graph, opset=None, optimize=None, fold_constant=None)
Definition: backend/onnx/loader.py:50
nmtSample::PinnedHostBuffer::mBuffer
T * mBuffer
Definition: pinnedHostBuffer.h:62
anonymous_namespace{coordConvACPlugin.cpp}::COORDCONV_AC_PLUGIN_VERSION
const char * COORDCONV_AC_PLUGIN_VERSION
Definition: coordConvACPlugin.cpp:27
data_function.MelAudioLoader.__len__
def __len__(self)
Definition: waveglow/data_function.py:68
SampleConfig::mEngineFilename
std::string mEngineFilename
Definition: sampleConfig.h:37
tools.test_run.TestTf.test_tf_save_timeline
def test_tf_save_timeline(self)
Definition: test_run.py:163
polygraphy.backend.trt.loader.NetworkFromOnnxBytes.__init__
def __init__(self, model_bytes, explicit_precision=None)
Definition: backend/trt/loader.py:112
inference_perf.main
def main()
Definition: inference_perf.py:51
nvinfer1::plugin::FlattenConcat::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: flattenConcat.cpp:82
tools.test_run.TestOther.test_subprocess_sanity
def test_subprocess_sanity(self)
Definition: test_run.py:225
nmtSample::LimitedSamplesDataReader::gCurrentPosition
int gCurrentPosition
Definition: limitedSamplesDataReader.h:43
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer.disable_calib
def disable_calib(self)
Definition: tensor_quantizer.py:172
pytorch_quantization.tensor_quant.ScaledQuantDescriptor.unsigned
def unsigned(self)
Definition: tensor_quant.py:144
inference.checkpoint_from_distributed
def checkpoint_from_distributed(state_dict)
Definition: Tacotron2/inference.py:71
nvcaffeparser1::CaffeParser::setPluginFactoryExt
void setPluginFactoryExt(nvcaffeparser1::IPluginFactoryExt *factory) override
Set the IPluginFactoryExt used to create the user defined pluginExts.
Definition: caffeParser.h:49
samplesCommon::PPM::magic
std::string magic
Definition: samples/common/common.h:654
helpers.tokenization.FullTokenizer.wordpiece_tokenizer
wordpiece_tokenizer
Definition: tokenization.py:167
pytorch_quantization.nn.modules.quant_conv.QuantConvTranspose2d.__init__
def __init__(self, in_channels, out_channels, kernel_size, stride=1, padding=0, output_padding=0, groups=1, bias=True, dilation=1, padding_mode='zeros', **kwargs)
Definition: quant_conv.py:314
helpers.data_processing.get_predictions
def get_predictions(doc_tokens, features, results, n_best_size, max_answer_length)
Definition: data_processing.py:333
test_api.TestApi.setup_method
def setup_method(self)
Definition: test_api.py:26
SampleMNISTAPI::mParams
SampleMNISTAPIParams mParams
The parameters for the sample.
Definition: sampleMNISTAPI.cpp:87
nvinfer1::ILoop::addTripLimit
virtual ITripLimitLayer * addTripLimit(ITensor &tensor, TripLimit limit) noexcept=0
Add a trip-count limiter, based on the given tensor.
polygraphy.util.cuda.Cuda.stream_synchronize
def stream_synchronize(self, stream)
Definition: cuda.py:51
samplesCommon::BufferManager::mBatchSize
int mBatchSize
The batch size for legacy networks, 0 otherwise.
Definition: buffers.h:456
tools.test_run.TestOnnxrt.test_onnx_rt
def test_onnx_rt(self)
Definition: test_run.py:194
models.meta.Model
Definition: meta.py:30
nvinfer1::RNNGateType::kCELL
@ kCELL
Cell gate (c).
nvinfer1::plugin::GroupNormalizationPlugin::configurePlugin
void configurePlugin(const nvinfer1::DynamicPluginTensorDesc *in, int nbInputs, const nvinfer1::DynamicPluginTensorDesc *out, int nbOutputs) override
Definition: groupNormalizationPlugin.cpp:200
tests.quant_instancenorm_test.TestQuantInstanceNorm2D.test_fake_quant_per_tensor
def test_fake_quant_per_tensor(self)
Definition: quant_instancenorm_test.py:114
nvinfer1::plugin::PyramidROIAlign::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: pyramidROIAlignPlugin.cpp:164
sample::fillBufferHalf
void fillBufferHalf(void *buffer, int volume, H min, H max)
Definition: sampleUtils.h:185
SampleConfig::~SampleConfig
~SampleConfig()
Definition: sampleConfig.h:73
SampleUffPluginV2Ext
Definition: sampleUffPluginV2Ext.cpp:188
samplesCommon::vPPM::w
int w
Definition: samples/common/common.h:663
nmtSample::SLPEmbedder::mKernelWeights
nvinfer1::Weights mKernelWeights
Definition: slpEmbedder.h:52
LogStreamConsumerBuffer::sync
virtual int sync()
Definition: pluginLogging.h:62
polygraphy.backend.trt.loader.BaseNetworkFromOnnx
Definition: backend/trt/loader.py:94
nvinfer1::IPluginV2DynamicExt::~IPluginV2DynamicExt
virtual ~IPluginV2DynamicExt()
Definition: NvInferRuntime.h:539
model.Decoder.attention_rnn_dim
attention_rnn_dim
Definition: demo/Tacotron2/tacotron2/model.py:245
bert::QKVToContextInterleavedPlugin::mXmmaKernel
const FusedMultiHeadAttentionXMMAKernelV2 * mXmmaKernel
Definition: qkvToContextInt8InterleavedPlugin.h:88
half_float::detail::ceil
half ceil(half arg)
Nearest integer not less than half value.
Definition: ieee_half.h:2551
bert::fused_multihead_attention_fp16_64_64_kernel_sm75_cu_o_len
unsigned int fused_multihead_attention_fp16_64_64_kernel_sm75_cu_o_len
Definition: fused_multihead_attention_fp16_64_64_kernel.sm75.cpp:1438
bert::TFusedMHAKernelFactory::TFusedMHAKernelFactory
TFusedMHAKernelFactory()=default
builder.BertConfig.use_fp16
use_fp16
Definition: builder.py:97
tests.quant_rnn_test.TestQuantLSTMCell.test_quant_input_hidden
def test_quant_input_hidden(self, verbose)
Definition: quant_rnn_test.py:165
nvinfer1::plugin::gLogInfo
LogStream< ILogger::Severity::kINFO > gLogInfo
Definition: checkMacrosPlugin.cpp:50
SampleUffFasterRcnnParams::outputBboxSize
int outputBboxSize
Definition: sampleUffFasterRCNN.cpp:72
generate.d
d
Definition: 05_folding_constants/generate.py:32
nvinfer1::plugin::Reorg::detachFromContext
void detachFromContext() override
Detach the plugin object from its execution context.
Definition: reorgPlugin.cpp:173
nmtSample::DataWriter::initialize
virtual void initialize()=0
Called right before inference starts.
nvinfer1::plugin::GridAnchorGenerator::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: gridAnchorPlugin.cpp:173
nvinfer1::plugin::GridAnchorGenerator::attachToContext
void attachToContext(cudnnContext *cudnnContext, cublasContext *cublasContext, IGpuAllocator *gpuAllocator) override
Attach the plugin object to an execution context and grant the plugin the access to some context reso...
Definition: gridAnchorPlugin.cpp:331
nvinfer1::plugin::GridAnchorParameters::numAspectRatios
int32_t numAspectRatios
Definition: NvInferPluginUtils.h:147
sample::MirroredBuffer::mSize
size_t mSize
Definition: sampleDevice.h:349
sample::AllOptions::system
SystemOptions system
Definition: sampleOptions.h:190
util.test_misc.FindInDictCase.__init__
def __init__(self, name, map, index, expected)
Definition: test_misc.py:35
sample::Logger::defineTest
static TestAtom defineTest(const std::string &name, int argc, char const *const *argv)
A convenience overloaded version of defineTest() that accepts an array of command-line arguments as i...
Definition: samples/common/logging.h:311
LogStreamConsumerBase::mLogMutex
std::mutex mLogMutex
Definition: pluginLogging.h:116
nvinfer1::plugin::throwCublasError
void throwCublasError(const char *file, const char *function, int line, int status, const char *msg)
Definition: checkMacrosPlugin.cpp:62
polygraphy.logger.logger.Logger.indent
def indent(self, level=1)
Definition: Polygraphy/polygraphy/logger/logger.py:162
samplesCommon::HostMemory::data
void * data() const noexcept override
A pointer to the raw data that is owned by the library.
Definition: samples/common/common.h:310
tests.calibrator_test.TestPercentileCalibrator.test_two_tensor
def test_two_tensor(self, verbose)
Definition: calibrator_test.py:279
anonymous_namespace{pluginLogging.h}::LOG_FATAL
LogStreamConsumer LOG_FATAL(const Logger &logger)
Produces a LogStreamConsumer object that can be used to log messages of severity kINTERNAL_ERROR.
Definition: pluginLogging.h:522
sample::LogStreamConsumerBuffer::mOutput
std::ostream & mOutput
Definition: samples/common/logging.h:102
SampleErrorRecorder::empty
bool empty() const noexcept
Simple helper function that.
Definition: ErrorRecorder.h:80
polygraphy.tools.util.script.Script.add_import
def add_import(self, imports, frm=None)
Definition: script.py:146
nvinfer1::utility::getOptions
TRTParsedArgs getOptions(int argc, const char *const *argv, const std::vector< TRTOption > &options)
Parse the input arguments passed to main() and extract options as well as positional arguments.
Definition: getOptions.cpp:237
SampleOnnxMNIST::mInputDims
nvinfer1::Dims mInputDims
The dimensions of the input to the network.
Definition: sampleOnnxMNIST.cpp:71
nvinfer1::plugin::PyramidROIAlign::mPooledSize
xy_t mPooledSize
Definition: pyramidROIAlignPlugin.h:94
tests.tensor_quant_test.TestFakeTensorQuant.test_simple_run
def test_simple_run(self)
Definition: tensor_quant_test.py:137
nvcaffeparser1::BinaryProtoBlob::mDimensions
nvinfer1::DimsNCHW mDimensions
Definition: binaryProtoBlob.h:63
SampleBuffer::operator=
SampleBuffer & operator=(SampleBuffer &&sampleBuffer) noexcept
Definition: sampleReformatFreeIO.cpp:160
nvinfer1::ITensor::resetDynamicRange
virtual void resetDynamicRange()=0
Undo effect of setDynamicRange.
pytorch_quantization.nn.modules.quant_rnn.GRU.__init__
def __init__(self, *args, **kwargs)
Definition: modules/quant_rnn.py:303
polygraphy.backend.trt_legacy.LoadNetworkFromCaffe.deploy
deploy
Definition: backend/trt_legacy.py:145
SamplePlugin
The SamplePlugin class implements samplePlugin.
Definition: samplePlugin.cpp:55
tests.print_test.TestPrint.test_print_calibrator
def test_print_calibrator(self)
Definition: print_test.py:55
onnx_graphsurgeon.ir.graph.Graph.name_idx
name_idx
Definition: graph.py:126
nvinfer1::plugin::MultilevelCropAndResize::initialize
int initialize() override
Initialize the layer for execution.
Definition: multilevelCropAndResizePlugin.cpp:106
half_float::detail::half_caster< T, half, R >::cast
static T cast(half arg)
Definition: samples/common/half.h:2644
nvinfer1::CUDADriverWrapper::cuLinkComplete
CUresult cuLinkComplete(CUlinkState state, void **cubinOut, size_t *sizeOut) const
Definition: cudaDriverWrapper.cpp:81
SampleErrorRecorder::getErrorDesc
IErrorRecorder::ErrorDesc getErrorDesc(int32_t errorIdx) const noexcept final
Returns the c-style string description of the error.
Definition: ErrorRecorder.h:54
nvinfer1::plugin::MultilevelCropAndResizePluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: multilevelCropAndResizePlugin.cpp:57
model.WaveGlow.n_group
n_group
Definition: demo/Tacotron2/waveglow/model.py:166
OutputArgs::expectedPredictedMaxRatingItemProb
float expectedPredictedMaxRatingItemProb
Definition: sampleMovieLensMPS.cpp:159
nvinfer1::PluginVersion::kV2_EXT
@ kV2_EXT
IPluginV2Ext.
polygraphy.tools.util.script.Script.imports
imports
Definition: script.py:135
nvinfer1::plugin::PriorBoxPluginCreator::mFC
static PluginFieldCollection mFC
Definition: priorBoxPlugin.h:116
nvinfer1::plugin::BatchTilePlugin::enqueue
int enqueue(int batchSize, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: batchTilePlugin.cpp:97
FCPlugin::deserializeToDevice
void deserializeToDevice(const char *&hostBuffer, void *&deviceWeights, size_t size)
Definition: samples/opensource/samplePlugin/fcPlugin.h:302
UffPoolPluginV2::read
T read(const char *&buffer) const
Definition: sampleUffPluginV2Ext.cpp:587
nvinfer1::plugin::SplitPlugin::SplitPlugin
SplitPlugin(int axis, int *const &output_lengths, int noutput)
Definition: split.h:69
nvinfer1::IActivationLayer
An Activation layer in a network definition.
Definition: NvInfer.h:1643
half_float::detail::enable< T, half, expr, void >::type
T type
Definition: ieee_half.h:376
nvonnxparser::EnumMax< ErrorCode >
int32_t EnumMax< ErrorCode >()
Definition: NvOnnxParser.h:80
train.parse_args
def parse_args(parser)
Definition: train.py:47
polygraphy.backend.base.runner.BaseRunner.deactivate
def deactivate(self)
Definition: backend/base/runner.py:143
nvinfer1::plugin::FlattenConcat::mIgnoreBatch
bool mIgnoreBatch
Definition: flattenConcat.h:101
polygraphy.tools.util.misc.get_outputs_for_script
def get_outputs_for_script(script, outputs)
Definition: Polygraphy/polygraphy/tools/util/misc.py:44
builder.transformer_layer_opt
def transformer_layer_opt(prefix, config, init_dict, network, input_tensor, imask)
Definition: builder.py:223
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer._unsigned
_unsigned
Definition: tensor_quantizer.py:79
nvinfer1::plugin::PriorBox::mH
int mH
Definition: priorBoxPlugin.h:93
MNISTBatchStream::getLabels
float * getLabels() override
Definition: BatchStream.h:78
pytorch_quantization.nn.modules.quant_bert
Definition: quant_bert.py:1
bert::EmbLayerNormVarSeqlenPlugin::mLd
size_t mLd
Definition: embLayerNormVarSeqlenPlugin.h:93
tests.tensor_quantizer_test.TestTensorQuantizer.test_max_calib
def test_max_calib(self)
Definition: tensor_quantizer_test.py:178
LogStreamConsumer::setReportableSeverity
void setReportableSeverity(Severity reportableSeverity)
Definition: pluginLogging.h:150
sample::anonymous_namespace{sampleOptions.cpp}::checkEraseOption
bool checkEraseOption(Arguments &arguments, const std::string &option, T &value)
Definition: sampleOptions.cpp:177
UffPoolPluginV2Creator::deserializePlugin
IPluginV2 * deserializePlugin(const char *name, const void *serialData, size_t serialLength) override
Called during deserialization of plugin layer.
Definition: sampleUffPluginV2Ext.cpp:671
SampleCharRNNBindingNames::INPUT_BLOB_NAME
const char * INPUT_BLOB_NAME
Definition: sampleCharRNN.cpp:87
BatchStream::mFileBatchPos
int mFileBatchPos
Definition: BatchStream.h:377
nvinfer1::RNNGateType::kRESET
@ kRESET
Reset gate (r).
nvinfer1::WeightsRole::kSCALE
@ kSCALE
Scale part of IScaleLayer.
half_float::half::operator+=
detail::enable< half &, T >::type operator+=(T rhs)
Arithmetic assignment.
Definition: ieee_half.h:1127
helpers.tokenization.BasicTokenizer._run_strip_accents
def _run_strip_accents(self, text)
Definition: tokenization.py:250
SampleOnnxMNIST::mNumber
int mNumber
The number to classify.
Definition: sampleOnnxMNIST.cpp:73
nvinfer1::plugin::InstanceNormalizationPlugin::getOutputDimensions
DimsExprs getOutputDimensions(int outputIndex, const nvinfer1::DimsExprs *inputs, int nbInputs, nvinfer1::IExprBuilder &exprBuilder) override
Definition: instanceNormalizationPlugin.cpp:148
nvinfer1::plugin::ProposalPlugin::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: proposalPlugin.cpp:205
half_float::half::operator*=
half & operator*=(float rhs)
Arithmetic assignment.
Definition: samples/common/half.h:1506
onnx_graphsurgeon.ir.graph.Graph.outputs
outputs
Definition: graph.py:116
RefineNMSParameters::backgroundLabelId
int backgroundLabelId
Definition: maskRCNNKernels.h:86
model.Decoder.decode
def decode(self, decoder_input, attention_hidden, attention_cell, decoder_hidden, decoder_cell, attention_weights, attention_weights_cum, attention_context, memory, processed_memory, mask)
Definition: demo/Tacotron2/tacotron2/model.py:389
SampleFasterRCNN::kIMG_H
static const int kIMG_H
Definition: sampleFasterRCNN.cpp:88
SampleConfig::mFirstCalibBatch
int64_t mFirstCalibBatch
Definition: sampleConfig.h:49
onnx_graphsurgeon.ir.tensor.Tensor.__eq__
def __eq__(self, other)
Definition: tensor.py:140
nvcaffeparser1::CaffeWeightFactory::isInitialized
bool isInitialized()
Definition: caffeWeightFactory.cpp:189
polygraphy.comparator.compare.OutputCompareResult.max_reldiff
max_reldiff
Definition: compare.py:40
nvinfer1::plugin::ProposalLayer::mIOUThreshold
float mIOUThreshold
Definition: proposalLayerPlugin.h:100
pytorch_quantization.nn.modules.quant_bert.QuantBertSelfAttention.forward
def forward(self, hidden_states, attention_mask=None, head_mask=None, encoder_hidden_states=None, encoder_attention_mask=None)
Definition: quant_bert.py:83
generate.max
def max(self, *args)
Definition: 08_replacing_a_subgraph/generate.py:27
pytorch_quantization.calib.histogram.HistogramCalibrator.collect
def collect(self, x)
Definition: histogram.py:58
nvinfer1::IShapeLayer
Layer type for getting shape of a tensor.
Definition: NvInfer.h:4044
SampleAlgorithmSelector::processInput
bool processInput(const samplesCommon::BufferManager &buffers, const std::string &inputTensorName, int inputFileIdx) const
Reads the input and mean data, preprocesses, and stores the result in a managed buffer.
Definition: sampleAlgorithmSelector.cpp:444
nmtSample::Vocabulary::mTokenToId
std::map< std::string, int > mTokenToId
Definition: vocabulary.h:72
nvinfer1::plugin::BatchedNMSDynamicPlugin::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: batchedNMSPlugin.cpp:97
backend.trt.test_loader.TestConfigLoader.test_defaults
def test_defaults(self, identity_builder_network)
Definition: trt/test_loader.py:172
model.Encoder.lstm
lstm
Definition: demo/Tacotron2/tacotron2/model.py:194
SampleDynamicReshape::build
bool build()
Builds both engines.
Definition: sampleDynamicReshape.cpp:109
sample::ModelOptions::parse
void parse(Arguments &arguments) override
Definition: sampleOptions.cpp:444
sample::anonymous_namespace{sampleOptions.cpp}::getShapesBuild
bool getShapesBuild(Arguments &arguments, std::unordered_map< std::string, ShapeRange > &shapes, const char *argument, nvinfer1::OptProfileSelector selector)
Definition: sampleOptions.cpp:242
polygraphy.tools.args.tf.loader.TfLoaderArgs.trt_runner_args
trt_runner_args
Definition: tools/args/tf/loader.py:63
sample::anonymous_namespace{sampleInference.cpp}::Iteration::getStream
TrtCudaStream & getStream(StreamType t)
Definition: sampleInference.cpp:387
MaskRCNNConfig::RPN_BBOX_STD_DEV
static const float RPN_BBOX_STD_DEV[]
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:36
nmtSample::DataWriter::~DataWriter
~DataWriter() override=default
nvinfer1::TensorFormat::kLINEAR
@ kLINEAR
Row major linear format.
pytorch_quantization.nn.modules.quant_conv
Definition: quant_conv.py:1
nvinfer1::plugin::PyramidROIAlign
Definition: pyramidROIAlignPlugin.h:36
half_float::detail::operator!=
enable< bool, T, U >::type operator!=(T x, U y)
Comparison for inequality.
Definition: ieee_half.h:2134
nvcaffeparser1::parseEltwise
nvinfer1::ILayer * parseEltwise(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &, BlobNameToTensor &tensors)
Definition: parseEltwise.cpp:23
sample::InferenceTrace::inEnd
float inEnd
Definition: sampleReporting.h:95
SampleINT8API::prepareInput
bool prepareInput(const samplesCommon::BufferManager &buffers)
Reads the ppm input image, preprocesses, and stores the result in a managed buffer.
Definition: sampleINT8API.cpp:408
mrcnn_to_trt_single.parse_command_line_arguments
def parse_command_line_arguments(args=None)
Definition: mrcnn_to_trt_single.py:36
nvinfer1::plugin::PyramidROIAlign::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: pyramidROIAlignPlugin.cpp:122
nvinfer1::plugin::PriorBoxParameters::variance
float variance[4]
Definition: NvInferPluginUtils.h:97
nvinfer1::LoopOutput::kREVERSE
@ kREVERSE
Output value is concatenation of values of tensor for each iteration, in reverse order.
text.cleaners.convert_to_ascii
def convert_to_ascii(text)
Definition: cleaners.py:79
nmtSample::Projection::addToModel
virtual void addToModel(nvinfer1::INetworkDefinition *network, nvinfer1::ITensor *input, nvinfer1::ITensor **outputLogits)=0
Add raw logits to the network.
nvinfer1::plugin::Reorg::~Reorg
~Reorg() override=default
nvcaffeparser1::BlobNameToTensor::add
void add(const std::string &name, nvinfer1::ITensor *tensor)
Definition: blobNameToTensor.h:31
nmtSample::BeamSearchPolicy::mCurrentLikelihoods
std::vector< float > mCurrentLikelihoods
Definition: beamSearchPolicy.h:70
sample::anonymous_namespace{sampleInference.cpp}::EnqueueGraph
Functor to enqueue inference from CUDA Graph.
Definition: sampleInference.cpp:233
nvinfer1::DimensionOperation::kSUB
@ kSUB
Subtract the second element from the first.
MNISTBatchStream::getBatchesRead
int getBatchesRead() const override
Definition: BatchStream.h:83
nvinfer1::IAlgorithmVariant
Provides a unique 128-bit identifier, which along with the input and output information denotes the v...
Definition: NvInfer.h:6202
nvinfer1::plugin::ProposalLayerPluginCreator::mIOUThreshold
float mIOUThreshold
Definition: proposalLayerPlugin.h:137
onnx_graphsurgeon.ir.graph.NodeIDAdder.graph
graph
Definition: graph.py:46
FCPlugin::mSrcDescriptor
cudnnTensorDescriptor_t mSrcDescriptor
Definition: samples/opensource/samplePlugin/fcPlugin.h:317
nvinfer1::PluginFieldType
PluginFieldType
Definition: NvInferRuntimeCommon.h:826
builder
Definition: builder.py:1
sample::Logger::defineTest
static TestAtom defineTest(const std::string &name, const std::string &cmdline)
Define a test for logging.
Definition: samples/common/logging.h:297
polygraphy.backend.trt_legacy.LoadNetworkFromCaffe.dtype
dtype
Definition: backend/trt_legacy.py:156
IOutputDimensionsFormula
Application-implemented interface to compute layer output sizes.
Definition: NvInfer.h:3230
Args
Definition: demo/BERT/infer_c/common.h:26
nvinfer1::EnumMax< DeviceType >
constexpr int32_t EnumMax< DeviceType >()
Maximum number of elements in DeviceType enum.
Definition: NvInferRuntime.h:714
tests.quant_conv_transposed_test.TestQuantConvTranspose1D.test_weight_fake_quant_per_tensor
def test_weight_fake_quant_per_tensor(self)
Definition: quant_conv_transposed_test.py:355
nvinfer1::EnumMax< EngineCapability >
constexpr int32_t EnumMax< EngineCapability >()
Maximum number of elements in EngineCapability enum.
Definition: NvInferRuntime.h:60
half_float::detail::exp
expr exp(half arg)
Exponential function.
Definition: ieee_half.h:2326
pytorch_quantization.nn.modules.clip.Clip
Definition: clip.py:27
anonymous_namespace{resizeNearestPlugin.cpp}::RESIZE_PLUGIN_NAME
const char * RESIZE_PLUGIN_NAME
Definition: resizeNearestPlugin.cpp:33
nvinfer1::plugin::Normalize::W
int W
Definition: normalizePlugin.h:103
polygraphy.logger.logger.Logger.verbose
def verbose(self, message, mode=LogMode.EACH)
Definition: Polygraphy/polygraphy/logger/logger.py:262
half_float::detail::half2float_impl
float half2float_impl(uint16 value, float, true_type)
Convert half-precision to IEEE single-precision.
Definition: ieee_half.h:746
classification_flow.evaluate_onnx
def evaluate_onnx(onnx_filename, data_loader, criterion, print_freq)
Definition: classification_flow.py:276
builder.build_engine
def build_engine(batch_sizes, workspace_size, sequence_lengths, config, weights_dict, squad_json, vocab_file, calibrationCacheFile, calib_num)
Definition: builder.py:583
nvinfer1::EnumMax< ScaleMode >
constexpr int32_t EnumMax< ScaleMode >()
Maximum number of elements in ScaleMode enum.
Definition: NvInfer.h:2077
tests.utils.compare
def compare(a, b, rtol=1e-7, atol=1e-6, ctol=1e-6)
Definition: tools/pytorch-quantization/tests/utils.py:72
polygraphy.logger.logger.Logger.warning
def warning(self, message, mode=LogMode.EACH)
Definition: Polygraphy/polygraphy/logger/logger.py:278
nvinfer1::plugin::DetectionLayer::mParam
RefineNMSParameters mParam
Definition: detectionLayerPlugin.h:103
util.test_cuda.ResizeTestCase.new_bytes
new_bytes
Definition: test_cuda.py:30
pytorch_quantization.tensor_quant.FakeTensorQuantFunction.forward
def forward(ctx, inputs, amax, num_bits=8, unsigned=False, narrow_range=True)
Definition: tensor_quant.py:306
nmtSample::DataReader::ptr
std::shared_ptr< DataReader > ptr
Definition: dataReader.h:34
nmtSample::Likelihood::getLikelihoodCombinationOperator
virtual LikelihoodCombinationOperator::ptr getLikelihoodCombinationOperator() const =0
nvinfer1::plugin::ReorgPluginCreator::~ReorgPluginCreator
~ReorgPluginCreator() override=default
polygraphy.util.misc.try_match_shape
def try_match_shape(arr, shape)
Definition: Polygraphy/polygraphy/util/misc.py:325
polygraphy.backend.trt.loader.NetworkFromOnnxBytes._model_bytes
_model_bytes
Definition: backend/trt/loader.py:120
nvinfer1::plugin::InstanceNormalizationPluginCreator::InstanceNormalizationPluginCreator
InstanceNormalizationPluginCreator()
Definition: instanceNormalizationPlugin.cpp:325
data_function.TextMelLoader.stft
stft
Definition: tacotron2/data_function.py:38
half_float::detail::half_caster< half, U, R >::cast_impl
static half cast_impl(U arg, true_type)
Definition: samples/common/half.h:2628
polygraphy.tools.args.comparator.ComparatorCompareArgs.fail_fast
fail_fast
Definition: tools/args/comparator.py:109
half_float::detail::atan
expr atan(half arg)
Arc tangent function.
Definition: ieee_half.h:2452
nmtSample::DebugUtil::DumpTensorPlugin::serialize
void serialize(void *buffer) override
Serialize the layer.
Definition: debugUtil.cpp:118
nvinfer1::ILayer::getNbOutputs
virtual int32_t getNbOutputs() const =0
Get the number of outputs of a layer.
nvinfer1::DynamicPluginTensorDesc::desc
PluginTensorDesc desc
Information required to interpret a pointer to tensor data, except that desc.dims has -1 in place of ...
Definition: NvInferRuntime.h:390
nvinfer1::IPluginV2::clone
virtual IPluginV2 * clone() const =0
Clone the plugin object.
nvinfer1::IShuffleLayer::setReshapeDimensions
virtual void setReshapeDimensions(Dims dimensions)=0
Set the reshaped dimensions.
tests.quant_instancenorm_test.TestQuantInstanceNorm3D
Definition: quant_instancenorm_test.py:149
nvinfer1::IBuilder::destroy
virtual void destroy()=0
Destroy this object.
UffPoolPluginV2Creator::mNamespace
std::string mNamespace
Definition: sampleUffPluginV2Ext.cpp:689
nvinfer1::plugin::PriorBox::getWorkspaceSize
size_t getWorkspaceSize(int maxBatchSize) const override
Definition: priorBoxPlugin.cpp:168
bert::fused_multihead_attention_fp16_384_64_kernel_sm75_cu_o
unsigned char fused_multihead_attention_fp16_384_64_kernel_sm75_cu_o[]
Definition: fused_multihead_attention_fp16_384_64_kernel.sm75.cpp:20
polygraphy.tools.args.onnxrt.runner.OnnxrtRunnerArgs.onnx_loader_args
onnx_loader_args
Definition: tools/args/onnxrt/runner.py:25
train.init_distributed
def init_distributed(args, world_size, rank, group_name)
Definition: train.py:158
SampleConfig::mCalibBatchSize
int64_t mCalibBatchSize
Definition: sampleConfig.h:47
nvinfer1::plugin::GridAnchorBasePluginCreator::~GridAnchorBasePluginCreator
~GridAnchorBasePluginCreator() override=default
nvinfer1::DimsHW::w
int32_t & w()
Get the width.
Definition: NvInfer.h:144
nvinfer1::plugin::BatchedNMSBasePluginCreator::mClipBoxes
bool mClipBoxes
Definition: batchedNMSPlugin.h:135
nvinfer1::ILogger
Application-implemented logging interface for the builder, engine and runtime.
Definition: NvInferRuntimeCommon.h:1076
polygraphy.comparator.compare.OutputCompareResult.__bool__
def __bool__(self)
Definition: compare.py:43
nvinfer1::IExecutionContext
Context for executing inference using an engine, with functionally unsafe features.
Definition: NvInferRuntime.h:1583
half_float::detail::isnan
bool isnan(half arg)
Check for NaN.
Definition: ieee_half.h:2744
anonymous_namespace{serialize.hpp}::Serializer< std::vector< T >, typename std::enable_if< std::is_arithmetic< T >::value||std::is_enum< T >::value||std::is_pod< T >::value >::type >::deserialize
static void deserialize(void const **buffer, size_t *buffer_size, std::vector< T > *value)
Definition: serialize.hpp:102
nvinfer1::plugin::MultilevelCropAndResize::mPooledSize
xy_t mPooledSize
Definition: multilevelCropAndResizePlugin.h:94
nvinfer1::plugin::MultilevelCropAndResizePluginCreator
Definition: multilevelCropAndResizePlugin.h:105
nvinfer1::plugin::ResizeNearestPluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: resizeNearestPlugin.h:116
polygraphy.tools.args.trt.loader
Definition: tools/args/trt/loader.py:1
polygraphy.tools.args.tf.loader.TfLoaderArgs.trt_legacy_args
trt_legacy_args
Definition: tools/args/tf/loader.py:60
tests.print_test.TestPrint.test_print_tensor_quantizer
def test_print_tensor_quantizer(self)
Definition: print_test.py:39
ProposalWorkSpace::sortClassScoreDims
nvinfer1::DimsHW sortClassScoreDims
Definition: maskRCNNKernels.h:132
nvinfer1::CUDADriverWrapper::CUDADriverWrapper
CUDADriverWrapper()
Definition: cudaDriverWrapper.cpp:41
nmtSample::TextReader::mVocabulary
Vocabulary::ptr mVocabulary
Definition: textReader.h:46
SampleMaskRCNN::decodeOutput
std::vector< MaskRCNNUtils::BBoxInfo > decodeOutput(const int imageIdx, void *detectionsHost, void *masksHost)
Definition: sampleUffMaskRCNN.cpp:523
sample::anonymous_namespace{sampleInference.cpp}::Iteration::fetchOutputData
void fetchOutputData()
Definition: sampleInference.cpp:376
nmtSample::PinnedHostBuffer
Wrapper for the pinned host memory region.
Definition: pinnedHostBuffer.h:32
bert::Fused_multihead_attention_params::packed_mask_stride_in_bytes
int64_t packed_mask_stride_in_bytes
Definition: fused_multihead_attention.h:60
nvinfer1::plugin::RPROIPlugin::destroy
void destroy() override
Destroy the plugin object.
Definition: nvFasterRCNNPlugin.cpp:231
backend.tf.test_loader.TestSaveGraph.test_save_pb
def test_save_pb(self)
Definition: tf/test_loader.py:61
nvinfer1::plugin::LReLU::clone
IPluginV2 * clone() const override
Clone the plugin object.
Definition: lReluPlugin.cpp:125
bert::SkipLayerNormInterleavedPluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: skipLayerNormInt8InterleavedPlugin.cpp:280
bert::TFusedMHAKernelFactory
Definition: fused_multihead_attention.h:292
onnx_graphsurgeon.ir.tensor.Tensor.o
def o(self, consumer_idx=0, tensor_idx=0)
Definition: tensor.py:112
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer.narrow_range
def narrow_range(self)
Definition: tensor_quantizer.py:147
nvinfer1::PluginType::kYOLOREORG
@ kYOLOREORG
YOLO Reorg Plugin.
builder.custom_fc
def custom_fc(config, network, input_tensor, out_dims, W)
Definition: builder.py:213
samplesCommon::HostMemory::type
DataType type() const noexcept override
The type of the memory that was allocated.
Definition: samples/common/common.h:318
SampleMovieLensParams::userToItemsMap
std::map< int32_t, std::vector< int32_t > > userToItemsMap
Definition: sampleMovieLens.cpp:72
nmtSample::Count_t
std::map< Segment_t, int > Count_t
Definition: bleuScoreWriter.cpp:33
pytorch_quantization.nn.modules.quant_linear
Definition: quant_linear.py:1
polygraphy.tools.registry.MissingTool.err
err
Definition: registry.py:34
pytorch_quantization.optim.helper.freeze_parameters
def freeze_parameters(model, patterns)
Definition: helper.py:98
ParserOnnxConfig::reduceVerbosity
virtual void reduceVerbosity()
Decrease verbosity level.
Definition: parserOnnxConfig.h:103
SampleGoogleNet::infer
bool infer()
Runs the TensorRT inference engine for this sample.
Definition: sampleGoogleNet.cpp:157
nvinfer1::plugin::GenerateDetection::mValidCnt
std::shared_ptr< CudaBind< int > > mValidCnt
Definition: generateDetectionPlugin.h:102
bert::Fused_multihead_attention_params_v2::h
int h
Definition: fused_multihead_attention_v2.h:56
sample::MirroredBuffer::hostToDevice
void hostToDevice(TrtCudaStream &stream)
Definition: sampleDevice.h:333
polygraphy.tools.args.onnx.loader.OnnxLoaderArgs.model_args
model_args
Definition: tools/args/onnx/loader.py:51
nvinfer1::plugin::RPROIPlugin::getWorkspaceSize
size_t getWorkspaceSize(int maxBatchSize) const override
Definition: nvFasterRCNNPlugin.cpp:143
bert::SkipLayerNormInterleavedPluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: skipLayerNormInt8InterleavedPlugin.cpp:275
sample::MirroredBuffer::allocate
void allocate(size_t size)
Definition: sampleDevice.h:316
tests.calibrator_test.TestEntropyCalibrator.test_repr
def test_repr(self)
Definition: calibrator_test.py:192
nvinfer1::plugin::MultilevelProposeROI::mDeviceScores
float ** mDeviceScores
Definition: multilevelProposeROIPlugin.h:109
nvinfer1::DimensionOperation::kCEIL_DIV
@ kCEIL_DIV
Division rounding up.
nvcaffeparser1::CaffeParser::parsePriorBoxParam
std::vector< nvinfer1::PluginField > parsePriorBoxParam(const trtcaffe::LayerParameter &msg, CaffeWeightFactory &weightFactory, BlobNameToTensor &tensors)
Definition: caffeParser.cpp:90
perf_varseqlen.DeviceBuffer.__init__
def __init__(self, shape, dtype=trt.int32)
Definition: perf_varseqlen.py:29
onnx_graphsurgeon.logger.logger.Logger.warning
def warning(self, message, mode=LogMode.EACH)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:235
tools.test_inspect.TestInspectModel.test_model_onnx
def test_model_onnx(self, case)
Definition: test_inspect.py:251
tools.test_run.TestOnnxrt.test_onnx_rt_exclude_outputs_with_layerwise
def test_onnx_rt_exclude_outputs_with_layerwise(self)
Definition: test_run.py:212
bert::QKVToContextInterleavedPluginCreator::mFC
static nvinfer1::PluginFieldCollection mFC
Definition: qkvToContextInt8InterleavedPlugin.h:123
onnx_graphsurgeon.ir.node.Node.attrs
attrs
Definition: node.py:39
bert::QKVToContextInterleavedPlugin
Definition: qkvToContextInt8InterleavedPlugin.h:33
builder.BertConfig.intermediate_size
intermediate_size
Definition: builder.py:94
nvinfer1::plugin::MultilevelCropAndResize
Definition: multilevelCropAndResizePlugin.h:36
polygraphy.util.cuda.DeviceBuffer.allocated_nbytes
allocated_nbytes
Definition: cuda.py:124
bert::TFusedMultiHeadAttentionXMMAKernel::isValid
bool isValid(int s) const
Definition: fused_multihead_attention.h:253
generate.input
input
Definition: 05_folding_constants/generate.py:26
half_float::detail::builtin_isinf
bool builtin_isinf(T arg)
Check for infinity.
Definition: ieee_half.h:404
sample::anonymous_namespace{sampleInference.cpp}::makeThread
std::thread makeThread(const InferenceOptions &inference, InferenceEnvironment &iEnv, SyncStruct &sync, int thread, int streamsPerThread, int device, std::vector< InferenceTrace > &trace)
Definition: sampleInference.cpp:548
bert::Fused_multihead_attention_params_v2
Definition: fused_multihead_attention_v2.h:25
nvinfer1::plugin::RPROIPlugin::getOutputDataType
DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: nvFasterRCNNPlugin.cpp:255
BboxInfo::conf_score
T conf_score
Definition: bboxUtils.h:42
sample::Binding::dataType
nvinfer1::DataType dataType
Definition: sampleUtils.h:220
nvinfer1::QuantizationFlag::kCALIBRATE_BEFORE_FUSION
@ kCALIBRATE_BEFORE_FUSION
Run int8 calibration pass before layer fusion.
nvinfer1::plugin::BatchedNMSDynamicPlugin::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: batchedNMSPlugin.cpp:370
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer._get_amax
def _get_amax(self, inputs)
Definition: tensor_quantizer.py:258
SampleCharRNNParams::weightFileName
std::string weightFileName
Definition: sampleCharRNN.cpp:122
nvinfer1::plugin::ProposalPlugin::setPluginNamespace
void setPluginNamespace(const char *pluginNamespace) override
Set the namespace that this plugin object belongs to.
Definition: proposalPlugin.cpp:265
nvinfer1::plugin::GridAnchorPluginCreator
Definition: gridAnchorPlugin.h:127
util.test_cuda.ResizeTestCase
Definition: test_cuda.py:24
polygraphy.comparator.data_loader.DataLoader.iterations
iterations
Definition: comparator/data_loader.py:87
denoiser
Definition: denoiser.py:1
pytorch_quantization.utils.amp_wrapper.promote_function
def promote_function(fn)
Definition: amp_wrapper.py:28
nvinfer1::plugin::SplitPlugin::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: split.h:62
stft.STFT.win_length
win_length
Definition: stft.py:48
polygraphy.util.cuda.Stream
Definition: cuda.py:90
polygraphy.tools.args.logger.LoggerArgs.silent
silent
Definition: Polygraphy/polygraphy/tools/args/logger.py:33
pytorch_quantization.nn.modules.quant_pooling.QuantAdaptiveAvgPool1d
Definition: quant_pooling.py:117
nvinfer1::plugin::PyramidROIAlignPluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: pyramidROIAlignPlugin.h:124
nvinfer1::EnumMax< ErrorCode >
constexpr int32_t EnumMax< ErrorCode >()
Maximum number of elements in ErrorCode enum.
Definition: NvInferRuntimeCommon.h:1207
bert::fused_multihead_attention_v2_fp16_384_64_kernel_sm75_cubin_len
unsigned int fused_multihead_attention_v2_fp16_384_64_kernel_sm75_cubin_len
Definition: fused_multihead_attention_v2_fp16_384_64_kernel.sm75.cpp:4700
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer._fb_fake_quant
def _fb_fake_quant(self, inputs, amax)
Definition: tensor_quantizer.py:278
onnx_graphsurgeon.importers.onnx_importer.get_onnx_tensor_shape
List[int] get_onnx_tensor_shape(Union[onnx.ValueInfoProto, onnx.TensorProto] onnx_tensor)
Definition: onnx_importer.py:46
pytorch_quantization.tensor_quant.ScaledQuantDescriptor.__init__
def __init__(self, num_bits=8, name=None, **kwargs)
Definition: tensor_quant.py:71
nvinfer1::plugin::Region::mPluginNamespace
std::string mPluginNamespace
Definition: regionPlugin.h:99
test_exporters.TestOnnxExporter.test_export_constant_tensor_to_tensor_proto
def test_export_constant_tensor_to_tensor_proto(self)
Definition: test_exporters.py:34
nvinfer1::EnumMax< FillOperation >
constexpr int32_t EnumMax< FillOperation >()
Maximum number of elements in FillOperation enum.
Definition: NvInfer.h:4739
bert::Fused_multihead_attention_params_v2::packed_mask_stride_in_bytes
int64_t packed_mask_stride_in_bytes
Definition: fused_multihead_attention_v2.h:37
bert::fused_multihead_attention_fp16_64_64_kernel_sm75_cu_o
unsigned char fused_multihead_attention_fp16_64_64_kernel_sm75_cu_o[]
Definition: fused_multihead_attention_fp16_64_64_kernel.sm75.cpp:19
SampleConfig::addVerbosity
void addVerbosity()
Increase the Verbosity Level.
Definition: sampleConfig.h:117
nvinfer1::plugin::PyramidROIAlign::~PyramidROIAlign
~PyramidROIAlign() override=default
sample::loadEngine
ICudaEngine * loadEngine(const std::string &engine, int DLACore, std::ostream &err)
Load a serialized engine.
Definition: sampleEngines.cpp:632
tools.test_run.TestGen
Definition: test_run.py:34
TLTMaskRCNNConfig::POST_NMS_ROIS_INFERENCE
static const int POST_NMS_ROIS_INFERENCE
Definition: tlt_mrcnn_config.h:83
sample::Bindings::transferInputToDevice
void transferInputToDevice(TrtCudaStream &stream)
Definition: sampleUtils.h:339
half_float::detail::enable
SFINAE helper for generic half-precision functions.
Definition: ieee_half.h:372
util.test_cuda.ResizeTestCase.__init__
def __init__(self, old, old_size, new, new_size)
Definition: test_cuda.py:26
nvinfer1::plugin::ProposalLayer::attachToContext
void attachToContext(cudnnContext *cudnnContext, cublasContext *cublasContext, IGpuAllocator *gpuAllocator) override
Attach the plugin object to an execution context and grant the plugin the access to some context reso...
Definition: proposalLayerPlugin.cpp:362
polygraphy.util.cuda.DeviceBuffer.__init__
def __init__(self, shape=None, dtype=None)
Definition: cuda.py:114
nvinfer1::plugin::InstanceNormalizationPlugin::initialize
int initialize() override
Initialize the layer for execution.
Definition: instanceNormalizationPlugin.cpp:155
SampleMaskRCNN::infer
bool infer()
Definition: sampleUffMaskRCNN.cpp:422
OutputArgs::expectedPredictedMaxRatingItem
int32_t expectedPredictedMaxRatingItem
Definition: sampleMovieLensMPS.cpp:158
test_ir.TensorBaseTests.test_i_multiple_inputs
def test_i_multiple_inputs(self)
Definition: test_ir.py:86
nvonnxparser::createONNXConfig
IOnnxConfig * createONNXConfig()
pytorch_quantization.quant_modules.initialize
def initialize(float_module_list=None, custom_quant_modules=None)
Definition: quant_modules.py:110
polygraphy.util.cuda.Stream.synchronize
def synchronize(self)
Definition: cuda.py:99
pytorch_quantization.nn.modules.quant_conv._QuantConvNd.__init__
def __init__(self, in_channels, out_channels, kernel_size, stride, padding, dilation, transposed, output_padding, groups, bias, padding_mode, quant_desc_input, quant_desc_weight)
Definition: quant_conv.py:67
sample::gLogFatal
LogStreamConsumer gLogFatal
Definition: logger.cpp:27
nvcaffeparser1::CaffeParser::mNewPlugins
std::vector< nvinfer1::IPluginV2 * > mNewPlugins
Definition: caffeParser.h:90
polygraphy.tools.args.trt_legacy.TrtLegacyArgs.onnx_loader_args
onnx_loader_args
Definition: tools/args/trt_legacy.py:42
nvinfer1::plugin::ProposalPluginCreator::deserializePlugin
IPluginV2Ext * deserializePlugin(const char *name, const void *serialData, size_t serialLength) override
Called during deserialization of plugin layer.
Definition: proposalPlugin.cpp:434
polygraphy.tools.args.trt.loader.TrtLoaderArgs.int8
int8
Definition: tools/args/trt/loader.py:101
nvinfer1::plugin::ResizeNearest::~ResizeNearest
~ResizeNearest() override=default
Int8EntropyCalibrator2::getBatchSize
int getBatchSize() const override
Definition: EntropyCalibrator.h:111
onnx_graphsurgeon.logger.logger.Logger.register_callback
def register_callback(self, callback)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:123
tools.test_run.TestTrt.test_trt_save_load_engine
def test_trt_save_load_engine(self)
Definition: test_run.py:137
tests.quant_pooling_test.TestQuantAvgPool1d
Definition: quant_pooling_test.py:128
nvcaffeparser1::parsePReLU
nvinfer1::ILayer * parsePReLU(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &weightFactory, BlobNameToTensor &tensors)
Definition: parsePReLU.cpp:23
tests.integration_test.TestNetwork.test_quant_module_replacement
def test_quant_module_replacement(self)
Definition: integration_test.py:101
SampleMLPParams
The SampleMLPParams structure groups the additional parameters required by the MLP sample.
Definition: sampleMLP.cpp:45
SampleConfig::InputDataFormat::kPPM
@ kPPM
polygraphy.tools.args.trt.loader.TrtLoaderArgs.add_to_parser
def add_to_parser(self, parser)
Definition: tools/args/trt/loader.py:31
nvinfer1::QuantizationFlags
uint32_t QuantizationFlags
Represents a collection of one or more QuantizationFlag values using binary OR operations.
Definition: NvInfer.h:6345
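The binary-OR encoding described for QuantizationFlags can be illustrated with a minimal Python sketch. The `QuantizationFlag` class and the helper functions below are stand-ins defined only for this example. The enumerator value 0 and the rule that each flag occupies the bit at its enumerator value are assumptions, not details stated on this page.

```python
from enum import IntEnum


class QuantizationFlag(IntEnum):
    """Stand-in for nvinfer1::QuantizationFlag (enumerator value is assumed)."""
    CALIBRATE_BEFORE_FUSION = 0


UINT32_MASK = 0xFFFFFFFF  # QuantizationFlags is declared as uint32_t


def set_flag(flags: int, flag: QuantizationFlag) -> int:
    """Return the flags word with the bit for `flag` switched on."""
    return (flags | (1 << flag)) & UINT32_MASK


def clear_flag(flags: int, flag: QuantizationFlag) -> int:
    """Return the flags word with the bit for `flag` switched off."""
    return flags & ~(1 << flag) & UINT32_MASK


def get_flag(flags: int, flag: QuantizationFlag) -> bool:
    """Report whether the bit for `flag` is set."""
    return bool(flags & (1 << flag))


flags = set_flag(0, QuantizationFlag.CALIBRATE_BEFORE_FUSION)
```

Several flags would be combined by calling `set_flag` once per flag on the same word.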
nvinfer1::plugin::MultilevelProposeROI::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: multilevelProposeROIPlugin.cpp:236
polygraphy.backend.trt.loader.CreateConfig.strict_types
strict_types
Definition: backend/trt/loader.py:327
polygraphy.tools.util.script.Script.__init__
def __init__(self, summary=None)
Definition: script.py:127
nvinfer1::plugin::ReorgPluginCreator::deserializePlugin
IPluginV2Ext * deserializePlugin(const char *name, const void *serialData, size_t serialLength) override
Called during deserialization of plugin layer.
Definition: reorgPlugin.cpp:217
nvinfer1::plugin::PluginCreatorRegistry
Definition: InferPlugin.cpp:69
samplesCommon::TypedHostMemory::raw
ElemType * raw() noexcept
Definition: samples/common/common.h:348
tools.test_run.TestTrt.test_trt_layerwise_outputs
def test_trt_layerwise_outputs(self)
Definition: test_run.py:73
bert::EmbLayerNormVarSeqlenPlugin::initialize
int initialize() override
Initialize the layer for execution.
Definition: embLayerNormVarSeqlenPlugin.cpp:352
nvinfer1::ActivationType
ActivationType
Enumerates the types of activation to perform in an activation layer.
Definition: NvInferRuntimeCommon.h:108
nvinfer1::BuilderFlag::kFP16
@ kFP16
Enable FP16 layer selection, with FP32 fallback.
polygraphy.tools.surgeon.subtool.insert.Insert.__init__
def __init__(self)
Definition: insert.py:26
polygraphy.common.struct.TensorMetadata.add
def add(self, name, dtype, shape)
Definition: common/struct.py:26
nvinfer1::IPluginV2::getOutputDimensions
virtual Dims getOutputDimensions(int32_t index, const Dims *inputs, int32_t nbInputDims)=0
Get the dimension of an output tensor.
SampleConfig::mLabel
int64_t mLabel
Definition: sampleConfig.h:44
nvinfer1::ITensor::isNetworkInput
virtual bool isNetworkInput() const =0
Whether the tensor is a network input.
bert::EmbLayerNormVarSeqlenPlugin::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: embLayerNormVarSeqlenPlugin.cpp:342
nmtSample::Decoder::Decoder
Decoder()=default
pytorch_quantization.nn.modules.quant_pooling.QuantAdaptiveAvgPool3d.forward
def forward(self, input)
Definition: quant_pooling.py:148
pytorch_quantization.nn.modules.quant_conv.QuantConvTranspose2d.forward
def forward(self, input, output_size=None)
Definition: quant_conv.py:337
export_tacotron2_onnx.prenet_infer
def prenet_infer(self, x)
Definition: export_tacotron2_onnx.py:92
nvcaffeparser1::WeightType
WeightType
Definition: weightType.h:22
nmtSample::SLPEmbedder::~SLPEmbedder
~SLPEmbedder() override=default
polygraphy.tools.args.onnx.loader.OnnxLoaderArgs._shape_inference_default
_shape_inference_default
Definition: tools/args/onnx/loader.py:27
nvinfer1::IRefitter::getMissing
virtual int32_t getMissing(int32_t size, const char **layerNames, WeightsRole *roles)=0
Get description of missing weights.
model.Attention.__init__
def __init__(self, attention_rnn_dim, embedding_dim, attention_dim, attention_location_n_filters, attention_location_kernel_size)
Definition: demo/Tacotron2/tacotron2/model.py:49
polygraphy.backend.trt.loader.BaseNetworkFromOnnx.__init__
def __init__(self, explicit_precision, explicit_batch=None)
Definition: backend/trt/loader.py:95
nvcaffeparser1::CaffeWeightFactory::convert
void convert(nvinfer1::Weights &weights, nvinfer1::DataType targetType)
Definition: caffeWeightFactory.cpp:160
test_exporters.TestOnnxExporter.test_export_graph
def test_export_graph(self, model)
Definition: test_exporters.py:127
polygraphy.comparator.struct.IterationResult.runtime
runtime
Definition: comparator/struct.py:45
samplesCommon::GenericBuffer::data
void * data()
Returns pointer to underlying array.
Definition: buffers.h:112
nvinfer1::plugin::BatchTilePlugin::setPluginNamespace
void setPluginNamespace(const char *libNamespace) override
Set the namespace that this plugin object belongs to.
Definition: batchTilePlugin.cpp:173
tests.fixtures.models.QuantLeNet.conv2
conv2
Definition: tools/pytorch-quantization/tests/fixtures/models.py:48
loss_function.Tacotron2Loss.forward
def forward(self, model_output, targets)
Definition: tacotron2/loss_function.py:24
bert::param_type
constexpr auto param_type
Definition: skipLayerNormInt8InterleavedPlugin.cpp:51
sample::Logger::reportPass
static int reportPass(const TestAtom &testAtom)
Definition: samples/common/logging.h:347
SampleUffFasterRcnnParams::inputHeight
int inputHeight
Definition: sampleUffFasterRCNN.cpp:69
tests.quant_pooling_test.TestQuantMaxPool1d
Definition: quant_pooling_test.py:38
nvonnxparser
The TensorRT ONNX parser API namespace.
Definition: NvOnnxConfig.h:22
inference_trt.main
def main()
Definition: inference_trt.py:288
helpers.calibrator.BertCalibrator.device_inputs
device_inputs
Definition: demo/BERT/helpers/calibrator.py:46
nvinfer1::PluginField::length
int32_t length
Number of data entries in the Plugin attribute.
Definition: NvInferRuntimeCommon.h:865
nvinfer1::LayerType::kLOOP_OUTPUT
@ kLOOP_OUTPUT
Loop output layer.
copyright-scan.parse_args
def parse_args(parser)
Definition: copyright-scan.py:99
nvcaffeparser1::CaffeParser::mPluginRegistry
std::unordered_map< std::string, nvinfer1::IPluginCreator * > mPluginRegistry
Definition: caffeParser.h:91
SampleINT8API::SampleUniquePtr
std::unique_ptr< T, samplesCommon::InferDeleter > SampleUniquePtr
Definition: sampleINT8API.cpp:80
nvinfer1::plugin::NormalizePluginCreator::~NormalizePluginCreator
~NormalizePluginCreator() override=default
nmtSample::SLPAttention::getAttentionSize
int getAttentionSize() override
get the size of the attention vector
Definition: slpAttention.cpp:68
SampleCharRNNParams::weightNames
SampleCharRNNWeightNames weightNames
Definition: sampleCharRNN.cpp:128
nvinfer1::plugin::MultilevelProposeROI::mPreNMSTopK
int mPreNMSTopK
Definition: multilevelProposeROIPlugin.h:95
nvinfer1::INetworkDefinition::addMatrixMultiply
virtual IMatrixMultiplyLayer * addMatrixMultiply(ITensor &input0, MatrixOperation op0, ITensor &input1, MatrixOperation op1)=0
Add a MatrixMultiply layer to the network.
SampleINT8APIPreprocessing::inputDims
std::vector< int > inputDims
Definition: sampleINT8API.cpp:48
polygraphy.tools.args.tf.loader.TfLoaderArgs.__init__
def __init__(self, tftrt=True, artifacts=True, outputs=True)
Definition: tools/args/tf/loader.py:23
samplesCommon::GenericBuffer::resize
void resize(size_t newSize)
Resizes the buffer.
Definition: buffers.h:144
nvinfer1::EnumMax< TensorLocation >
constexpr int32_t EnumMax< TensorLocation >()
Maximum number of elements in TensorLocation enum.
Definition: NvInferRuntimeCommon.h:1023
half_float::detail::abs
half abs(half arg)
Absolute value.
Definition: ieee_half.h:2228
tests.calibrator_test.TestEntropyCalibrator.test_two_tensor
def test_two_tensor(self, verbose)
Definition: calibrator_test.py:171
polygraphy.backend.trt_legacy.ConvertToUff.__call__
def __call__(self)
Definition: backend/trt_legacy.py:54
bert::FusedMultiHeadAttentionKernelMetaInfoV2::mSM
unsigned int mSM
Definition: fused_multihead_attention_v2.h:163
tests.quant_rnn_test.TestEpilogue
Definition: quant_rnn_test.py:512
onnx_models.Model.path
path
Definition: onnx_models.py:36
samplesCommon::readASCIIFile
bool readASCIIFile(const std::string &fileName, const size_t size, std::vector< T > &out)
Definition: samples/common/common.h:451
SharedMemory::mModelStreamFd
const char * mModelStreamFd
Definition: sampleMovieLensMPS.cpp:151
tests.fixtures.models.QuantLeNet.forward
def forward(self, x)
Definition: tools/pytorch-quantization/tests/fixtures/models.py:52
nvinfer1::plugin::GenerateDetection::isOutputBroadcastAcrossBatch
bool isOutputBroadcastAcrossBatch(int outputIndex, const bool *inputIsBroadcasted, int nbInputs) const override
Definition: generateDetectionPlugin.cpp:303
onnx_graphsurgeon.ir.tensor.Tensor.to_constant
def to_constant(self, np.ndarray values)
Definition: tensor.py:59
polygraphy.backend.onnx.runner.OnnxTfRunner.__init__
def __init__(self, model, name=None)
Definition: backend/onnx/runner.py:29
nvinfer1::IPluginCreator::getPluginName
virtual const char * getPluginName() const =0
Return the plugin name.
sample::anonymous_namespace{sampleInference.cpp}::SyncStruct::cpuStart
TimePoint cpuStart
Definition: sampleInference.cpp:171
polygraphy.backend.onnxrt.loader.SessionFromOnnxBytes
Definition: backend/onnxrt/loader.py:22
polygraphy.backend.common.loader.BytesFromPath.__init__
def __init__(self, path)
Definition: backend/common/loader.py:20
UffPoolPluginV2::UffPoolPluginV2
UffPoolPluginV2(const PluginFieldCollection &fc)
Definition: sampleUffPluginV2Ext.cpp:340
onnx_graphsurgeon.logger.logger
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:1
nvinfer1::IInt8LegacyCalibrator
Legacy calibrator left for backward compatibility with TensorRT 2.0.
Definition: NvInfer.h:6108
onnx_graphsurgeon.ir.graph.Graph.toposort
def toposort(self)
Definition: graph.py:254
nvinfer1::plugin::BatchedNMSDynamicPlugin::getOutputDataType
nvinfer1::DataType getOutputDataType(int index, const nvinfer1::DataType *inputType, int nbInputs) const override
Definition: batchedNMSPlugin.cpp:439
nvinfer1::plugin::SplitPlugin::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: split.h:120
text.numbers._expand_dollars
def _expand_dollars(m)
Definition: numbers.py:39
tests.calibrator_test.TestMSECalibrator.test_repr
def test_repr(self)
Definition: calibrator_test.py:245
half_float::detail::functions::isnotequal
static bool isnotequal(half x, half y)
Comparison implementation.
Definition: samples/common/half.h:2398
nvinfer1::plugin::BatchedNMSPluginCreator::deserializePlugin
IPluginV2Ext * deserializePlugin(const char *name, const void *serialData, size_t serialLength) override
Called during deserialization of plugin layer.
Definition: batchedNMSPlugin.cpp:627
nvinfer1::plugin::PriorBox::detachFromContext
void detachFromContext() override
Detach the plugin object from its execution context.
Definition: priorBoxPlugin.cpp:343
SamplePlugin::mEngine
std::shared_ptr< nvinfer1::ICudaEngine > mEngine
The TensorRT engine used to run the network.
Definition: samplePlugin.cpp:107
samplesCommon::HostBuffer
GenericBuffer< HostAllocator, HostFree > HostBuffer
Definition: buffers.h:217
nvinfer1::plugin::Normalize::configurePlugin
void configurePlugin(const Dims *inputDims, int nbInputs, const Dims *outputDims, int nbOutputs, const DataType *inputTypes, const DataType *outputTypes, const bool *inputIsBroadcast, const bool *outputIsBroadcast, PluginFormat floatFormat, int maxBatchSize) override
Definition: normalizePlugin.cpp:196
tests.fixtures.models.LeNet.fc2
fc2
Definition: tools/pytorch-quantization/tests/fixtures/models.py:32
tests.fixtures
Definition: tools/pytorch-quantization/tests/fixtures/__init__.py:1
SampleINT8API::infer
sample::Logger::TestResult infer()
Runs the TensorRT inference engine for this sample.
Definition: sampleINT8API.cpp:590
nvinfer1::plugin::PriorBox::serializeFromDevice
void serializeFromDevice(char *&hostBuffer, Weights deviceWeights) const
Definition: priorBoxPlugin.cpp:222
pytorch_quantization.calib.histogram.HistogramCalibrator._calib_bin_edges
_calib_bin_edges
Definition: histogram.py:49
polygraphy.comparator.compare.OutputCompareResult.__str__
def __str__(self)
Definition: compare.py:53
polygraphy.tools.args.trt.loader.TrtLoaderArgs._config
_config
Definition: tools/args/trt/loader.py:26
sample::InferenceTime::e2e
float e2e
Definition: sampleReporting.h:56
polygraphy.tools.inspect.inspect.Inspect
Definition: inspect.py:20
nvinfer1::IConvolutionLayer
A convolution layer in a network definition.
Definition: NvInfer.h:1199
nvinfer1::plugin::throwCudaError
void throwCudaError(const char *file, const char *function, int line, int status, const char *msg)
Definition: checkMacrosPlugin.cpp:54
tests.quant_conv_test.TestQuantConv2D.test_fake_quant_per_channel
def test_fake_quant_per_channel(self)
Definition: quant_conv_test.py:145
nvinfer1::IPluginExt
Plugin class for user-implemented layers.
Definition: NvInferRuntime.h:223
TLTMaskRCNNConfig::DETECTION_MAX_INSTANCES
static const int DETECTION_MAX_INSTANCES
Definition: tlt_mrcnn_config.h:40
nmtSample::SLPProjection::getInfo
std::string getInfo() override
get the textual description of the component
Definition: slpProjection.cpp:64
nmtSample::DeviceBuffer::~DeviceBuffer
virtual ~DeviceBuffer()
Definition: deviceBuffer.h:38
sample::defaultDevice
constexpr int defaultDevice
Definition: sampleOptions.h:41
nvinfer1::plugin::DetectionLayer::DetectionLayer
DetectionLayer(int num_classes, int keep_topk, float score_threshold, float iou_threshold)
Definition: detectionLayerPlugin.cpp:97
nvinfer1::CUDADriverWrapper::_cuLaunchKernel
CUresult(* _cuLaunchKernel)(CUfunction f, unsigned int gridDimX, unsigned int gridDimY, unsigned int gridDimZ, unsigned int blockDimX, unsigned int blockDimY, unsigned int blockDimZ, unsigned int sharedMemBytes, CUstream hStream, void **kernelParams, void **extra)
Definition: cudaDriverWrapper.h:83
model.Tacotron2.parse_output
def parse_output(self, outputs, output_lengths)
Definition: demo/Tacotron2/tacotron2/model.py:635
polygraphy.backend.onnxrt.runner.OnnxrtRunner.activate_impl
def activate_impl(self)
Definition: backend/onnxrt/runner.py:39
nvonnxparser::IParser::clearErrors
virtual void clearErrors()=0
Clear errors from prior calls to parse.
nvinfer1::plugin::GroupNormalizationPlugin::mEpsilon
float mEpsilon
Definition: groupNormalizationPlugin.h:98
data_function.TextMelLoader.audiopaths_and_text
audiopaths_and_text
Definition: tacotron2/data_function.py:33
nvinfer1::plugin::SplitPlugin::clone
nvinfer1::IPluginV2DynamicExt * clone() const override
Clone the plugin object.
Definition: split.h:98
nvinfer1::plugin::ResizeNearestPluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: resizeNearestPlugin.cpp:57
nvinfer1::plugin::GroupNormalizationPlugin::clone
nvinfer1::IPluginV2DynamicExt * clone() const override
Clone the plugin object.
Definition: groupNormalizationPlugin.cpp:193
bert::Fused_multihead_attention_params::d
int d
Definition: fused_multihead_attention.h:79
tests.optim_helper_test.TestGroupParameters.test_single_key
def test_single_key(self, resnet18)
Definition: optim_helper_test.py:53
nvcaffeparser1::parseSoftMax
nvinfer1::ILayer * parseSoftMax(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &, BlobNameToTensor &tensors)
Definition: parseSoftMax.cpp:23
nvinfer1::plugin::NormalizePluginCreator::mChannelShared
bool mChannelShared
Definition: normalizePlugin.h:127
polygraphy.tools.args.trt.loader.TrtLoaderArgs.model_args
model_args
Definition: tools/args/trt/loader.py:71
nvinfer1::PluginField
Structure containing plugin attribute field names and associated data. This information can be parsed ...
Definition: NvInferRuntimeCommon.h:846
PoolParameters::mS
int mS
Definition: sampleUffPluginV2Ext.cpp:179
nvinfer1::PaddingMode::kSAME_UPPER
@ kSAME_UPPER
Use SAME padding, with prePadding <= postPadding.
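As a worked example of the prePadding <= postPadding rule, the sketch below computes SAME padding for a single spatial dimension in Python. It assumes dilation 1 and the common convention that the output size is ceil(input / stride). The function name `same_upper_padding` is hypothetical, and the arithmetic has not been checked against the TensorRT implementation.

```python
import math


def same_upper_padding(input_size: int, kernel: int, stride: int = 1):
    """Return (pre_padding, post_padding) for one spatial dimension.

    Assumes dilation 1 and output_size = ceil(input_size / stride).
    """
    output_size = math.ceil(input_size / stride)
    # Total padding needed so the last window still fits inside the input.
    total = max((output_size - 1) * stride + kernel - input_size, 0)
    pre = total // 2      # the smaller half goes in front
    post = total - pre    # any odd leftover element goes at the end
    return pre, post


print(same_upper_padding(5, kernel=2, stride=2))
print(same_upper_padding(7, kernel=3, stride=2))
```

With an odd total, the extra element lands in the post-padding, which is what keeps prePadding no larger than postPadding.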
nvinfer1::IPluginV2DynamicExt::int32_t
const Dims int32_t
Definition: NvInferRuntime.h:554
nmtSample::LikelihoodCombinationOperator
Definition: likelihoodCombinationOperator.h:24
pytorch_quantization.calib.calibrator._Calibrator.__init__
def __init__(self, num_bits, axis, unsigned)
Definition: tools/pytorch-quantization/pytorch_quantization/calib/calibrator.py:31
nvinfer1::plugin::BatchedNMSPlugin::numPriors
int numPriors
Definition: batchedNMSPlugin.h:70
Batch::mStream
cudaStream_t mStream
Definition: sampleMovieLensMPS.cpp:247
tests.quant_utils_test.TestQuantUtils.test_reduce_amax
def test_reduce_amax(self)
Definition: quant_utils_test.py:32
polygraphy.logger.logger.Logger.root_dir
root_dir
Definition: Polygraphy/polygraphy/logger/logger.py:128
nvinfer1::plugin::GroupNormalizationPluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: groupNormalizationPlugin.cpp:270
nvinfer1::plugin::RPROIPlugin::initialize
int initialize() override
Initialize the layer for execution.
Definition: nvFasterRCNNPlugin.cpp:117
infer.DeviceBuffer
Definition: infer.py:32
nvinfer1::IConcatenationLayer::~IConcatenationLayer
virtual ~IConcatenationLayer()
Definition: NvInfer.h:2251
nvinfer1::plugin::PriorBoxPluginCreator::deserializePlugin
IPluginV2Ext * deserializePlugin(const char *name, const void *serialData, size_t serialLength) override
Called during deserialization of plugin layer.
Definition: priorBoxPlugin.cpp:482
nvinfer1::plugin::CoordConvACPlugin::destroy
void destroy() override
Destroy the plugin object.
Definition: coordConvACPlugin.cpp:139
nvinfer1::plugin::FlattenConcat::destroy
void destroy() override
Destroy the plugin object.
Definition: flattenConcat.cpp:279
models.meta.check_empty_tensor_expand
def check_empty_tensor_expand(runner, shapes)
Definition: meta.py:67
nvinfer1::plugin::FlattenConcatPluginCreator::createPlugin
IPluginV2Ext * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: flattenConcat.cpp:316
pytorch_quantization.nn.modules._utils.QuantMixin.set_default_quant_desc_input
def set_default_quant_desc_input(cls, value)
Definition: _utils.py:37
nvinfer1::plugin::BatchTilePlugin::mLayerName
const std::string mLayerName
Definition: batchTilePlugin.h:79
inference_c.question_features
def question_features(tokens, question)
Definition: inference_c.py:117
bert::xmmasM384
constexpr size_t xmmasM384
Definition: embLayerNormVarSeqlenPlugin.cpp:39
polygraphy.backend.tf.util.map_node_outputs
def map_node_outputs(graphdef)
Definition: tf/util.py:56
bert::SkipLayerNormInterleavedPlugin::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: skipLayerNormInt8InterleavedPlugin.cpp:239
SampleAlgorithmSelector::teardown
bool teardown()
Used to clean up any state created in the sample class.
Definition: sampleAlgorithmSelector.cpp:615
util.test_cuda.TestDeviceBuffer.test_device_buffer_free
def test_device_buffer_free(self)
Definition: test_cuda.py:79
polygraphy.backend.tf.loader.SaveGraph
Definition: backend/tf/loader.py:307
nvinfer1::plugin::GridAnchorRectPluginCreator::GridAnchorRectPluginCreator
GridAnchorRectPluginCreator()
Definition: gridAnchorPlugin.cpp:502
SampleUffFasterRcnnParams::inputChannels
int inputChannels
Definition: sampleUffFasterRCNN.cpp:68
pytorch_quantization.nn.modules.quant_pooling.QuantAvgPool2d
Definition: quant_pooling.py:89
sample::ModelFormat::kCAFFE
@ kCAFFE
polygraphy.backend.trt_legacy.TrtLegacyRunner.__init__
def __init__(self, network_loader=None, max_workspace_size=None, max_batch_size=None, fp16=None, tf32=None, load_engine=None, save_engine=None, layerwise=False, plugins=[], name=None)
Definition: backend/trt_legacy.py:187
sample::printPerformanceReport
void printPerformanceReport(const std::vector< InferenceTrace > &trace, const ReportingOptions &reporting, float warmupMs, int queries, std::ostream &os)
Print and summarize a timing trace.
Definition: sampleReporting.cpp:195
Logger::TestAtom
Opaque handle that holds logging information for a particular test.
Definition: pluginLogging.h:287
onnx_models.identity_model
def identity_model()
Definition: onnx_models.py:92
nvinfer1::IRefitter::destroy
virtual void destroy()=0
nvinfer1::plugin::RegionPluginCreator::createPlugin
IPluginV2Ext * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: regionPlugin.cpp:494
nvinfer1::plugin::InstanceNormalizationPlugin::_d_scale
float * _d_scale
Definition: instanceNormalizationPlugin.h:94
nmtSample::SoftmaxLikelihood
calculates softmax likelihood and TopK indices for the raw input logits
Definition: softmaxLikelihood.h:30
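The computation named in this brief, softmax followed by TopK selection over raw logits, can be sketched in plain Python. This shows only the underlying math under simple assumptions (a single flat list of logits). The function name `softmax_top_k` is hypothetical and does not reflect how the sample builds the operation.

```python
import math


def softmax_top_k(logits, k):
    """Return (likelihoods, indices) for the k largest softmax probabilities."""
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Indices sorted by descending probability, truncated to the top k.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    return [probs[i] for i in order], order


likelihoods, indices = softmax_top_k([1.0, 3.0, 2.0], k=2)
```

Here `indices` identifies the two most likely candidates and `likelihoods` holds their softmax probabilities in the same order.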
nvinfer1::plugin::SpecialSlicePluginCreator::SpecialSlicePluginCreator
SpecialSlicePluginCreator()
Definition: specialSlicePlugin.cpp:35
nvinfer1::plugin::ProposalPluginCreator::createPlugin
IPluginV2Ext * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: proposalPlugin.cpp:349
pytorch_quantization.nn.modules.quant_rnn.QuantRNNBase
Definition: modules/quant_rnn.py:39
nvinfer1::plugin::detail::find_divisor
void find_divisor(int denom, unsigned int &mul_coeff, unsigned int &shift_coeff)
Definition: reducedMathPlugin.cpp:49
nvinfer1::plugin::BatchTilePluginCreator::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin creator object.
Definition: batchTilePlugin.h:104
sample::anonymous_namespace{sampleInference.cpp}::inferenceExecution
void inferenceExecution(const InferenceOptions &inference, InferenceEnvironment &iEnv, SyncStruct &sync, int offset, int streams, int device, std::vector< InferenceTrace > &trace)
Definition: sampleInference.cpp:507
onnx_graphsurgeon.ir.node.Node.__eq__
def __eq__(self, other)
Definition: node.py:116
model.Prenet.__init__
def __init__(self, in_dim, sizes)
Definition: demo/Tacotron2/tacotron2/model.py:109
bert::fused_multihead_attention_v2_int8_128_64_kernel_sm80_cubin_len
unsigned int fused_multihead_attention_v2_int8_128_64_kernel_sm80_cubin_len
Definition: fused_multihead_attention_v2_int8_128_64_kernel.sm80.cpp:14791
export_tacotron2_onnx.DecoderIter.p_decoder_dropout
p_decoder_dropout
Definition: export_tacotron2_onnx.py:111
pytorch_quantization.nn.modules.quant_rnn.QuantRNNCell.forward
def forward(self, input, hx=None)
Definition: modules/quant_rnn.py:361
tests.optim_helper_test.TestMatchParameters.test_multi_keys
def test_multi_keys(self, resnet18)
Definition: optim_helper_test.py:39
nvinfer1::plugin::CropAndResizePlugin::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: cropAndResizePlugin.cpp:147
polygraphy.tools.util.script.Script.suffix
suffix
Definition: script.py:142
polygraphy.backend.base.runner.BaseRunner.is_active
is_active
Definition: backend/base/runner.py:47
SampleUffPluginV2Ext::mEngine
SampleUniquePtr< nvinfer1::ICudaEngine > mEngine
Definition: sampleUffPluginV2Ext.cpp:333
polygraphy.backend.tf.loader.UseTfTrt
Definition: backend/tf/loader.py:212
bert::TFusedMultiHeadAttentionXMMAKernel::TFusedMultiHeadAttentionXMMAKernel
TFusedMultiHeadAttentionXMMAKernel(const TKernelMeta *pMetaStart, unsigned int nMetaCount, Data_type type, unsigned int sm)
Definition: fused_multihead_attention.h:203
bert::launch_large
void launch_large(cudaStream_t stream, const int ld, const int total, const int8_t *input, const int8_t *skip, const half *beta, const half *gamma, int8_t *output, const float dqScaleIn, const float dqScaleSkip, const float qScale)
SampleINT8APIArgs::dynamicRangeFileName
std::string dynamicRangeFileName
Definition: sampleINT8API.cpp:654
model.WN.cond_layers
cond_layers
Definition: demo/Tacotron2/waveglow/model.py:97
nvinfer1::ILogger::Severity::kVERBOSE
@ kVERBOSE
Verbose messages with debugging information.
bert::fused_multihead_attention_v2_int8_192_64_kernel_sm80_cubin
unsigned char fused_multihead_attention_v2_int8_192_64_kernel_sm80_cubin[]
Definition: fused_multihead_attention_v2_int8_192_64_kernel.sm80.cpp:19
export_tacotron2_onnx.Encoder.tacotron2
tacotron2
Definition: export_tacotron2_onnx.py:66
polygraphy.tools.args.data_loader.DataLoaderArgs._read
_read
Definition: tools/args/data_loader.py:24
nvinfer1::plugin::MultilevelProposeROI::mTempBboxes
std::vector< std::shared_ptr< CudaBind< float > > > mTempBboxes
Definition: multilevelProposeROIPlugin.h:108
sample::defaultDuration
constexpr int defaultDuration
Definition: sampleOptions.h:48
nvinfer1::utility::parseArgs
TRTParsedArgs parseArgs(int argc, const char *const *argv, const std::vector< TRTOption > &options)
parseArgs parses an argument list and returns a TRTParsedArgs with the fields set accordingly.
Definition: getOptions.cpp:142
tests.quant_conv_test.TestQuantConv1D.test_fake_quant_per_tensor
def test_fake_quant_per_tensor(self)
Definition: quant_conv_test.py:344
nvinfer1::plugin::DetectionOutput::attachToContext
void attachToContext(cudnnContext *cudnnContext, cublasContext *cublasContext, IGpuAllocator *gpuAllocator) override
Attach the plugin object to an execution context and grant the plugin the access to some context reso...
Definition: nmsPlugin.cpp:251
nvinfer1::plugin::SpecialSlicePluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: specialSlicePlugin.cpp:47
nmtSample::Vocabulary::mSosStr
static const std::string mSosStr
Definition: vocabulary.h:68
LogStreamConsumerBuffer::putOutput
void putOutput()
Definition: pluginLogging.h:68
anonymous_namespace{serialize.hpp}::Serializer
Definition: serialize.hpp:39
nvinfer1::plugin::DetectionLayerPluginCreator::DetectionLayerPluginCreator
DetectionLayerPluginCreator()
Definition: detectionLayerPlugin.cpp:35
nvinfer1::plugin::GridAnchorGenerator::getOutputDataType
DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: gridAnchorPlugin.cpp:302
ParserOnnxConfig::setTextFileName
virtual void setTextFileName(const char *textFilename)
Set the File Name of the Network Description as a Text File.
Definition: parserOnnxConfig.h:116
nmtSample::Component
a functional part of the sample
Definition: component.h:30
sample::TrtCudaStream::TrtCudaStream
TrtCudaStream()
Definition: sampleDevice.h:60
tests.fixtures.models
Definition: tools/pytorch-quantization/tests/fixtures/models.py:1
nvinfer1::PluginTensorDesc::format
TensorFormat format
Definition: NvInferRuntimeCommon.h:371
sample::UffInput::parse
void parse(Arguments &arguments) override
Definition: sampleOptions.cpp:422
builder_varseqlen.onnx_to_trt_name
def onnx_to_trt_name(onnx_name)
Definition: builder_varseqlen.py:418
MultilevelProposeROIWorkSpace::sortClassPosDims
nvinfer1::DimsHW sortClassPosDims
Definition: maskRCNNKernels.h:171
polygraphy.util.cuda.DeviceBuffer.allocate
def allocate(self, nbytes)
Definition: cuda.py:133
sample::DeviceDeallocator
Definition: sampleDevice.h:281
pytorch_quantization.quant_modules.QuantModuleReplacementHelper.__init__
def __init__(self)
Definition: quant_modules.py:53
polygraphy.tools.args.tf2onnx.loader.Tf2OnnxLoaderArgs.fold_constant
fold_constant
Definition: tools/args/tf2onnx/loader.py:42
backend.trt.test_loader.TestSerializedEngineLoader.test_serialized_engine_loader_from_buffer
def test_serialized_engine_loader_from_buffer(self, identity_engine)
Definition: trt/test_loader.py:83
nvinfer1::plugin::SpecialSlice::destroy
void destroy() override
Destroy the plugin object.
Definition: specialSlicePlugin.h:53
half_float::detail::functions::read
static std::basic_istream< charT, traits > & read(std::basic_istream< charT, traits > &in, half &arg)
Input implementation.
Definition: samples/common/half.h:1636
sample::TrtCudaStream::synchronize
void synchronize()
Definition: sampleDevice.h:83
polygraphy.backend.tf.loader.OptimizeGraph.constfold
def constfold(self, graphdef, output_names)
Definition: backend/tf/loader.py:39
tests.quant_conv_transposed_test.TestQuantConvTranspose1D.test_fake_quant_per_channel
def test_fake_quant_per_channel(self)
Definition: quant_conv_transposed_test.py:428
nvinfer1::plugin::DetectionOutput::getOutputDataType
DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: nmsPlugin.cpp:188
SampleMNIST
The SampleMNIST class implements the MNIST sample.
Definition: sampleMNIST.cpp:48
sample::Bindings::mBindings
std::vector< Binding > mBindings
Definition: sampleUtils.h:452
nvinfer1::utility::TRTParsedArgs::errMsg
std::string errMsg
Definition: getOptions.h:59
UffPoolPluginV2::mNamespace
std::string mNamespace
Definition: sampleUffPluginV2Ext.cpp:642
bert::Fused_multihead_attention_params::o_stride_in_bytes
int64_t o_stride_in_bytes
Definition: fused_multihead_attention.h:62
MaskRCNNUtils::BBoxInfo::label
int label
Definition: sampleUffMaskRCNN.cpp:65
nvinfer1::plugin::MultilevelProposeROIPluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: multilevelProposeROIPlugin.cpp:64
PoolParameters::pType
PoolingType pType
Definition: sampleUffPluginV2Ext.cpp:185
EntropyCalibratorImpl::mReadCache
bool mReadCache
Definition: EntropyCalibrator.h:91
SampleINT8Params::networkName
std::string networkName
The name of the network.
Definition: sampleINT8.cpp:51
nvinfer1::ReduceOperation::kAVG
@ kAVG
polygraphy.backend.trt.loader.Profile
Definition: backend/trt/loader.py:258
onnx_graphsurgeon.ir.graph.UnseenTensor.__call__
def __call__(self, tensor)
Definition: graph.py:34
nvinfer1::PluginFieldType::kUNKNOWN
@ kUNKNOWN
backend.onnx.test_loader.TestLoggerCallbacks
Definition: onnx/test_loader.py:29
Int8EntropyCalibrator2
Implements Entropy calibrator 2. CalibrationAlgoType is kENTROPY_CALIBRATION_2.
Definition: EntropyCalibrator.h:102
sample::anonymous_namespace{sampleOptions.cpp}::processShapes
void processShapes(std::unordered_map< std::string, ShapeRange > &shapes, bool minShapes, bool optShapes, bool maxShapes, bool calib)
Definition: sampleOptions.cpp:274
SampleINT8API::mEngine
std::shared_ptr< nvinfer1::ICudaEngine > mEngine
The TensorRT engine used to run the network.
Definition: sampleINT8API.cpp:106
config
Definition: samples/opensource/sampleUffFasterRCNN/config.py:1
tests.tensor_quantizer_test.TestTensorQuantizer.test_disable
def test_disable(self)
Definition: tensor_quantizer_test.py:132
tests.tensor_quant_test.TestQuantDescriptor.test_scaled_mode
def test_scaled_mode(self)
Definition: tensor_quant_test.py:271
SampleUffSSD::mPPMs
std::vector< samplesCommon::PPM< 3, 300, 300 > > mPPMs
PPMs of test images.
Definition: sampleUffSSD.cpp:95
LogStreamConsumerBuffer::LogStreamConsumerBuffer
LogStreamConsumerBuffer(LogStreamConsumerBuffer &&other)
Definition: pluginLogging.h:42
nvinfer1::PluginType::kPERMUTE
@ kPERMUTE
Permute plugin.
SampleReformatFreeIO::SampleReformatFreeIO
SampleReformatFreeIO(const samplesCommon::CaffeSampleParams &params)
Definition: sampleReformatFreeIO.cpp:210
sample::SystemOptions::plugins
std::vector< std::string > plugins
Definition: sampleOptions.h:139
polygraphy.tools.args.tf.runner.TfRunnerArgs.add_to_script
def add_to_script(self, script)
Definition: tools/args/tf/runner.py:46
nvinfer1::plugin::InstanceNormalizationPluginCreator::~InstanceNormalizationPluginCreator
~InstanceNormalizationPluginCreator() override=default
pytorch_quantization.tensor_quant
Definition: tensor_quant.py:1
anonymous_namespace{regionPlugin.cpp}::allocateChunk
void allocateChunk(T *&ptr, int count)
Definition: regionPlugin.cpp:41
nvinfer1::ElementWiseOperation
ElementWiseOperation
Enumerates the binary operations that may be performed by an ElementWise layer.
Definition: NvInfer.h:2605
anonymous_namespace{split.h}::SPLIT_PLUGIN_NAME
constexpr const char * SPLIT_PLUGIN_NAME
Definition: split.h:31
nvinfer1::IShuffleLayer
Layer type for shuffling data.
Definition: NvInfer.h:3749
onnx_graphsurgeon.ir.graph.Graph.opset
opset
Definition: graph.py:122
SampleMNIST::verifyOutput
bool verifyOutput(const samplesCommon::BufferManager &buffers, const std::string &outputTensorName, int groundTruthDigit) const
Verifies that the output is correct and prints it.
Definition: sampleMNIST.cpp:206
SampleCharRNNParams::saveEngine
std::string saveEngine
Definition: sampleCharRNN.cpp:124
onnx_graphsurgeon.ir.graph.Graph.node_ids
def node_ids(self)
Definition: graph.py:158
SampleCharRNNBase::teardown
bool teardown()
Used to clean up any state created in the sample class.
Definition: sampleCharRNN.cpp:948
pytorch_quantization.nn.modules.quant_rnn.QuantRNNBase.__setstate__
def __setstate__(self, d)
Definition: modules/quant_rnn.py:244
sample::BuildOptions::refittable
bool refittable
Definition: sampleOptions.h:113
pytorch_quantization.nn.modules._utils.QuantMixin.input_quantizer
def input_quantizer(self)
Definition: _utils.py:88
bert::TFusedMultiHeadAttentionXMMAKernel::mFunctions
std::unordered_map< uint64_t, FusedMultiHeadAttentionKernelInfo > mFunctions
Definition: fused_multihead_attention.h:287
polygraphy.tools.util.script.Script.invoke
def invoke(type_str, *args, **kwargs)
Definition: script.py:85
nvinfer1::FillOperation::kRANDOM_UNIFORM
@ kRANDOM_UNIFORM
Generate a tensor with random values drawn from a uniform distribution.
bert::DATA_TYPE_INT8
@ DATA_TYPE_INT8
Definition: fused_multihead_attention_common.h:30
nvinfer1::plugin::GenerateDetection::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: generateDetectionPlugin.cpp:153
bert::QKVToContextInterleavedPlugin::mHeadSize
int mHeadSize
Definition: qkvToContextInt8InterleavedPlugin.h:84
test_ir.TestGraph.test_fold_constants_no_foldable_constants
def test_fold_constants_no_foldable_constants(self)
Definition: test_ir.py:759
bert::QKVToContextInterleavedPlugin::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: qkvToContextInt8InterleavedPlugin.cpp:216
nvinfer1::plugin::DetectionOutput::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: nmsPlugin.cpp:182
sample::TrtCudaEvent::~TrtCudaEvent
~TrtCudaEvent()
Definition: sampleDevice.h:124
SampleDynamicReshape::prepare
bool prepare()
Prepares the model for inference by creating execution contexts and allocating buffers.
Definition: sampleDynamicReshape.cpp:291
nvinfer1::IExecutionContext::enqueueV2
virtual bool enqueueV2(void **bindings, cudaStream_t stream, cudaEvent_t *inputConsumed) noexcept=0
Asynchronously execute inference.
nmtSample::SLPEmbedder
selects the embedding vector from the weight matrix using the index provided in the input
Definition: slpEmbedder.h:36
polygraphy.util.misc.compress
def compress(obj)
Definition: Polygraphy/polygraphy/util/misc.py:187
SampleINT8Params::nbCalBatches
int nbCalBatches
The number of batches for calibration.
Definition: sampleINT8.cpp:49
builder.get_mha_dtype
def get_mha_dtype(config)
Definition: builder.py:115
SampleCharRNNMaps::charToID
const std::map< char, int > charToID
Definition: sampleCharRNN.cpp:99
nvinfer1::plugin::DetectionOutput::configurePlugin
void configurePlugin(const Dims *inputDims, int nbInputs, const Dims *outputDims, int nbOutputs, const DataType *inputTypes, const DataType *outputTypes, const bool *inputIsBroadcast, const bool *outputIsBroadcast, PluginFormat floatFormat, int maxBatchSize) override
Definition: nmsPlugin.cpp:215
SampleMaskRCNN::teardown
bool teardown()
Definition: sampleUffMaskRCNN.cpp:472
samplesCommon::DeviceAllocator::operator()
bool operator()(void **ptr, size_t size) const
Definition: buffers.h:182
nvinfer1::plugin::FlattenConcat::copyToDevice
Weights copyToDevice(const void *hostData, size_t count)
nmtSample::BenchmarkWriter::finalize
void finalize() override
it is called right after inference ends
Definition: benchmarkWriter.cpp:44
SampleUffFasterRcnn::verifyOutput
bool verifyOutput(const samplesCommon::BufferManager &buffers)
Filters output detections and verifies results.
Definition: sampleUffFasterRCNN.cpp:413
backend.trt.test_loader.TestProfile
Definition: trt/test_loader.py:160
nvinfer1::LayerType::kPLUGIN_V2
@ kPLUGIN_V2
PluginV2 layer.
nvinfer1::plugin::SpecialSlice::clone
IPluginV2Ext * clone() const override
Clone the plugin object.
Definition: specialSlicePlugin.cpp:87
nvinfer1::ElementWiseOperation::kDIV
@ kDIV
Divide the first element by the second.
perf.DeviceBuffer.free
def free(self)
Definition: perf.py:35
builder_varseqlen.squad_output
def squad_output(prefix, config, init_dict, network, input_tensor)
Definition: builder_varseqlen.py:310
SampleMaskRCNN::SampleUniquePtr
std::unique_ptr< T, samplesCommon::InferDeleter > SampleUniquePtr
Definition: sampleUffMaskRCNN.cpp:313
bert::EmbLayerNormVarSeqlenPlugin::getWorkspaceSize
size_t getWorkspaceSize(const nvinfer1::PluginTensorDesc *inputs, int nbInputs, const nvinfer1::PluginTensorDesc *outputs, int nbOutputs) const override
Definition: embLayerNormVarSeqlenPlugin.cpp:254
SampleINT8::SampleUniquePtr
std::unique_ptr< T, samplesCommon::InferDeleter > SampleUniquePtr
Definition: sampleINT8.cpp:61
polygraphy.tools.args.tf.config.TfConfigArgs.parse
def parse(self, args)
Definition: tools/Polygraphy/polygraphy/tools/args/tf/config.py:29
nvinfer1::plugin::RegionPluginCreator
Definition: regionPlugin.h:102
nvinfer1::plugin::GridAnchorGenerator::mNumLayers
int mNumLayers
Definition: gridAnchorPlugin.h:95
MNISTBatchStream::mData
std::vector< float > mData
Definition: BatchStream.h:150
nvcaffeparser1::CaffeParser::parseNormalizeParam
std::vector< nvinfer1::PluginField > parseNormalizeParam(const trtcaffe::LayerParameter &msg, CaffeWeightFactory &weightFactory, BlobNameToTensor &tensors)
Definition: caffeParser.cpp:48
nvinfer1::IPluginExt::supportsFormat
virtual bool supportsFormat(DataType type, PluginFormat format) const =0
Check format support.
test_ir.TestNodeIO.test_append
def test_append(self, field_names)
Definition: test_ir.py:230
sample::InferenceTrace::InferenceTrace
InferenceTrace()=default
test_ir.TestNode.test_set_outputs_updates_old_outputs
def test_set_outputs_updates_old_outputs(self)
Definition: test_ir.py:171
nvinfer1::RNNOperation::kRELU
@ kRELU
Single gate RNN w/ ReLU activation function.
sample::dataTypeSize
int dataTypeSize(nvinfer1::DataType dataType)
Definition: sampleUtils.h:45
tests.quant_pooling_test.TestQuantAdaptiveAvgPool2d.test_input_fake_quant_disable
def test_input_fake_quant_disable(self)
Definition: quant_pooling_test.py:276
onnx_graphsurgeon.ir.tensor.Variable
Definition: tensor.py:149
audio_processing.dynamic_range_decompression
def dynamic_range_decompression(x, C=1)
Definition: audio_processing.py:103
polygraphy.backend.pyt.loader.BaseLoadPyt
Definition: backend/pyt/loader.py:18
backend.trt.test_runner.TestTrtRunner.test_context
def test_context(self)
Definition: trt/test_runner.py:51
SamplePlugin::teardown
bool teardown()
Used to clean up any state created in the sample class.
Definition: samplePlugin.cpp:356
BertInference::mEnableGraph
const bool mEnableGraph
Definition: bert_infer.h:338
nvinfer1::plugin::LReluPluginCreator::LReluPluginCreator
LReluPluginCreator()
Definition: lReluPlugin.cpp:132
nvinfer1::plugin::LReluPluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: lReluPlugin.cpp:145
onnx_graphsurgeon.ir.tensor.Tensor.__str__
def __str__(self)
Definition: tensor.py:132
backend.tf.test_runner.TestTfRunner
Definition: tf/test_runner.py:26
SampleAlgorithmSelector::mInputDims
nvinfer1::Dims mInputDims
The dimensions of the input to the network.
Definition: sampleAlgorithmSelector.cpp:361
helpers.tokenization
Definition: tokenization.py:1
backend.trt.test_loader.TestLoadPlugins.test_can_load_libnvinfer_plugins
def test_can_load_libnvinfer_plugins(self)
Definition: trt/test_loader.py:63
nvinfer1::TensorFormat::kNC2HW2
@ kNC2HW2
Deprecated name of kCHW2, provided for backwards compatibility and will be removed in TensorRT 8....
nvinfer1::IBuilderConfig::setMaxWorkspaceSize
virtual void setMaxWorkspaceSize(std::size_t workspaceSize)=0
Set the maximum workspace size.
nvinfer1::plugin::GridAnchorBasePluginCreator::mPluginName
std::string mPluginName
Definition: gridAnchorPlugin.h:120
sample::Arguments
std::unordered_multimap< std::string, std::string > Arguments
Definition: sampleOptions.h:63
polygraphy.tools.args.trt.loader.TrtLoaderArgs.get_trt_serialized_engine_loader
def get_trt_serialized_engine_loader(self)
Definition: tools/args/trt/loader.py:216
nvinfer1::EnumMax< RNNOperation >
constexpr int32_t EnumMax< RNNOperation >()
Maximum number of elements in RNNOperation enum.
Definition: NvInfer.h:2801
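The "maximum number of elements" idiom behind an entry like this can be sketched as a function template that is explicitly specialized per enum. The sketch below is not the TensorRT header: `enumMaxSketch`, `RnnOpSketch`, and `gateCountsSketch` are hypothetical names, and the enumerators and table values are illustrative assumptions.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Primary template: declared only, so using it with an enum that has no
// specialization fails at build time instead of returning a wrong count.
template <typename T>
constexpr int32_t enumMaxSketch();

// Hypothetical enum standing in for an RNN operation enum.
enum class RnnOpSketch : int32_t
{
    kRELU = 0,
    kTANH = 1,
    kLSTM = 2,
    kGRU = 3
};

// Explicit specialization: reports how many enumerators the enum has.
template <>
constexpr int32_t enumMaxSketch<RnnOpSketch>()
{
    return 4;
}

// The count is a constant expression, so it can size a per-enumerator table.
inline std::array<int32_t, enumMaxSketch<RnnOpSketch>()> gateCountsSketch()
{
    return {{1, 1, 4, 3}}; // illustrative values, one slot per enumerator
}

inline void demoEnumMaxSketch()
{
    static_assert(enumMaxSketch<RnnOpSketch>() == 4, "one slot per enumerator");
    assert(gateCountsSketch()[static_cast<int32_t>(RnnOpSketch::kLSTM)] == 4);
}
```

Keeping the count next to the enum means a table indexed by enumerator can be sized at compile time; the specialization has to be updated by hand whenever an enumerator is added.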
models.meta.Model.input_metadata
input_metadata
Definition: meta.py:35
half_float::detail::enable< T, half, half, half >::type
T type
Definition: ieee_half.h:379
sample::Logger::setReportableSeverity
void setReportableSeverity(Severity severity)
Method for controlling the verbosity of logging output.
Definition: samples/common/logging.h:254
TLTMaskRCNNConfig
Definition: tlt_mrcnn_config.h:25
nvinfer1::plugin::SpecialSlice::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: specialSlicePlugin.cpp:132
SampleReformatFreeIO::readDigits
bool readDigits(SampleBuffer &buffer, int groundTruthDigit)
Reads the digit map from the file.
Definition: sampleReformatFreeIO.cpp:444
nvinfer1::LayerType::kSHAPE
@ kSHAPE
Shape layer.
nvinfer1::plugin::GridAnchorGenerator::mPluginNamespace
std::string mPluginNamespace
Definition: gridAnchorPlugin.h:99
pytorch_quantization.nn.modules.quant_pooling.QuantMaxPool3d.forward
def forward(self, input)
Definition: quant_pooling.py:71
nvinfer1::plugin::DetectionLayer::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: detectionLayerPlugin.cpp:170
BatchStream::getDims
nvinfer1::Dims getDims() const
Definition: frcnnUtils.h:319
nvinfer1::plugin::SplitPlugin::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: split.h:124
nmtSample::Context::ptr
std::shared_ptr< Context > ptr
Definition: contextNMT.h:35
nvinfer1::plugin::SpecialSlicePluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: specialSlicePlugin.cpp:42
bert::EmbLayerNormVarSeqlenPlugin::mGammaDev
bert::cuda_unique_ptr< float > mGammaDev
Definition: embLayerNormVarSeqlenPlugin.h:88
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer._calibrator
_calibrator
Definition: tensor_quantizer.py:102
polygraphy.backend.onnx.loader.SaveOnnx.__call__
def __call__(self)
Definition: backend/onnx/loader.py:166
onnx_graphsurgeon.ir.node.Node.o
def o(self, consumer_idx=0, tensor_idx=0)
Definition: node.py:65
nvinfer1::plugin::MultilevelCropAndResize::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: multilevelCropAndResizePlugin.cpp:150
onnx_graphsurgeon.exporters.base_exporter
Definition: base_exporter.py:1
tools.test_args.TestParseMeta.test_parse_shape_only
def test_parse_shape_only(self)
Definition: test_args.py:7
nmtSample::Embedder::~Embedder
~Embedder() override=default
nvuffparser::FieldType::kINT32
@ kINT32
INT32 field type.
polygraphy.tools.args.tf.runner.TfRunnerArgs.tf_loader_args
tf_loader_args
Definition: tools/args/tf/runner.py:32
nvinfer1::plugin::CropAndResizePlugin::mNamespace
std::string mNamespace
Definition: cropAndResizePlugin.h:102
BertInference::mExecGraph
cudaGraphExec_t mExecGraph
Definition: bert_infer.h:355
nvinfer1::IAlgorithmVariant::getTactic
virtual int64_t getTactic() const =0
Return tactic of the algorithm.
half_float::detail::enable< T, expr, void, void >::type
T type
Definition: ieee_half.h:374
pytorch_quantization.nn.modules.quant_conv._QuantConvTransposeNd
Definition: quant_conv.py:220
nvinfer1::plugin::ProposalLayer::setPluginNamespace
void setPluginNamespace(const char *libNamespace) override
Set the namespace that this plugin object belongs to.
Definition: proposalLayerPlugin.cpp:182
nvinfer1::ScaleMode
ScaleMode
Controls how shift, scale and power are applied in a Scale layer.
Definition: NvInfer.h:2068
nvcaffeparser1::IPluginFactoryV2::isPluginV2
virtual bool isPluginV2(const char *layerName)=0
A user implemented function that determines if a layer configuration is provided by an IPluginV2.
onnx_graphsurgeon.ir.graph.Graph.__repr__
def __repr__(self)
Definition: graph.py:471
pytorch_quantization.tensor_quant.ScaledQuantDescriptor.__eq__
def __eq__(self, rhs)
Definition: tensor_quant.py:170
pytorch_quantization.nn.modules.quant_pooling.QuantAdaptiveAvgPool2d.forward
def forward(self, input)
Definition: quant_pooling.py:136
polygraphy.tools.args.tf.config.TfConfigArgs.xla
xla
Definition: tools/Polygraphy/polygraphy/tools/args/tf/config.py:32
MaskRCNNConfig::NUM_CLASSES
static const int NUM_CLASSES
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:60
bert::getXMMAKernelsV2
const FusedMultiHeadAttentionXMMAKernelV2 * getXMMAKernelsV2(Data_type type, unsigned int sm)
Definition: fused_multihead_attention_v2.h:522
generate.name
name
Definition: 09_shape_operations_with_the_layer_api/generate.py:54
polygraphy.backend.tf.loader.GraphFromCkpt.dir
dir
Definition: backend/tf/loader.py:175
half_float::detail::operator==
enable< bool, T, U >::type operator==(T x, U y)
Comparison for equality.
Definition: ieee_half.h:2127
nvinfer1::PaddingMode::kEXPLICIT_ROUND_DOWN
@ kEXPLICIT_ROUND_DOWN
Use explicit padding, rounding output size down.
samplesCommon::CaffeSampleParams::meanFileName
std::string meanFileName
Filename of mean file of a network.
Definition: argsParser.h:55
samplesCommon::GpuTimer::stop
void stop()
Definition: samples/common/common.h:825
builder_varseqlen.attention_layer_opt
def attention_layer_opt(prefix, config, init_dict, network, input_tensor, mask_idx, cu_seqlens, max_seqlen)
Definition: builder_varseqlen.py:120
sample::Bindings::dumpBindings
void dumpBindings(const nvinfer1::IExecutionContext &context, bool(*predicate)(const Binding &b), std::ostream &os) const
Definition: sampleUtils.h:401
bert::SkipLayerNormInterleavedPlugin::mLayerName
const std::string mLayerName
Definition: skipLayerNormInt8InterleavedPlugin.h:71
sample::ModelFormat::kONNX
@ kONNX
model.Decoder.attention_rnn
attention_rnn
Definition: demo/Tacotron2/tacotron2/model.py:258
nvinfer1::plugin::GroupNormalizationPlugin::enqueue
int enqueue(const nvinfer1::PluginTensorDesc *inputDesc, const nvinfer1::PluginTensorDesc *outputDesc, const void *const *inputs, void *const *outputs, void *workspace, cudaStream_t stream) override
Execute the layer.
Definition: groupNormalizationPlugin.cpp:107
polygraphy.backend.tf.util.get_input_metadata
def get_input_metadata(graph)
Definition: tf/util.py:83
pytorch_quantization.nn.modules.quant_pooling.QuantAvgPool1d
Definition: quant_pooling.py:76
bert::DATA_TYPE_E8M7
@ DATA_TYPE_E8M7
Definition: fused_multihead_attention_common.h:26
polygraphy.tools.args.trt.loader.TrtLoaderArgs.plugins
plugins
Definition: tools/args/trt/loader.py:85
bert::fused_multihead_attention_fp16_96_64_kernel_sm80_cu_o_len
unsigned int fused_multihead_attention_fp16_96_64_kernel_sm80_cu_o_len
Definition: fused_multihead_attention_fp16_96_64_kernel.sm80.cpp:2345
nvinfer1::plugin::FlattenConcat::getOutputDataType
DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: flattenConcat.cpp:221
test_ir.TestNode.test_set_inputs_updates_old_inputs
def test_set_inputs_updates_old_inputs(self)
Definition: test_ir.py:165
BatchStream::getLabels
float * getLabels()
Definition: frcnnUtils.h:304
backend.trt.test_loader.TestConfigLoader.test_multiple_profiles
def test_multiple_profiles(self, identity_builder_network)
Definition: trt/test_loader.py:234
half_float::detail::uint16
unsigned short uint16
Unsigned integer of (at least) 16 bits width.
Definition: ieee_half.h:331
pytorch_quantization.nn.modules._utils.QuantMixin.weight_quantizer
def weight_quantizer(self)
Definition: _utils.py:92
tests.quant_linear_test.TestQuantLinear.test_unused_kwargs
def test_unused_kwargs(self)
Definition: quant_linear_test.py:228
SampleINT8::constructNetwork
bool constructNetwork(SampleUniquePtr< nvinfer1::IBuilder > &builder, SampleUniquePtr< nvinfer1::INetworkDefinition > &network, SampleUniquePtr< nvinfer1::IBuilderConfig > &config, SampleUniquePtr< nvcaffeparser1::ICaffeParser > &parser, DataType dataType)
Parses a Caffe model and creates a TensorRT network.
Definition: sampleINT8.cpp:203
samplesCommon::enableDLA
void enableDLA(IBuilder *builder, IBuilderConfig *config, int useDLACore, bool allowGPUFallback=true)
Definition: samples/common/common.h:575
nvinfer1::plugin::PyramidROIAlign::getOutputDataType
DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: pyramidROIAlignPlugin.cpp:247
nvinfer1::IPluginCreator::getTensorRTVersion
virtual int32_t getTensorRTVersion() const
Return the version of the API the plugin creator was compiled with.
Definition: NvInferRuntimeCommon.h:896
PluginFactory
Definition: samples/opensource/samplePlugin/fcPlugin.h:321
BertInference::mBindings
std::vector< void * > mBindings
Definition: bert_infer.h:342
polygraphy.backend.trt.loader.CreateConfig.max_workspace_size
max_workspace_size
Definition: backend/trt/loader.py:321
helpers.tokenization.FullTokenizer.__init__
def __init__(self, vocab_file, do_lower_case=True)
Definition: tokenization.py:163
polygraphy.tools.precision.precision.STCheckerBase.__init__
def __init__(self, name)
Definition: precision.py:32
nvinfer1::PluginRegistrar
Register the plugin creator to the registry. The static registry object will be instantiated when the ...
Definition: NvInferRuntimeCommon.h:1397
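A common way to register a creator through a static object is to have the object's constructor push the creator into a registry. The sketch below shows that idiom in miniature under stated assumptions: `CreatorSketch`, `registrySketch`, `RegistrarSketch`, and `LReluCreatorSketch` are hypothetical stand-ins, not the nvinfer1 types or the actual registration code.

```cpp
#include <cassert>
#include <cstring>
#include <vector>

// Hypothetical creator interface (stand-in for a plugin creator).
struct CreatorSketch
{
    virtual ~CreatorSketch() = default;
    virtual const char* name() const = 0;
};

// Hypothetical registry. The function-local static is constructed on first
// use, so it is ready even when called from another static's constructor.
inline std::vector<CreatorSketch*>& registrySketch()
{
    static std::vector<CreatorSketch*> creators;
    return creators;
}

// Owns one creator instance and registers it when constructed.
template <typename T>
class RegistrarSketch
{
public:
    RegistrarSketch()
    {
        registrySketch().push_back(&mInstance);
    }

private:
    T mInstance{};
};

// Hypothetical creator used only for this illustration.
struct LReluCreatorSketch : CreatorSketch
{
    const char* name() const override
    {
        return "LReLU_sketch";
    }
};

// Static object: its constructor runs during static initialization,
// before main(), so the creator is registered without an explicit call.
static RegistrarSketch<LReluCreatorSketch> gLReluRegistrarSketch;

inline void demoRegistrarSketch()
{
    assert(registrySketch().size() == 1);
    assert(std::strcmp(registrySketch()[0]->name(), "LReLU_sketch") == 0);
}
```

The trade-off of this idiom is that registration depends on the translation unit being linked in; a static library whose object file is never referenced may be dropped by the linker along with its registrar.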
nvinfer1::DimensionOperation::kEQUAL
@ kEQUAL
1 if operands are equal, 0 otherwise.
bert::fused_multihead_attention_fp16_384_64_kernel_sm80_cu_o
unsigned char fused_multihead_attention_fp16_384_64_kernel_sm80_cu_o[]
Definition: fused_multihead_attention_fp16_384_64_kernel.sm80.cpp:20
onnx_graphsurgeon.importers.base_importer.BaseImporter
Definition: base_importer.py:19
polygraphy.tools.base.tool.Tool.add_parser_args
def add_parser_args(self, parser)
Definition: tool.py:42
nmtSample::TextWriter::getInfo
std::string getInfo() override
get the textual description of the component
Definition: textWriter.cpp:40
nvinfer1::plugin::CoordConvACPlugin::iW
int iW
Definition: coordConvACPlugin.h:86
polygraphy.backend.onnx.util.get_dtype
def get_dtype(tensor)
Definition: onnx/util.py:132
nvinfer1::plugin::Normalize::~Normalize
~Normalize() override=default
test_struct.TestTensorMetadata.test_str_no_dtype
def test_str_no_dtype(self)
Definition: common/test_struct.py:26
nvinfer1::IPluginV2::getPluginVersion
virtual const char * getPluginVersion() const =0
Return the plugin version.
onnx_graphsurgeon.ir.tensor
Definition: tensor.py:1
tests.print_test.TestPrint.test_print_descriptor
def test_print_descriptor(self)
Definition: print_test.py:35
bert::EmbLayerNormVarSeqlenPluginCreator::mPluginAttributes
static std::vector< nvinfer1::PluginField > mPluginAttributes
Definition: embLayerNormVarSeqlenPlugin.h:136
nvinfer1::TacticSource::kCUBLAS
@ kCUBLAS
cuBLAS tactics.
nvuffparser::FieldCollection
Definition: NvUffParser.h:87
nvinfer1::ElementWiseOperation::kOR
@ kOR
Logical OR of two elements.
util.test_misc.test_shape_matching
def test_shape_matching(case)
Definition: test_misc.py:72
sample::anonymous_namespace{sampleInference.cpp}::EnqueueExplicit
Functor to enqueue inference with explicit batch.
Definition: sampleInference.cpp:214
sample::BuildOptions::engine
std::string engine
Definition: sampleOptions.h:121
polygraphy.backend.tf.loader.OptimizeGraph
Definition: backend/tf/loader.py:27
tests.tensor_quantizer_test.TestTensorQuantizer.test_init_calib
def test_init_calib(self)
Definition: tensor_quantizer_test.py:168
polygraphy.tools.args.trt.loader.TrtLoaderArgs._wrap_if_plugins
def _wrap_if_plugins(self, script, obj_name)
Definition: tools/args/trt/loader.py:108
nvinfer1::EnumMax< DimensionType >
constexpr int32_t EnumMax< DimensionType >()
Maximum number of elements in DimensionType enum.
Definition: NvInferRuntimeCommon.h:175
std::numeric_limits< half_float::half >::denorm_min
static half_float::half denorm_min()
Smallest positive subnormal value.
Definition: samples/common/half.h:4264
polygraphy.backend.onnx.loader.SaveOnnx.__init__
def __init__(self, model, path=None)
Definition: backend/onnx/loader.py:154
tests.utils.copy_state_and_quantize
def copy_state_and_quantize(dst, src, num_bits)
Definition: tools/pytorch-quantization/tests/utils.py:40
half_float::detail::half_caster< half, half, R >::cast
static half cast(half arg)
Definition: samples/common/half.h:2684
nvinfer1::plugin::DetectionLayerPluginCreator::~DetectionLayerPluginCreator
~DetectionLayerPluginCreator()
Definition: detectionLayerPlugin.h:113
nvinfer1::LayerType::kRESIZE
@ kRESIZE
Resize Layer.
nvinfer1::utils::reorderSubBuffers
bool reorderSubBuffers(void *input, const int32_t *order, int32_t num, int32_t size)
Takes an input stream and re-orders num chunks of the data given the size and order.
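To make "re-orders num chunks of the data given the size and order" concrete, here is a minimal sketch with the same parameter list. It is not the TensorRT implementation: `reorderChunksSketch` is a hypothetical name, and two points are assumptions rather than documented behavior: that `size` is the chunk size in bytes, and that `order[i]` names the source chunk that ends up at position `i`.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical stand-in for a chunk re-ordering helper.
// Assumptions: size is in bytes; order[i] is the source chunk for slot i.
inline bool reorderChunksSketch(void* input, const int32_t* order, int32_t num, int32_t size)
{
    if (input == nullptr || order == nullptr || num <= 0 || size <= 0)
    {
        return false;
    }
    // Reject out-of-range indices before touching the buffer.
    for (int32_t i = 0; i < num; ++i)
    {
        if (order[i] < 0 || order[i] >= num)
        {
            return false;
        }
    }
    auto* bytes = static_cast<std::uint8_t*>(input);
    const std::size_t chunk = static_cast<std::size_t>(size);
    // Build the permuted layout in scratch space, then copy it back in place.
    std::vector<std::uint8_t> scratch(static_cast<std::size_t>(num) * chunk);
    for (int32_t i = 0; i < num; ++i)
    {
        std::memcpy(scratch.data() + static_cast<std::size_t>(i) * chunk,
            bytes + static_cast<std::size_t>(order[i]) * chunk, chunk);
    }
    std::memcpy(bytes, scratch.data(), scratch.size());
    return true;
}

inline void demoReorderSketch()
{
    char gates[] = "iiccffoo"; // four 2-byte chunks
    const int32_t order[] = {2, 0, 1, 3};
    const bool ok = reorderChunksSketch(gates, order, 4, 2);
    (void) ok;
    assert(ok);
    assert(std::memcmp(gates, "ffiiccoo", 8) == 0);
}
```

The scratch buffer keeps the permutation simple at the cost of one extra copy of the data; an in-place cycle-following permutation would avoid the allocation but is harder to read.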
loss_function.WaveGlowLoss.forward
def forward(self, model_output, clean_audio)
Definition: waveglow/loss_function.py:24
half_float::detail::hypot
expr hypot(half x, half y)
Hypotenuse function.
Definition: ieee_half.h:2394
FCPlugin::mDeviceKernel
void * mDeviceKernel
Definition: samples/opensource/samplePlugin/fcPlugin.h:312
nvinfer1::plugin::BatchTilePlugin::getOutputDataType
DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: batchTilePlugin.cpp:91
polygraphy.comparator.struct.AccuracyResult.stats
def stats(self, runner_pair=None)
Definition: comparator/struct.py:194
onnx_graphsurgeon.exporters.onnx_exporter.OnnxExporter.export_node
onnx.NodeProto export_node(Node node)
Definition: onnx_exporter.py:54
nvinfer1::plugin::PriorBox::setPluginNamespace
void setPluginNamespace(const char *pluginNamespace) override
Set the namespace that this plugin object belongs to.
Definition: priorBoxPlugin.cpp:283
nvinfer1::plugin::PriorBoxParameters::stepW
float stepW
Definition: NvInferPluginUtils.h:99
SampleAlgorithmSelector::SampleUniquePtr
std::unique_ptr< T, samplesCommon::InferDeleter > SampleUniquePtr
Definition: sampleAlgorithmSelector.cpp:315
sample::InferenceOptions::streams
int streams
Definition: sampleOptions.h:153
half_float::detail::asin
expr asin(half arg)
Arc sine.
Definition: ieee_half.h:2438
tools.test_run.TestOther.test_custom_tolerance
def test_custom_tolerance(self)
Definition: test_run.py:229
nvinfer1::plugin::GridAnchorParameters::W
int32_t W
Definition: NvInferPluginUtils.h:147
model.WaveGlow.__init__
def __init__(self, n_mel_channels, n_flows, n_group, n_early_every, n_early_size, WN_config)
Definition: demo/Tacotron2/waveglow/model.py:158
nvinfer1::plugin::PluginCreatorRegistry::addPluginCreator
void addPluginCreator(void *logger, const char *libNamespace)
Definition: InferPlugin.cpp:79
BufferDesc
Definition: sampleReformatFreeIO.cpp:64
samplesCommon::SampleParams
The SampleParams structure groups the basic parameters required by all sample networks.
Definition: argsParser.h:36
layers.TacotronSTFT.__init__
def __init__(self, filter_length=1024, hop_length=256, win_length=1024, n_mel_channels=80, sampling_rate=22050, mel_fmin=0.0, mel_fmax=8000.0)
Definition: layers.py:58
nvinfer1::plugin::GroupNormalizationPlugin::detachFromContext
void detachFromContext() override
Detach the plugin object from its execution context.
Definition: groupNormalizationPlugin.cpp:101
helpers.tokenization.FullTokenizer.convert_ids_to_tokens
def convert_ids_to_tokens(self, ids)
Definition: tokenization.py:180
SampleMNISTAPIParams::inputW
int inputW
The input width.
Definition: sampleMNISTAPI.cpp:49
polygraphy.util.cuda.Cuda.destroy_stream
def destroy_stream(self, stream)
Definition: cuda.py:55
nvinfer1::plugin::FlattenConcatPluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: flattenConcat.h:129
nvinfer1::IRefitter::getDynamicRangeMin
virtual float getDynamicRangeMin(const char *tensorName) const =0
Get minimum of dynamic range.
nvinfer1::plugin::SplitPlugin::destroy
void destroy() override
Destroy the plugin object.
Definition: split.h:102
nvinfer1::IPluginRegistry::getErrorRecorder
virtual IErrorRecorder * getErrorRecorder() const noexcept=0
Get the ErrorRecorder assigned to this interface.
helpers.tokenization.BasicTokenizer
Definition: tokenization.py:215
nvinfer1::plugin::TRTException::setMessage
void setMessage(const char *msg)
Definition: checkMacrosPlugin.h:87
SampleUffMNIST::mInputDims
nvinfer1::Dims mInputDims
Definition: sampleUffMNIST.cpp:103
pytorch_quantization.nn.modules.quant_rnn.QuantRNNBase.hidden_size
hidden_size
Definition: modules/quant_rnn.py:47
nvinfer1::plugin::FlattenConcat::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: flattenConcat.cpp:161
nvinfer1::ReduceOperation
ReduceOperation
Enumerates the reduce operations that may be performed by a Reduce layer.
Definition: NvInfer.h:3556
polygraphy.tools.args.trt.loader.TrtLoaderArgs
Definition: tools/args/trt/loader.py:24
half_float::detail::half_caster< T, expr, R >::cast_impl
static T cast_impl(half arg, false_type)
Definition: samples/common/half.h:2676
tools.common.run_polygraphy_run
def run_polygraphy_run(additional_opts=[], disable_verbose=False)
Definition: tools/common.py:44
nvinfer1::plugin::ProposalPlugin::supportsFormat
bool supportsFormat(DataType type, PluginFormat format) const override
Check format support.
Definition: proposalPlugin.cpp:235
nvinfer1::ActivationType::kRELU
@ kRELU
Rectified linear activation.
models.meta.check_identity_identity
def check_identity_identity(runner)
Definition: meta.py:55
polygraphy.common.struct.MetadataTuple
MetadataTuple
Definition: common/struct.py:20
onnx_graphsurgeon.util.misc.SynchronizedList.insert
def insert(self, i, x)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/util/misc.py:107
bert::SkipLayerNormInterleavedPlugin::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: skipLayerNormInt8InterleavedPlugin.cpp:219
sample::TrtDestroyer
Definition: sampleUtils.h:457
polygraphy.backend.tf.loader.CreateConfig
Definition: backend/tf/loader.py:357
tools.common.run_polygraphy_inspect
def run_polygraphy_inspect(additional_opts=[], disable_verbose=False)
Definition: tools/common.py:48
nvinfer1::plugin::DetectionOutput::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: nmsPlugin.cpp:124
sample::Bindings::getInputBindings
std::unordered_map< std::string, int > getInputBindings() const
Definition: sampleUtils.h:418
SampleCharRNNWeightNames::RNNB_L1_NAME
const std::string RNNB_L1_NAME
Definition: sampleCharRNN.cpp:76
nvinfer1::plugin::BatchedNMSPlugin::param
NMSParameters param
Definition: batchedNMSPlugin.h:67
nvinfer1::plugin::InstanceNormalizationPlugin
Definition: instanceNormalizationPlugin.h:32
polygraphy.backend.trt_legacy.LoadNetworkFromCaffe
Definition: backend/trt_legacy.py:143
polygraphy.tools.args.onnx.loader.OnnxLoaderArgs.save_onnx
save_onnx
Definition: tools/args/onnx/loader.py:62
nmtSample::getVolume
int getVolume(nvinfer1::Dims dims)
Definition: trtUtil.cpp:36
anonymous_namespace{pluginLogging.h}::LOG_VERBOSE
LogStreamConsumer LOG_VERBOSE(const Logger &logger)
Produces a LogStreamConsumer object that can be used to log messages of severity kVERBOSE.
Definition: pluginLogging.h:473
bert::EmbLayerNormVarSeqlenPluginCreator::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin creator object.
Definition: embLayerNormVarSeqlenPlugin.cpp:519
bert::SkipLayerNormInterleavedPlugin
Definition: skipLayerNormInt8InterleavedPlugin.h:30
nvinfer1::plugin::BatchTilePlugin::getWorkspaceSize
size_t getWorkspaceSize(int) const override
Definition: batchTilePlugin.cpp:86
backend.trt.test_loader.TestConfigLoader.test_workspace_size
def test_workspace_size(self, identity_builder_network)
Definition: trt/test_loader.py:185
nvinfer1::ILogger::Severity
Severity
Definition: NvInferRuntimeCommon.h:1084
nvinfer1::CUDADriverWrapper::cuLinkAddData
CUresult cuLinkAddData(CUlinkState state, CUjitInputType type, void *data, size_t size, const char *name, unsigned int numOptions, CUjit_option *options, void **optionValues) const
Definition: cudaDriverWrapper.cpp:118
nvinfer1::plugin::BatchTilePluginCreator::mFC
static PluginFieldCollection mFC
Definition: batchTilePlugin.h:110
anonymous_namespace{pyramidROIAlignPlugin.cpp}::PYRAMIDROIALGIN_PLUGIN_VERSION
const char * PYRAMIDROIALGIN_PLUGIN_VERSION
Definition: pyramidROIAlignPlugin.cpp:28
comparator.test_struct.TestRunResults.test_setitem
def test_setitem(self, run_results)
Definition: comparator/test_struct.py:61
nvinfer1::plugin::BatchedNMSBasePluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: batchedNMSPlugin.cpp:501
sample::ShapeRange
std::array< std::vector< int >, nvinfer1::EnumMax< nvinfer1::OptProfileSelector >()> ShapeRange
Definition: sampleOptions.h:67
test_ir.TestGraph.test_layer_with_iterables
def test_layer_with_iterables(self)
Definition: test_ir.py:547
pytorch_quantization.nn.modules.quant_rnn.GRUCell.hidden_size
hidden_size
Definition: modules/quant_rnn.py:429
nvinfer1::plugin::CoordConvACPluginCreator
Definition: coordConvACPlugin.h:92
SampleCharRNNParams::inputSentences
std::vector< std::string > inputSentences
Definition: sampleCharRNN.cpp:131
polygraphy.tools.args.model.ModelArgs.input_shapes
input_shapes
Definition: tools/Polygraphy/polygraphy/tools/args/model.py:88
half_float::detail::binary_specialized::fmax
static expr fmax(float x, float y)
Maximum implementation.
Definition: samples/common/half.h:2569
tests.quant_conv_test.TestQuantConv1D.test_weight_fake_quant_per_channel
def test_weight_fake_quant_per_channel(self)
Definition: quant_conv_test.py:307
nvinfer1::IHostMemory::size
virtual std::size_t size() const noexcept=0
The size in bytes of the data that was allocated.
onnx_graphsurgeon.ir.tensor.Variable.__init__
def __init__(self, str name, np.dtype dtype=None, Sequence[Union[int, str]] shape=None)
Definition: tensor.py:155
sample::ModelOptions::prototxt
std::string prototxt
Definition: sampleOptions.h:97
FCPlugin::configureWithFormat
void configureWithFormat(const nvinfer1::Dims *inputDims, int nbInputs, const nvinfer1::Dims *outputDims, int nbOutputs, nvinfer1::DataType type, nvinfer1::PluginFormat format, int maxBatchSize) override
Definition: samples/opensource/samplePlugin/fcPlugin.h:119
nvinfer1::plugin::LReLU::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: lReluPlugin.cpp:73
polygraphy.tools.surgeon.surgeon.Surgeon
Main tool.
Definition: surgeon.py:21
pytorch_quantization.nn.modules.quant_bert.QuantBertSelfAttention.query
query
Definition: quant_bert.py:64
model.Prenet.layers
layers
Definition: demo/Tacotron2/tacotron2/model.py:112
nvinfer1::plugin::GenerateDetection::GenerateDetection
GenerateDetection(int num_classes, int keep_topk, float score_threshold, float iou_threshold, const nvinfer1::Dims &image_size)
Definition: generateDetectionPlugin.cpp:108
polygraphy.util.misc.FreeOnException.__init__
def __init__(self, objs)
Definition: Polygraphy/polygraphy/util/misc.py:433
nvinfer1::plugin::BatchTilePlugin
Definition: batchTilePlugin.h:27
onnx_graphsurgeon.util.misc.SynchronizedList.parent_obj
parent_obj
Definition: onnx-graphsurgeon/onnx_graphsurgeon/util/misc.py:71
BatchStream::mPrefix
std::string mPrefix
Batch file name prefix.
Definition: BatchStream.h:383
pytorch_quantization.nn.modules.quant_rnn.QuantRNNBase.__init__
def __init__(self, mode, input_size, hidden_size, num_layers=1, bias=True, batch_first=False, dropout=0, bidirectional=False, **kwargs)
Definition: modules/quant_rnn.py:43
tests.integration_test.TestNetwork.test_amp_fp16
def test_amp_fp16(self)
Definition: integration_test.py:81
SampleCharRNNLoop::LstmParams::recurrentBias
nvinfer1::ITensor * recurrentBias
Definition: sampleCharRNN.cpp:253
SampleFasterRCNN::mParams
SampleFasterRCNNParams mParams
The parameters for the sample.
Definition: sampleFasterRCNN.cpp:83
helpers.tokenization.convert_ids_to_tokens
def convert_ids_to_tokens(inv_vocab, ids)
Definition: tokenization.py:147
half_float::detail::operator<<
enable< std::basic_ostream< charT, traits > &, T >::type operator<<(std::basic_ostream< charT, traits > &out, T arg)
Output operator.
Definition: ieee_half.h:2211
nvinfer1::plugin::GridAnchorGenerator::~GridAnchorGenerator
~GridAnchorGenerator() override
Definition: gridAnchorPlugin.cpp:160
polygraphy.tools.args.onnx.loader.OnnxLoaderArgs.register
def register(self, maker)
Definition: tools/args/onnx/loader.py:46
polygraphy.util.format.FormatManager.determine_format
def determine_format(shape)
Definition: format.py:66
perf.DeviceBuffer.__init__
def __init__(self, shape, dtype=trt.int32)
Definition: perf.py:29
SampleSSD::SampleSSD
SampleSSD(const SampleSSDParams &params)
Definition: sampleSSD.cpp:66
LogStreamConsumerBase::mBuffer
LogStreamConsumerBuffer mBuffer
Definition: pluginLogging.h:117
nvinfer1::plugin::SplitPlugin::attachToContext
void attachToContext(cudnnContext *, cublasContext *, nvinfer1::IGpuAllocator *) override
Attach the plugin object to an execution context and grant the plugin the access to some context reso...
Definition: split.h:128
nvinfer1::plugin::RegionPluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: regionPlugin.cpp:484
AlgorithmCacheReader::AlgorithmCacheItem::nbOutputs
int32_t nbOutputs
Definition: sampleAlgorithmSelector.cpp:240
pytorch_quantization.calib.calibrator._Calibrator._num_bits
_num_bits
Definition: tools/pytorch-quantization/pytorch_quantization/calib/calibrator.py:32
test_ir.TestGraph.test_register_opset
def test_register_opset(self)
Definition: test_ir.py:473
polygraphy.tools.args.tf.loader.TfLoaderArgs.trt_loader_args
trt_loader_args
Definition: tools/args/tf/loader.py:57
SampleUffSSD::SampleUffSSD
SampleUffSSD(const SampleUffSSDParams &params)
Definition: sampleUffSSD.cpp:69
samplesCommon::HostMemory::HostMemory
HostMemory(std::size_t size, DataType type)
Definition: samples/common/common.h:324
nmtSample::Context::addToModel
void addToModel(nvinfer1::INetworkDefinition *network, nvinfer1::ITensor *actualInputSequenceLengths, nvinfer1::ITensor *memoryStates, nvinfer1::ITensor *alignmentScores, nvinfer1::ITensor **contextOutput)
Add the context vector calculation to the network.
Definition: contextNMT.cpp:24
SampleConfig::mPrintLayercInfo
bool mPrintLayercInfo
Definition: sampleConfig.h:54
UffPoolPluginV2::isOutputBroadcastAcrossBatch
bool isOutputBroadcastAcrossBatch(int outputIndex, const bool *inputIsBroadcasted, int nbInputs) const override
Definition: sampleUffPluginV2Ext.cpp:568
nvinfer1::BuilderFlag
BuilderFlag
List of valid modes that the builder can enable when creating an engine from a network definition.
Definition: NvInfer.h:6384
SampleCharRNNParams::dataSize
int dataSize
Definition: sampleCharRNN.cpp:119
nvinfer1::plugin::ProposalLayer::getOutputDataType
DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: proposalLayerPlugin.cpp:330
anonymous_namespace{nmsPlugin.cpp}::NMS_PLUGIN_NAME
const char * NMS_PLUGIN_NAME
Definition: nmsPlugin.cpp:31
BufferDesc::scalarPerVector
int scalarPerVector
Definition: sampleReformatFreeIO.cpp:119
samplesCommon::vPPM::fileName
std::string fileName
Definition: samples/common/common.h:662
Logger::reportFail
static int reportFail(const TestAtom &testAtom)
Definition: pluginLogging.h:374
BatchStream::mFileLabels
std::vector< float > mFileLabels
List of label files.
Definition: BatchStream.h:382
SampleReformatFreeIO::constructNetwork
void constructNetwork(SampleUniquePtr< nvcaffeparser1::ICaffeParser > &parser, SampleUniquePtr< nvinfer1::INetworkDefinition > &network)
Uses a Caffe parser to create the single-layer network and marks the output layers.
Definition: sampleReformatFreeIO.cpp:358
nvinfer1::plugin::ResizeNearest::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: resizeNearestPlugin.cpp:88
MaskRCNNConfig::MASK_THRESHOLD
static const float MASK_THRESHOLD
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:33
onnx_graphsurgeon.logger.logger.LoggerIndent.indent
indent
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:30
nvinfer1::plugin::CublasError::CublasError
CublasError(const char *fl, const char *fn, int ln, int stat, const char *msg=nullptr)
Definition: checkMacrosPlugin.h:122
polygraphy.backend.trt.loader.NetworkFromOnnxPath.__call__
def __call__(self)
Definition: backend/trt/loader.py:153
nmtSample::DeviceBuffer::ptr
std::shared_ptr< DeviceBuffer< T > > ptr
Definition: deviceBuffer.h:30
polygraphy.tools.args.onnx.runner.OnnxtfRunnerArgs
Definition: tools/args/onnx/runner.py:20
pytorch_quantization.tensor_quant.QuantDescriptor
QuantDescriptor
Definition: tensor_quant.py:222
nvinfer1::plugin::GroupNormalizationPlugin::bnBias
void * bnBias
Definition: groupNormalizationPlugin.h:106
samplesCommon::BufferManager::~BufferManager
~BufferManager()=default
nvinfer1::plugin::NMSParameters::topK
int32_t topK
Definition: NvInferPluginUtils.h:250
bert::fused_multihead_attention_int8_384_64_kernel_sm80_cu_o_len
unsigned int fused_multihead_attention_int8_384_64_kernel_sm80_cu_o_len
Definition: fused_multihead_attention_int8_384_64_kernel.sm80.cpp:2721
bert::SkipLayerNormInterleavedPlugin::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: skipLayerNormInt8InterleavedPlugin.cpp:209
comparator.test_comparator.TestComparator.test_validate_inf
def test_validate_inf(self)
Definition: test_comparator.py:139
tests.quant_pooling_test.TestQuantMaxPool3d.test_input_fake_quant
def test_input_fake_quant(self)
Definition: quant_pooling_test.py:117
tools.test_inspect.TestInspectModel.test_model_trt_engine_sanity
def test_model_trt_engine_sanity(self, run_inspect_model)
Definition: test_inspect.py:281
nmtSample::SequenceProperties::getEndSequenceId
virtual int getEndSequenceId()=0
nvinfer1::plugin::GenerateDetection::clone
IPluginV2Ext * clone() const override
Clone the plugin object.
Definition: generateDetectionPlugin.cpp:175
layers.LinearNorm
Definition: layers.py:23
comparator.test_postprocess.TestTopK.test_k_can_exceed_array_len
def test_k_can_exceed_array_len(self)
Definition: test_postprocess.py:28
SampleUffSSDParams
The SampleUffSSDParams structure groups the additional parameters required by the Uff SSD sample.
Definition: sampleUffSSD.cpp:48
nvinfer1::plugin::DetectionLayer::initialize
int initialize() override
Initialize the layer for execution.
Definition: detectionLayerPlugin.cpp:123
nvinfer1::INetworkDefinition
A network definition for input to the builder.
Definition: NvInfer.h:4907
object
helpers.tokenization.BasicTokenizer._tokenize_chinese_chars
def _tokenize_chinese_chars(self, text)
Definition: tokenization.py:281
nvinfer1::plugin::InstanceNormalizationPlugin::_h_scale
std::vector< float > _h_scale
Definition: instanceNormalizationPlugin.h:92
nvinfer1::IReduceLayer
Layer that represents a reduction operator across Shape, Int32, Float, and Half tensors.
Definition: NvInfer.h:3579
nvuffparser
The TensorRT UFF parser API namespace.
builder.BertConfig.head_size
head_size
Definition: builder.py:96
nvinfer1::plugin::GenerateDetectionPluginCreator::~GenerateDetectionPluginCreator
~GenerateDetectionPluginCreator()
Definition: generateDetectionPlugin.h:117
nvinfer1::plugin::BatchedNMSPlugin::getOutputDataType
nvinfer1::DataType getOutputDataType(int index, const nvinfer1::DataType *inputType, int nbInputs) const override
Definition: batchedNMSPlugin.cpp:429
polygraphy.comparator.struct
Definition: comparator/struct.py:1
test_func.TestExtend.test_extend_0_args_1_rv
def test_extend_0_args_1_rv(self)
Definition: test_func.py:46
MaskRCNNConfig::TOP_DOWN_PYRAMID_SIZE
static const int TOP_DOWN_PYRAMID_SIZE
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:57
SampleConfig::mFullTextFilename
std::string mFullTextFilename
Definition: sampleConfig.h:39
tests.print_test.TestPrint.linear
linear
Definition: print_test.py:49
Logger::testResultString
static const char * testResultString(TestResult result)
Returns an appropriate string for prefixing a test result message with the given result.
Definition: pluginLogging.h:416
nvinfer1::plugin::Region::classes
int classes
Definition: regionPlugin.h:95
nvinfer1::ElementWiseOperation::kPOW
@ kPOW
The first element to the power of the second element.
nvinfer1::plugin::InstanceNormalizationPlugin::_d_bias
float * _d_bias
Definition: instanceNormalizationPlugin.h:95
polygraphy.backend.base.runner.BaseRunner.deactivate_impl
def deactivate_impl(self)
Definition: backend/base/runner.py:135
option::has_arg
int has_arg
Definition: getopt.h:77
test_ir.TestGraph.test_layer_with_tensors
def test_layer_with_tensors(self)
Definition: test_ir.py:504
MaskRCNNUtils::Mask::raw
float raw[MaskRCNNConfig::MASK_POOL_SIZE *2 *MaskRCNNConfig::MASK_POOL_SIZE *2]
Definition: sampleUffMaskRCNN.cpp:59
util.test_cuda.TestDeviceBuffer.test_empty_tensor_to_host
def test_empty_tensor_to_host(self)
Definition: test_cuda.py:88
half_float::detail::functions::frexp
static half frexp(half arg, int *exp)
Decompression implementation.
Definition: samples/common/half.h:2153
dumpTFWts.type
type
Definition: dumpTFWts.py:35
helpers.tokenization.printable_text
def printable_text(text)
Definition: tokenization.py:97
nmtSample::Encoder::getMemoryStatesSize
virtual int getMemoryStatesSize()=0
Get the size of the memory state vector.
nvcaffeparser1::BinaryProtoBlob
Definition: binaryProtoBlob.h:26
nvinfer1::Dims2::Dims2
Dims2()
Construct an empty Dims2 object.
Definition: NvInfer.h:71
pytorch_quantization.tensor_quant.ScaledQuantDescriptor._num_bits
_num_bits
Definition: tensor_quant.py:79
half_float::half::operator/=
detail::enable< half &, T >::type operator/=(T rhs)
Arithmetic assignment.
Definition: samples/common/half.h:1471
nvinfer1::plugin::GridAnchorPluginCreator::~GridAnchorPluginCreator
~GridAnchorPluginCreator() override=default
nvinfer1::EnumMax< TensorFormat >
constexpr int32_t EnumMax< TensorFormat >()
Maximum number of elements in TensorFormat enum.
Definition: NvInferRuntimeCommon.h:352
nvinfer1::IParametricReLULayer::~IParametricReLULayer
virtual ~IParametricReLULayer() noexcept
Definition: NvInfer.h:4323
remove
Definition: remove.py:1
comparator.test_data_loader.TestDataLoader.test_no_shape_tensor_false_positive_negative_dims
def test_no_shape_tensor_false_positive_negative_dims(self)
Definition: test_data_loader.py:62
stft.STFT.filter_length
filter_length
Definition: stft.py:46
SampleINT8API::setLayerPrecision
void setLayerPrecision(SampleUniquePtr< nvinfer1::INetworkDefinition > &network)
Sets computation precision for network layers.
Definition: sampleINT8API.cpp:211
pytorch_quantization.nn.modules.quant_bert.QuantBertSelfAttention.attention_head_size
attention_head_size
Definition: quant_bert.py:60
nvinfer1::IInt8EntropyCalibrator::getAlgorithm
CalibrationAlgoType getAlgorithm() override
Signal that this is the entropy calibrator.
Definition: NvInfer.h:6069
nvinfer1::plugin::PriorBox::minSize
Weights minSize
Definition: priorBoxPlugin.h:94
nvinfer1::IPluginV2DynamicExt::configurePlugin
virtual void configurePlugin(const DynamicPluginTensorDesc *in, int32_t nbInputs, const DynamicPluginTensorDesc *out, int32_t nbOutputs)=0
Configure the layer.
backend.onnxrt.test_runner.TestLoggerCallbacks
Definition: onnxrt/test_runner.py:24
onnx_graphsurgeon.logger.logger.LoggerSuppress.__init__
def __init__(self, logger, severity)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:42
nvinfer1::plugin::GroupNormalizationPluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: groupNormalizationPlugin.cpp:260
bert::getXMMAKernels
const FusedMultiHeadAttentionXMMAKernel * getXMMAKernels(Data_type type, unsigned int sm)
Definition: fused_multihead_attention.h:334
SampleGoogleNet::mParams
samplesCommon::CaffeSampleParams mParams
Definition: sampleGoogleNet.cpp:72
nvinfer1::plugin::Reorg::isOutputBroadcastAcrossBatch
bool isOutputBroadcastAcrossBatch(int outputIndex, const bool *inputIsBroadcasted, int nbInputs) const override
Definition: reorgPlugin.cpp:142
nvinfer1::plugin::PyramidROIAlign::mFeatureSpatialSize
xy_t mFeatureSpatialSize[mFeatureMapCount]
Definition: pyramidROIAlignPlugin.h:100
onnx_graphsurgeon.logger.logger.Logger.suppress
def suppress(self, severity=CRITICAL)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:142
nvinfer1::Weights::values
const void * values
The weight values, in a contiguous array.
Definition: NvInferRuntime.h:81
nvinfer1::plugin::GenerateDetection::mScoreThreshold
float mScoreThreshold
Definition: generateDetectionPlugin.h:97
samplesCommon::TimerBase::milliseconds
float milliseconds() const noexcept
Definition: samples/common/common.h:790
half_float::detail::functions::lround
static long lround(half arg)
Nearest integer implementation.
Definition: samples/common/half.h:2110
anonymous_namespace{nvFasterRCNNPlugin.cpp}::RPROI_PLUGIN_VERSION
const char * RPROI_PLUGIN_VERSION
Definition: nvFasterRCNNPlugin.cpp:30
nvinfer1::PluginTensorDesc::dims
Dims dims
Definition: NvInferRuntimeCommon.h:369
OutputParams::itemProbPairVec
std::vector< std::pair< int32_t, float > > itemProbPairVec
Definition: sampleMovieLens.cpp:55
mrcnn_to_trt_single.CocoConfig
Definition: mrcnn_to_trt_single.py:75
SampleOnnxMnistCoordConvAC::mParams
samplesCommon::OnnxSampleParams mParams
The parameters for the sample.
Definition: sampleOnnxMnistCoordConvAC.cpp:75
SampleConfig::mTextFilename
std::string mTextFilename
Definition: sampleConfig.h:38
nvinfer1::plugin::ProposalPluginCreator::mFC
static PluginFieldCollection mFC
Definition: proposalPlugin.h:143
Bbox::Bbox
Bbox(T xmin, T ymin, T xmax, T ymax)
Definition: bboxUtils.h:29
half_float::detail::half_caster< half, U, R >::cast_impl
static half cast_impl(U arg, false_type)
Definition: samples/common/half.h:2632
bert::Fused_multihead_attention_params_v2::force_unroll
bool force_unroll
Definition: fused_multihead_attention_v2.h:70
nvinfer1::IBuilder
Builds an engine from a network definition.
Definition: NvInfer.h:6928
SampleReformatFreeIO::mMeanBlob
SampleUniquePtr< nvcaffeparser1::IBinaryProtoBlob > mMeanBlob
Definition: sampleReformatFreeIO.cpp:270
tests.quant_rnn_test.TestQuantLSTM.test_no_quant_all_modes
def test_no_quant_all_modes(self, verbose)
Definition: quant_rnn_test.py:338
half_float::detail::functions::floor
static half floor(half arg)
Floor implementation.
Definition: samples/common/half.h:2078
nvinfer1::plugin::PyramidROIAlign::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: pyramidROIAlignPlugin.cpp:206
sample::anonymous_namespace{sampleEngines.cpp}::RndInt8Calibrator::getBatch
bool getBatch(void *bindings[], const char *names[], int nbBindings) override
Definition: sampleEngines.cpp:222
MNISTBatchStream::readDataFile
void readDataFile(const std::string &dataFilePath)
Definition: BatchStream.h:99
half_float::detail::nextafter
half nextafter(half from, half to)
Next representable value.
Definition: ieee_half.h:2684
nvinfer1::plugin::SpecialSlice::setPluginNamespace
void setPluginNamespace(const char *libNamespace) override
Set the namespace that this plugin object belongs to.
Definition: specialSlicePlugin.cpp:94
half_float::detail::functions::exp2
static expr exp2(float arg)
Binary exponential implementation.
Definition: samples/common/half.h:1787
nvinfer1::DataType::kINT8
@ kINT8
8-bit integer representing a quantized floating-point value.
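The kINT8 entry above describes an 8-bit integer that stands for a quantized floating-point value. A minimal sketch of that idea follows, assuming a symmetric per-tensor scale and clamping to the signed 8-bit range. The function names `quantize` and `dequantize` are hypothetical and are not part of the TensorRT API, and the exact rounding and clamping TensorRT applies is not stated in this listing.

```python
# Illustrative sketch only: `quantize` and `dequantize` are hypothetical helpers,
# not TensorRT functions. Assumes symmetric quantization with a single scale.

INT8_MIN = -128
INT8_MAX = 127


def quantize(x: float, scale: float) -> int:
    """Map a float to a signed 8-bit integer using a positive scale."""
    if scale <= 0:
        raise ValueError("scale must be positive")
    q = round(x / scale)  # Python rounds halves to the nearest even integer
    return max(INT8_MIN, min(INT8_MAX, q))  # saturate to the int8 range


def dequantize(q: int, scale: float) -> float:
    """Recover an approximate float from its quantized representation."""
    return q * scale


if __name__ == "__main__":
    scale = 0.5
    for value in (1.0, -3.2, 1000.0):
        q = quantize(value, scale)
        print(value, "->", q, "->", dequantize(q, scale))
```

Values outside the representable range saturate rather than wrap, so the choice of scale determines the trade-off between range and resolution.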
samplesCommon::TimerBase::reset
void reset() noexcept
Definition: samples/common/common.h:798
tools.test_run.TestTrtLegacy.test_trt_legacy_onnx
def test_trt_legacy_onnx(self)
Definition: test_run.py:56
nvinfer1::plugin::FlattenConcat::FlattenConcat
FlattenConcat()=delete
nvinfer1::plugin::PriorBox::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: priorBoxPlugin.cpp:143
half_float::detail::fabs
half fabs(half arg)
Absolute value.
Definition: ieee_half.h:2235
SampleUffPluginV2Ext::infer
bool infer()
Runs the TensorRT inference engine for this sample.
Definition: sampleUffPluginV2Ext.cpp:265
EntropyCalibratorImpl::mInputCount
size_t mInputCount
Definition: EntropyCalibrator.h:88
SampleMNISTAPIParams::weightsFile
std::string weightsFile
The filename of the weights file.
Definition: sampleMNISTAPI.cpp:51
model.WN.n_channels
n_channels
Definition: demo/Tacotron2/waveglow/model.py:94
nvinfer1::IDeconvolutionLayer::~IDeconvolutionLayer
virtual ~IDeconvolutionLayer()
Definition: NvInfer.h:2436
nmtSample::SLPAttention::mOutputChannelCount
int mOutputChannelCount
Definition: slpAttention.h:50
nvinfer1::TensorFormat::kDLA_HWC4
@ kDLA_HWC4
DLA image format.
nvinfer1::plugin::CropAndResizePlugin::setPluginNamespace
void setPluginNamespace(const char *pluginNamespace) override
Set the namespace that this plugin object belongs to.
Definition: cropAndResizePlugin.cpp:189
nvinfer1::ICudaEngine::getBindingDimensions
virtual Dims getBindingDimensions(int32_t bindingIndex) const noexcept=0
Get the dimensions of a binding.
SampleMNIST::constructNetwork
bool constructNetwork(SampleUniquePtr< nvcaffeparser1::ICaffeParser > &parser, SampleUniquePtr< nvinfer1::INetworkDefinition > &network)
Uses a Caffe parser to create the MNIST network and marks the output layers.
Definition: sampleMNIST.cpp:240
classification_flow.get_parser
def get_parser()
Definition: classification_flow.py:54
builder_varseqlen.BertConfig.__init__
def __init__(self, bert_config_path, use_fp16, use_int8, use_qat, interleaved)
Definition: builder_varseqlen.py:90
IBatchStream::getLabels
virtual float * getLabels()=0
nmtSample::BeamSearchPolicy::processTimestep
void processTimestep(int validSampleCount, const float *hCombinedLikelihoods, const int *hVocabularyIndices, const int *hRayOptionIndices, int *hSourceRayIndices, float *hSourceLikelihoods)
Definition: beamSearchPolicy.cpp:59
nvinfer1::CUDADriverWrapper::_cuFuncSetAttribute
CUresult(* _cuFuncSetAttribute)(CUfunction, CUfunction_attribute, int)
Definition: cudaDriverWrapper.h:71
SampleReformatFreeIO::infer
bool infer(SampleBuffer &inputBuf, SampleBuffer &outputBuf)
Runs the TensorRT inference engine for this sample.
Definition: sampleReformatFreeIO.cpp:393
nvonnxparser::IOnnxConfig::getModelDtype
virtual nvinfer1::DataType getModelDtype() const =0
Get the Model Data Type.
nvinfer1::plugin::SplitPlugin::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: split.h:106
nvinfer1::plugin::GroupNormalizationPlugin::getOutputDataType
DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: groupNormalizationPlugin.cpp:228
polygraphy.tools.surgeon.subtool
Definition: tools/Polygraphy/polygraphy/tools/surgeon/subtool/__init__.py:1
model.Encoder.__init__
def __init__(self, encoder_n_convolutions, encoder_embedding_dim, encoder_kernel_size)
Definition: demo/Tacotron2/tacotron2/model.py:179
nvinfer1::plugin::GridAnchorRectPluginCreator::~GridAnchorRectPluginCreator
~GridAnchorRectPluginCreator() override=default
pytorch_quantization.nn.modules._utils.QuantInputMixin
Definition: _utils.py:97
generate.output
output
Definition: 05_folding_constants/generate.py:36
samplesCommon::GenericBuffer
The GenericBuffer class is a templated class for buffers.
Definition: buffers.h:52
SampleMaskRCNN::processInput
bool processInput(const samplesCommon::BufferManager &buffers)
Definition: sampleUffMaskRCNN.cpp:481
nvinfer1::PluginFieldType::kINT32
@ kINT32
INT32 field type.
nvinfer1::ICudaEngine::hasImplicitBatchDimension
virtual bool hasImplicitBatchDimension() const =0
Query whether the engine was built with an implicit batch dimension.
onnx_graphsurgeon.ir.node.Node.__repr__
def __repr__(self)
Definition: node.py:112
nvcaffeparser1::IPluginFactory::isPlugin
virtual bool isPlugin(const char *layerName)=0
A user implemented function that determines if a layer configuration is provided by an IPlugin.
tests.quant_conv_test.TestQuantConv2D.test_fake_quant_per_tensor
def test_fake_quant_per_tensor(self)
Definition: quant_conv_test.py:129
polygraphy.backend.tf.loader.SessionFromGraph.__call__
def __call__(self)
Definition: backend/tf/loader.py:409
polygraphy.backend.trt.loader.BaseNetworkFromOnnx.__call__
def __call__(self)
Definition: backend/trt/loader.py:104
nvcaffeparser1::CaffeWeightFactory::getBlob
const trtcaffe::BlobProto * getBlob(const std::string &layerName, int index)
Definition: caffeWeightFactory.cpp:105
fp16::__half2float
float __half2float(__half h)
Definition: fp16.h:86
nvinfer1::INetworkDefinition::getNbOutputs
virtual int32_t getNbOutputs() const =0
Get the number of outputs in the network.
export_tacotron2_onnx.DecoderIter.linear_projection
linear_projection
Definition: export_tacotron2_onnx.py:128
nvinfer1::plugin::SplitPluginCreator::deserializePlugin
nvinfer1::IPluginV2DynamicExt * deserializePlugin(const char *, const void *serialData, size_t serialLength)
Called during deserialization of plugin layer.
Definition: split.h:164
SampleUffMNIST::build
bool build()
Builds the network engine.
Definition: sampleUffMNIST.cpp:115
perf.DeviceBuffer.binding
def binding(self)
Definition: perf.py:32
SampleMaskRCNN::verifyOutput
bool verifyOutput(const samplesCommon::BufferManager &buffers)
Definition: sampleUffMaskRCNN.cpp:578
sample::TrtCudaStream::mStream
cudaStream_t mStream
Definition: sampleDevice.h:100
nvinfer1::PluginFieldType::kINT16
@ kINT16
INT16 field type.
polygraphy.backend.onnxrt.runner
Definition: backend/onnxrt/runner.py:1
polygraphy.backend.trt.loader.ShapeTuple.__repr__
def __repr__(self)
Definition: backend/trt/loader.py:254
text.cleaners.transliteration_cleaners
def transliteration_cleaners(text)
Definition: cleaners.py:90
train.adjust_learning_rate
def adjust_learning_rate(iteration, epoch, optimizer, learning_rate, anneal_steps, anneal_factor, rank)
Definition: train.py:305
polygraphy.backend.trt.loader.NetworkFromOnnxPath
Definition: backend/trt/loader.py:140
Batch::~Batch
~Batch()
Definition: sampleMovieLensMPS.cpp:236
common.check_file_non_empty
def check_file_non_empty(path)
Definition: common.py:23
nvinfer1::plugin::InstanceNormalizationPlugin::_b_desc
cudnnTensorDescriptor_t _b_desc
Definition: instanceNormalizationPlugin.h:98
nvinfer1::Weights
An array of weights used as a layer parameter.
Definition: NvInferRuntime.h:77
polygraphy.tools.args.trt_legacy.TrtLegacyArgs.uff_order
uff_order
Definition: tools/args/trt_legacy.py:63
polygraphy.backend.base.runner.BaseRunner.__exit__
def __exit__(self, exc_type, exc_value, traceback)
Definition: backend/base/runner.py:71
nvinfer1::TensorFormat::kHWC8
@ kHWC8
Eight channel format where C is padded to a multiple of 8.
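The kHWC8 entry above says the channel dimension C is padded to a multiple of 8. The arithmetic can be sketched as follows. The helper names `pad_channels` and `hwc8_element_count` are hypothetical and only illustrate the rounding, and the sketch assumes a single image with no batch dimension.

```python
# Illustrative sketch only: these helpers are hypothetical, not TensorRT API calls.

def pad_channels(c: int, multiple: int = 8) -> int:
    """Round a channel count up to the next multiple of `multiple`."""
    if c < 0:
        raise ValueError("channel count must be non-negative")
    return -(-c // multiple) * multiple  # ceiling division, then scale back up


def hwc8_element_count(h: int, w: int, c: int) -> int:
    """Number of elements stored for one H x W x C image when C is padded to 8."""
    return h * w * pad_channels(c)


if __name__ == "__main__":
    for channels in (3, 8, 9):
        print(channels, "->", pad_channels(channels))
```

A 3-channel image therefore occupies the same number of elements as an 8-channel one under this layout.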
nvcaffeparser1::CaffeParser::parse
const IBlobNameToTensor * parse(const char *deploy, const char *model, nvinfer1::INetworkDefinition &network, nvinfer1::DataType weightType) override
Parse a prototxt file and a binaryproto Caffe model to extract network definition and weights associa...
Definition: caffeParser.cpp:341
sample::anonymous_namespace{sampleReporting.cpp}::traceToTiming
InferenceTime traceToTiming(const InferenceTrace &a)
Definition: sampleReporting.cpp:71
pytorch_quantization.calib.histogram.HistogramCalibrator.__repr__
def __repr__(self)
Definition: histogram.py:251
ProposalWorkSpace::preRefineBboxDims
nvinfer1::DimsHW preRefineBboxDims
Definition: maskRCNNKernels.h:128
sample::gLogInfo
LogStreamConsumer gLogInfo
Definition: logger.cpp:24
main.parse_args
def parse_args(parser)
Definition: main.py:21
half_float::detail::operator-
enable< expr, T, U >::type operator-(T x, U y)
Subtract halfs.
Definition: ieee_half.h:2178
profiler.main
def main()
Definition: profiler.py:157
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer.__init__
def __init__(self, quant_desc=QuantDescriptor(), disabled=False, if_quant=True, if_clip=False, if_calib=False)
Definition: tensor_quantizer.py:70
preprocess_audio2mel.main
def main()
Definition: preprocess_audio2mel.py:70
helpers.tokenization.BasicTokenizer._is_chinese_char
def _is_chinese_char(self, cp)
Definition: tokenization.py:294
prn_utils.printHeader
def printHeader(allFeatures, selection, gp=False, count=False)
Definition: prn_utils.py:42
polygraphy.backend.onnx.loader.ModifyOnnx
Definition: backend/onnx/loader.py:107
polygraphy.backend.trt_legacy.TrtLegacyRunner.fp16
fp16
Definition: backend/trt_legacy.py:217
backend.onnx.test_runner.TestOnnxTfRunner.test_can_name_runner
def test_can_name_runner(self)
Definition: onnx/test_runner.py:22
LogStreamConsumer
Convenience object used to facilitate use of C++ stream syntax when logging messages....
Definition: pluginLogging.h:129
nvinfer1::EnumMax< DataType >
constexpr int32_t EnumMax< DataType >()
Maximum number of elements in DataType enum.
Definition: NvInferRuntimeCommon.h:156
polygraphy.tools.surgeon.subtool.extract.Extract.__init__
def __init__(self)
Definition: extract.py:28
nvinfer1::IDeconvolutionLayer
A deconvolution layer in a network definition.
Definition: NvInfer.h:2283
half_float::detail::functions::isinf
static bool isinf(half arg)
Classification implementation.
Definition: samples/common/half.h:2351
nvinfer1
The TensorRT API version 1 namespace.
SampleUffMNIST::mParams
samplesCommon::UffSampleParams mParams
Definition: sampleUffMNIST.cpp:101
sample::InferenceTime::operator=
InferenceTime & operator=(const InferenceTime &)=default
nvinfer1::plugin::SplitPlugin::getOutputDimensions
nvinfer1::DimsExprs getOutputDimensions(int outputIndex, const nvinfer1::DimsExprs *inputs, int nbInputs, nvinfer1::IExprBuilder &exprBuilder) override
nvinfer1::plugin::ResizeNearest::getWorkspaceSize
size_t getWorkspaceSize(int) const override
Definition: resizeNearestPlugin.cpp:126
nvonnxparser::IParser::supportsModel
virtual bool supportsModel(void const *serialized_onnx_model, size_t serialized_onnx_model_size, SubGraphCollection_t &sub_graph_collection)=0
Check whether TensorRT supports a particular ONNX model.
helpers.calibrator.BertCalibrator.doc_stride
doc_stride
Definition: demo/BERT/helpers/calibrator.py:42
SampleCharRNNBindingNames::HIDDEN_IN_BLOB_NAME
const char * HIDDEN_IN_BLOB_NAME
Definition: sampleCharRNN.cpp:88
nvinfer1::plugin::ProposalPlugin::detachFromContext
void detachFromContext() override
Detach the plugin object from its execution context.
Definition: proposalPlugin.cpp:315
nvinfer1::IPluginCreator::getPluginNamespace
virtual const char * getPluginNamespace() const =0
Return the namespace of the plugin creator object.
nvinfer1::IPluginV2IOExt::supportsFormatCombination
virtual bool supportsFormatCombination(int32_t pos, const PluginTensorDesc *inOut, int32_t nbInputs, int32_t nbOutputs) const =0
Return true if plugin supports the format and datatype for the input/output indexed by pos.
backend.trt.test_loader.TestOnnxNetworkLoader.test_loader
def test_loader(self)
Definition: trt/test_loader.py:90
onnx_graphsurgeon.logger.logger.LoggerSuppress
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:41
nvinfer1::CUDADriverWrapper::_cuLinkAddData
CUresult(* _cuLinkAddData)(CUlinkState, CUjitInputType, void *, size_t, const char *, unsigned int, CUjit_option *, void **)
Definition: cudaDriverWrapper.h:79
tools.test_run.TestTf.test_tf_save_tensorboard
def test_tf_save_tensorboard(self)
Definition: test_run.py:155
nvinfer1::plugin::SplitPlugin
Definition: split.h:38
nmtSample::Decoder
Encodes a single input into output states.
Definition: decoder.h:33
backend.onnx.test_loader.TestSaveOnnx
Definition: onnx/test_loader.py:74
polygraphy.tools.args.model.ModelArgs.parse
def parse(self, args)
Definition: tools/Polygraphy/polygraphy/tools/args/model.py:46
test_ir.TestNodeIO.test_insert
def test_insert(self, field_names)
Definition: test_ir.py:245
tests.quant_pooling_test.TestQuantAvgPool3d
Definition: quant_pooling_test.py:197
polygraphy.backend.onnx.loader.OnnxFromTfGraph.optimize
optimize
Definition: backend/onnx/loader.py:70
nvinfer1::plugin::DetectionLayer::mNbClasses
int mNbClasses
Definition: detectionLayerPlugin.h:94
samplesCommon::TypedHostMemory::TypedHostMemory
TypedHostMemory(std::size_t size)
Definition: samples/common/common.h:338
model.Encoder
Definition: demo/Tacotron2/tacotron2/model.py:174
SampleUffSSD::teardown
bool teardown()
Cleans up any state created in the sample class.
Definition: sampleUffSSD.cpp:274
half_float::detail::isgreater
bool isgreater(half x, half y)
Comparison for greater than.
Definition: ieee_half.h:2773
polygraphy.tools.args.data_loader.DataLoaderArgs.add_to_script
def add_to_script(self, script)
Definition: tools/args/data_loader.py:66
nvinfer1::EnumMax< MatrixOperation >
constexpr int32_t EnumMax< MatrixOperation >()
Maximum number of elements in MatrixOperation enum.
Definition: NvInfer.h:4156
polygraphy.backend.tf.loader.UseTfTrt.minimum_segment_size
minimum_segment_size
Definition: backend/tf/loader.py:231
polygraphy.tools.args.base.BaseArgs.register
def register(self, maker)
Definition: args/base.py:42
polygraphy.backend.tf.runner.TfRunner._sess
_sess
Definition: backend/tf/runner.py:51
test_ir.TestNode.setup_method
def setup_method(self)
Definition: test_ir.py:135
loss_function.WaveGlowLoss.__init__
def __init__(self, sigma=1.0)
Definition: waveglow/loss_function.py:20
nmtSample::Vocabulary::add
void add(const std::string &token)
Add a new token to the vocabulary; the ID is auto-generated.
Definition: vocabulary.cpp:34
bert::FusedMultiHeadAttentionKernelMetaInfoV1::mThreadsPerCTA
unsigned int mThreadsPerCTA
Definition: fused_multihead_attention.h:125
polygraphy.tools.args.tf.runner
Definition: tools/args/tf/runner.py:1
example.W
W
Definition: onnx-graphsurgeon/examples/02_creating_a_model_with_initializer/example.py:24
SampleErrorRecorder::errorStack
std::vector< errorPair > errorStack
Definition: ErrorRecorder.h:40
SamplePlugin::SamplePlugin
SamplePlugin(const samplesCommon::CaffeSampleParams &params)
Definition: samplePlugin.cpp:61
perf_varseqlen.DeviceBuffer.buf
buf
Definition: perf_varseqlen.py:30
BatchStream::BatchStream
BatchStream(int batchSize, int maxBatches, const nvinfer1::Dims &dims, const std::string &listFile, const std::vector< std::string > &directories)
Definition: frcnnUtils.h:227
polygraphy.tools.args.trt.runner.TrtRunnerArgs.trt_loader_args
trt_loader_args
Definition: tools/args/trt/runner.py:39
pytorch_quantization.nn.modules.quant_instancenorm.QuantInstanceNorm3d.__init__
def __init__(self, int num_features, float eps=1e-5, float momentum=0.1, bool affine=False, bool track_running_stats=False, **kwargs)
Definition: quant_instancenorm.py:68
nvinfer1::plugin::BatchTilePlugin::supportsFormat
bool supportsFormat(DataType type, PluginFormat format) const override
Check format support.
Definition: batchTilePlugin.cpp:147
sample::helpHelp
void helpHelp(std::ostream &os)
Definition: sampleOptions.cpp:1300
nvinfer1::plugin::detail::div_up
__host__ __device__ int div_up(int a, int b)
Definition: reducedMathPlugin.h:47
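The name div_up suggests ceiling division, a helper typically used to compute how many blocks of size b are needed to cover a elements. The entry above gives no description, so the following is only a minimal sketch of that conventional meaning, written in Python for illustration and assuming a non-negative a and a positive b. It is not taken from reducedMathPlugin.h.

```python
def div_up(a: int, b: int) -> int:
    """Ceiling division: smallest integer q such that q * b >= a.

    Assumes a >= 0 and b > 0 (an assumption of this sketch, not a
    documented contract of the plugin helper).
    """
    if a < 0 or b <= 0:
        raise ValueError("div_up sketch expects a >= 0 and b > 0")
    # Adding b - 1 before flooring rounds any non-zero remainder up.
    return (a + b - 1) // b
```

For example, covering 10 elements with blocks of 4 would need div_up(10, 4) blocks under this definition.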
FCPlugin::initialize
int initialize() override
Initialize the layer for execution.
Definition: samples/opensource/samplePlugin/fcPlugin.h:127
nvinfer1::IPlugin::configure
virtual void configure(const Dims *inputDims, int32_t nbInputs, const Dims *outputDims, int32_t nbOutputs, int32_t maxBatchSize)=0
Configure the layer.
nvcaffeparser1::BinaryProtoBlob::destroy
void destroy() override
Definition: binaryProtoBlob.h:51
SampleDynamicReshape
The SampleDynamicReshape class implements the dynamic reshape sample.
Definition: sampleDynamicReshape.cpp:45
prn_utils.printCsv
def printCsv(data, count=False)
Definition: prn_utils.py:61
nvinfer1::plugin::DetectionLayer::attachToContext
void attachToContext(cudnnContext *cudnnContext, cublasContext *cublasContext, IGpuAllocator *gpuAllocator) override
Attach the plugin object to an execution context and grant the plugin the access to some context reso...
Definition: detectionLayerPlugin.cpp:307
samplesCommon::vPPM
Definition: samples/common/common.h:660
model.Decoder.prenet
prenet
Definition: demo/Tacotron2/tacotron2/model.py:254
samplesCommon::TimerBase::start
virtual void start()
Definition: samples/common/common.h:784
bert::SkipLayerNormInterleavedPlugin::getOutputDataType
nvinfer1::DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: skipLayerNormInt8InterleavedPlugin.cpp:201
BatchStream::BatchStream
BatchStream(int batchSize, int maxBatches, const std::string &prefix, const std::string &suffix, const std::vector< std::string > &directories)
Definition: frcnnUtils.h:194
polygraphy.tools.inspect.subtool.model.Model
Definition: tools/Polygraphy/polygraphy/tools/inspect/subtool/model.py:23
sample::Logger::reportWaive
static int reportWaive(const TestAtom &testAtom)
Definition: samples/common/logging.h:359
nvinfer1::plugin::ResizeNearest::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: resizeNearestPlugin.cpp:137
nvinfer1::plugin::CoordConvACPlugin::supportsFormat
bool supportsFormat(DataType type, PluginFormat format) const override
Check format support.
Definition: coordConvACPlugin.cpp:124
nvinfer1::INetworkDefinition::hasImplicitBatchDimension
virtual bool hasImplicitBatchDimension() const =0
Query whether the network was created with an implicit batch dimension.
polygraphy.tools.args.comparator.ComparatorCompareArgs.atol
atol
Definition: tools/args/comparator.py:106
SampleMNISTAPI::processInput
bool processInput(const samplesCommon::BufferManager &buffers)
Reads the input and stores the result in a managed buffer.
Definition: sampleMNISTAPI.cpp:304
tools.test_run.TestOther.test_top_k
def test_top_k(self)
Definition: test_run.py:239
nvinfer1::plugin::ProposalLayer::destroy
void destroy() override
Destroy the plugin object.
Definition: proposalLayerPlugin.cpp:155
nvinfer1::plugin::PriorBoxParameters::flip
bool flip
Definition: NvInferPluginUtils.h:95
polygraphy.comparator.data_loader.DataLoader.float_range
float_range
Definition: comparator/data_loader.py:90
anonymous_namespace{specialSlicePlugin.cpp}::SPECIALSLICE_PLUGIN_VERSION
const char * SPECIALSLICE_PLUGIN_VERSION
Definition: specialSlicePlugin.cpp:28
half_float::detail::half_caster< T, half, R >::cast_impl
static T cast_impl(half arg, false_type)
Definition: samples/common/half.h:2654
onnx_graphsurgeon.ir.tensor.Variable.to_constant
def to_constant(self, np.ndarray values)
Definition: tensor.py:171
nvcaffeparser1::IBinaryProtoBlob::~IBinaryProtoBlob
virtual ~IBinaryProtoBlob()
Definition: NvCaffeParser.h:80
tests.tensor_quantizer_test.TestTensorQuantizer.test_per_channel_scale
def test_per_channel_scale(self, verbose)
Definition: tensor_quantizer_test.py:67
nvinfer1::plugin::reduced_divisor::mod
__host__ __device__ __forceinline__ int mod(int x) const
Definition: reducedMathPlugin.h:76
BatchStream::mMaxBatches
int mMaxBatches
Definition: BatchStream.h:374
sample::anonymous_namespace{sampleReporting.cpp}::findPercentile
float findPercentile(float percentage, const std::vector< InferenceTime > &timings, const T &toFloat)
Find percentile in an ascending sequence of timings.
Definition: sampleReporting.cpp:40
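Finding a percentile in an ascending sequence can be illustrated with the nearest-rank method. The sketch below is a Python illustration of that general technique, not the implementation in sampleReporting.cpp; the function name find_percentile and the default to_float converter are hypothetical stand-ins for the C++ parameters listed above.

```python
import math
from typing import Callable, Sequence, TypeVar

T = TypeVar("T")


def find_percentile(
    percentage: float,
    timings: Sequence[T],
    to_float: Callable[[T], float] = float,
) -> float:
    """Nearest-rank percentile of an already ascending sequence.

    `timings` must be sorted in ascending order; `to_float` extracts
    the value to report from each element (hypothetical helper).
    """
    if not timings:
        raise ValueError("timings must not be empty")
    if not 0 <= percentage <= 100:
        raise ValueError("percentage must be in [0, 100]")
    # Nearest-rank: the smallest 1-based rank covering `percentage` percent
    # of the samples, clamped so that 0% maps to the first element.
    rank = max(math.ceil(percentage / 100 * len(timings)), 1)
    return to_float(timings[rank - 1])
```

Because the input is assumed to be sorted, the lookup is a single index computation; sorting is left to the caller.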
nmtSample::DebugUtil::DumpTensorPlugin::mData
PinnedHostBuffer< float >::ptr mData
Definition: debugUtil.h:72
SampleUffPluginV2Ext::mParams
samplesCommon::UffSampleParams mParams
Definition: sampleUffPluginV2Ext.cpp:334
SampleReformatFreeIO::mParams
samplesCommon::CaffeSampleParams mParams
Definition: sampleReformatFreeIO.cpp:262
pytorch_quantization.tensor_quant.FakeTensorQuantFunction
Definition: tensor_quant.py:300
util.test_cuda.TestStream.test_handle_is_ctypes_ptr
def test_handle_is_ctypes_ptr(self)
Definition: test_cuda.py:102
models.meta.Model.__init__
def __init__(self, path, LoaderType, check_runner, input_metadata=None)
Definition: meta.py:31
layers.LinearNorm.forward
def forward(self, x)
Definition: layers.py:32
nvinfer1::plugin::RPROIPlugin::RPROIPlugin
RPROIPlugin(RPROIParams params, const float *anchorsRatios, const float *anchorsScales)
Definition: nvFasterRCNNPlugin.cpp:37
tests.tensor_quant_test.TestTensorQuant.test_backward
def test_backward(self)
Definition: tensor_quant_test.py:84
polygraphy.backend.onnxrt.runner.OnnxrtRunner.__init__
def __init__(self, sess, name=None)
Definition: backend/onnxrt/runner.py:29
tests.quant_pooling_test.TestQuantMaxPool2d.test_input_fake_quant
def test_input_fake_quant(self)
Definition: quant_pooling_test.py:69
nvinfer1::IConstantLayer
Layer that represents a constant value.
Definition: NvInfer.h:4270
half_float::detail::functions::tgamma
static expr tgamma(float arg)
Gamma implementation.
Definition: samples/common/half.h:2052
sample::AllOptions
Definition: sampleOptions.h:186
inference_perf.parse_args
def parse_args(parser)
Definition: inference_perf.py:31
nmtSample::DebugUtil::DumpTensorPlugin::getSerializationSize
size_t getSerializationSize() override
Find the size of the serialization buffer required.
Definition: debugUtil.cpp:112
MaskRCNNUtils::PPM::w
int w
Definition: sampleUffMaskRCNN.cpp:75
nvinfer1::IInt8LegacyCalibrator::getAlgorithm
CalibrationAlgoType getAlgorithm() override
Signal that this is the legacy calibrator.
Definition: NvInfer.h:6114
bert::fused_multihead_attention_v2_fp16_256_64_kernel_sm75_cubin_len
unsigned int fused_multihead_attention_v2_fp16_256_64_kernel_sm75_cubin_len
Definition: fused_multihead_attention_v2_fp16_256_64_kernel.sm75.cpp:5789
BertInference::run
void run(const void *inputIds, const void *segmentIds, const void *inputMask, int warmUps, int iterations)
Definition: bert_infer.h:283
bert::EmbLayerNormVarSeqlenPlugin::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: embLayerNormVarSeqlenPlugin.cpp:362
nvinfer1::IPluginV2IOExt::int32_t
int32_t
Definition: NvInferRuntimeCommon.h:792
nvinfer1::plugin::GridAnchorBasePluginCreator::deserializePlugin
IPluginV2Ext * deserializePlugin(const char *name, const void *serialData, size_t serialLength) override
Called during deserialization of plugin layer.
Definition: gridAnchorPlugin.cpp:487
nvinfer1::plugin::GridAnchorBasePluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: gridAnchorPlugin.cpp:364
model.Attention.v
v
Definition: demo/Tacotron2/tacotron2/model.py:55
nmtSample::SLPEmbedder::mNumInputs
int mNumInputs
Definition: slpEmbedder.h:53
bert::EmbLayerNormVarSeqlenPluginCreator::setPluginNamespace
void setPluginNamespace(const char *pluginNamespace) override
Set the namespace of the plugin creator based on the plugin library it belongs to.
Definition: embLayerNormVarSeqlenPlugin.cpp:514
test_examples.test_cli_examples
def test_cli_examples(example)
Definition: Polygraphy/tests/test_examples.py:118
nvinfer1::IHostMemory::~IHostMemory
virtual ~IHostMemory()
Definition: NvInferRuntime.h:103
nvinfer1::plugin::PriorBox::maxSize
Weights maxSize
Definition: priorBoxPlugin.h:94
half_float::detail::functions::nextafter
static half nextafter(half from, half to)
Enumeration implementation.
Definition: samples/common/half.h:2287
half_float::half::operator-=
detail::enable< half &, T >::type operator-=(T rhs)
Arithmetic assignment.
Definition: samples/common/half.h:1451
nvinfer1::plugin::GenerateDetection::destroy
void destroy() override
Destroy the plugin object.
Definition: generateDetectionPlugin.cpp:155
nvinfer1::plugin::BatchedNMSDynamicPlugin::boxesSize
int boxesSize
Definition: batchedNMSPlugin.h:113
tools.test_run.TestTf.test_tftrt
def test_tftrt(self)
Definition: test_run.py:172
sample::TrtCudaGraph::mGraphExec
cudaGraphExec_t mGraphExec
Definition: sampleDevice.h:206
anonymous_namespace{regionPlugin.cpp}::REGION_PLUGIN_NAME
const char * REGION_PLUGIN_NAME
Definition: regionPlugin.cpp:28
nvinfer1::IInt8Calibrator
Application-implemented interface for calibration.
Definition: NvInfer.h:5998
ProposalWorkSpace::totalSize
size_t totalSize
Definition: maskRCNNKernels.h:152
nvonnxparser::ErrorCode
ErrorCode
Definition: NvOnnxParser.h:66
tests.quant_instancenorm_test.TestQuantInstanceNorm2D.test_fake_quant_per_channel
def test_fake_quant_per_channel(self)
Definition: quant_instancenorm_test.py:130
nvinfer1::plugin::MultilevelProposeROI::mAnchorBoxesDevice
std::vector< std::shared_ptr< CudaBind< float > > > mAnchorBoxesDevice
Definition: multilevelProposeROIPlugin.h:105
nvinfer1::plugin::GroupNormalizationPlugin::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: groupNormalizationPlugin.cpp:167
evaluate-v1.file
file
Definition: evaluate-v1.1.py:110
SampleMaskRCNNParams::maskThreshold
float maskThreshold
Definition: sampleUffMaskRCNN.cpp:307
nvinfer1::IRuntime::getErrorRecorder
virtual IErrorRecorder * getErrorRecorder() const noexcept=0
Get the ErrorRecorder assigned to this interface.
nvinfer1::plugin::GenerateDetection::mAnchorsCnt
int mAnchorsCnt
Definition: generateDetectionPlugin.h:101
pytorch_quantization.nn.modules.quant_conv.QuantConvTranspose1d.__init__
def __init__(self, in_channels, out_channels, kernel_size, stride=1, padding=0, output_padding=0, groups=1, bias=True, dilation=1, padding_mode='zeros', **kwargs)
Definition: quant_conv.py:274
polygraphy.tools.args.onnx.loader.OnnxLoaderArgs.__init__
def __init__(self, write=True, outputs=True, shape_inference_default=None)
Definition: tools/args/onnx/loader.py:23
SampleINT8::build
bool build(DataType dataType)
Function builds the network engine.
Definition: sampleINT8.cpp:127
sample::IOFormat
std::pair< nvinfer1::DataType, nvinfer1::TensorFormats > IOFormat
Definition: sampleOptions.h:65
half_float::detail::atan2
expr atan2(half x, half y)
Arc tangent function.
Definition: ieee_half.h:2460
comparator.test_compare.TestBasicCompareFunc.test_per_output_tol
def test_per_output_tol(self, mode)
Definition: test_compare.py:34
nvuffparser::createUffParser
IUffParser * createUffParser()
Creates an IUffParser object.
nvinfer1::EngineCapability::kDEFAULT
@ kDEFAULT
Full capability, TensorRT mode without any restrictions using TensorRT nvinfer1 APIs.
FCPlugin::supportsFormat
bool supportsFormat(nvinfer1::DataType type, nvinfer1::PluginFormat format) const override
Check format support.
Definition: samples/opensource/samplePlugin/fcPlugin.h:107
nvinfer1::Weights::count
int64_t count
The number of weights in the array.
Definition: NvInferRuntime.h:82
pytorch_quantization.nn.modules.quant_conv.QuantConv1d.__init__
def __init__(self, in_channels, out_channels, kernel_size, stride=1, padding=0, dilation=1, groups=1, bias=True, padding_mode='zeros', **kwargs)
Definition: quant_conv.py:183
nvinfer1::DimensionOperation::kFLOOR_DIV
@ kFLOOR_DIV
Floor division of the first element by the second.
SamplePlugin::mMeanBlob
SampleUniquePtr< nvcaffeparser1::IBinaryProtoBlob > mMeanBlob
The mean blob, which needs to be kept around until build time.
Definition: samplePlugin.cpp:112
nvinfer1::IBuilder::buildEngineWithConfig
virtual nvinfer1::ICudaEngine * buildEngineWithConfig(INetworkDefinition &network, IBuilderConfig &config)=0
Builds an engine for the given INetworkDefinition and given IBuilderConfig.
MaskRCNNUtils::PPM::magic
std::string magic
Definition: sampleUffMaskRCNN.cpp:74
SampleConfig::InputDataFormat::kASCII
@ kASCII
Logger::TestResult::kWAIVED
@ kWAIVED
The test was waived.
nvinfer1::plugin::CoordConvACPlugin::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: coordConvACPlugin.cpp:71
text._should_keep_symbol
def _should_keep_symbol(s)
Definition: demo/Tacotron2/tacotron2/text/__init__.py:73
polygraphy.backend.tf.util.load_graph
def load_graph(path)
Definition: tf/util.py:23
onnx_graphsurgeon.ir.tensor.Variable.inputs
inputs
Definition: tensor.py:165
polygraphy.tools.args.trt.loader.TrtLoaderArgs.explicit_precision
explicit_precision
Definition: tools/args/trt/loader.py:89
nvinfer1::plugin::PriorBoxPluginCreator::~PriorBoxPluginCreator
~PriorBoxPluginCreator() override
Definition: priorBoxPlugin.cpp:363
nvinfer1::plugin::MultilevelCropAndResize::mThresh
float mThresh
Definition: multilevelCropAndResizePlugin.h:98
nmtSample::MultiplicativeAlignment::mWeights
ComponentWeights::ptr mWeights
Definition: multiplicativeAlignment.h:51
nvinfer1::plugin::RPROIPlugin::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: nvFasterRCNNPlugin.cpp:182
pytorch_quantization.tensor_quant.ScaledQuantDescriptor.num_bits
def num_bits(self)
Definition: tensor_quant.py:112
inference.unwrap_distributed
def unwrap_distributed(state_dict)
Definition: Tacotron2/inference.py:86
export_waveglow_onnx.convert_conv_1d_to_2d
def convert_conv_1d_to_2d(conv1d)
Definition: export_waveglow_onnx.py:54
sample::InferenceOptions::overlap
bool overlap
Definition: sampleOptions.h:154
nvinfer1::plugin::InstanceNormalizationPluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: instanceNormalizationPlugin.cpp:345
nvinfer1::plugin::ResizeNearest::clone
IPluginV2Ext * clone() const override
Clone the plugin object.
Definition: resizeNearestPlugin.cpp:175
polygraphy.util.misc.try_send_on_queue
def try_send_on_queue(queue, obj)
Definition: Polygraphy/polygraphy/util/misc.py:227
TLTMaskRCNNConfig::POOL_SIZE
static const int POOL_SIZE
Definition: tlt_mrcnn_config.h:30
nvinfer1::plugin::CoordConvACPluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: coordConvACPlugin.h:111
AlgorithmCacheReader::areSame
static bool areSame(const AlgorithmCacheItem &algoCacheItem, const IAlgorithm &algoChoice)
The combination of implementation, tactic and input/output formats is unique to an algorithm,...
Definition: sampleAlgorithmSelector.cpp:247
onnx_graphsurgeon.ir.graph.Graph.cleanup
def cleanup(self, remove_unused_node_outputs=False)
Definition: graph.py:198
nvinfer1::plugin::DetectionOutput::param
DetectionOutputParameters param
Definition: nmsPlugin.h:89
nvinfer1::ICudaEngine
An engine for executing inference on a built network, with functionally unsafe features.
Definition: NvInferRuntime.h:1157
test_ir.tensors_linear_graph
def tensors_linear_graph()
Definition: test_ir.py:319
nvinfer1::TensorLocation::kHOST
@ kHOST
Data stored on host.
nvinfer1::IConstantLayer::~IConstantLayer
virtual ~IConstantLayer()
Definition: NvInfer.h:4310
nvinfer1::plugin::LReLU::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: lReluPlugin.cpp:67
test_importers.TestOnnxImporter.test_import_graph_value_info
def test_import_graph_value_info(self)
Definition: test_importers.py:98
backend.test_tensorrt_legacy.test_can_construct_onnx_loader
def test_can_construct_onnx_loader()
Definition: test_tensorrt_legacy.py:37
nvinfer1::plugin::MultilevelProposeROI::mNameSpace
std::string mNameSpace
Definition: multilevelProposeROIPlugin.h:117
tests.fixtures.verbose
def verbose(request)
Definition: tools/pytorch-quantization/tests/fixtures/__init__.py:20
util.test_cuda.ResizeTestCase.new
new
Definition: test_cuda.py:29
LogStreamConsumerBuffer::mShouldLog
bool mShouldLog
Definition: pluginLogging.h:100
copyright-scan.update
def update(filename, args)
Definition: copyright-scan.py:109
polygraphy.backend.tf.loader.UseTfTrt.__call__
def __call__(self)
Definition: backend/tf/loader.py:235
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer._axis
_axis
Definition: tensor_quantizer.py:76
nvinfer1::plugin::FlattenConcat::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: flattenConcat.cpp:269
polygraphy.backend.trt.util.mark_outputs
def mark_outputs(network, outputs)
Definition: trt/util.py:197
pytorch_quantization.nn.modules.quant_rnn.QuantRNNCellBase.extra_repr
def extra_repr(self)
Definition: modules/quant_rnn.py:310
nvinfer1::IBuilderConfig::setFlag
virtual void setFlag(BuilderFlag builderFlag)=0
Set a single build mode flag.
nvinfer1::plugin::NMSPluginCreator::createPlugin
IPluginV2Ext * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: nmsPlugin.cpp:299
sample::BuildOptions::builderCache
bool builderCache
Definition: sampleOptions.h:119
polygraphy.tools.args.tf.loader.TfLoaderArgs.outputs
outputs
Definition: tools/args/tf/loader.py:74
nvinfer1::plugin::MultilevelProposeROI::mKeepTopK
int mKeepTopK
Definition: multilevelProposeROIPlugin.h:96
SampleCharRNNLoop::LstmIO::data
nvinfer1::ITensor * data
Definition: sampleCharRNN.cpp:243
nvinfer1::plugin::GenerateDetection::mType
nvinfer1::DataType mType
Definition: generateDetectionPlugin.h:103
nvinfer1::plugin::DetectionOutput::mPluginNamespace
std::string mPluginNamespace
Definition: nmsPlugin.h:91
nvcaffeparser1::WeightType::kVARIANCE
@ kVARIANCE
nmtSample::MultiplicativeAlignment::mOutputChannelCount
int mOutputChannelCount
Definition: multiplicativeAlignment.h:54
nvinfer1::plugin::GroupNormalizationPlugin::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: groupNormalizationPlugin.cpp:181
nvinfer1::UnaryOperation::kATANH
@ kATANH
Inverse hyperbolic tangent.
EntropyCalibratorImpl::mCalibrationCache
std::vector< char > mCalibrationCache
Definition: EntropyCalibrator.h:93
polygraphy.tools.args.trt.runner.TrtRunnerArgs.register
def register(self, maker)
Definition: tools/args/trt/runner.py:32
nvcaffeparser1::BlobNameToTensor::isOK
bool isOK()
Definition: blobNameToTensor.h:61
nvinfer1::plugin::Region::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: regionPlugin.cpp:383
SampleConfig::mCalibrationFilename
std::string mCalibrationFilename
Definition: sampleConfig.h:43
bert::EmbLayerNormVarSeqlenPlugin::supportsFormatCombination
bool supportsFormatCombination(int pos, const nvinfer1::PluginTensorDesc *inOut, int nbInputs, int nbOutputs) override
Definition: embLayerNormVarSeqlenPlugin.cpp:174
tools.test_run.TestOnnxrt.test_tf2onnx_save_onnx
def test_tf2onnx_save_onnx(self)
Definition: test_run.py:186
SampleUffFasterRcnn::SampleUffFasterRcnn
SampleUffFasterRcnn(const SampleUffFasterRcnnParams &params)
Definition: sampleUffFasterRCNN.cpp:99
SampleSSD::mPPMs
std::vector< samplesCommon::PPM< 3, 300, 300 > > mPPMs
PPMs of test images.
Definition: sampleSSD.cpp:92
half_float::detail::result< half, half >::type
half type
Definition: ieee_half.h:394
test_ir.TestNode.test_equals_name_mismatch
def test_equals_name_mismatch(self)
Definition: test_ir.py:143
polygraphy.backend.pyt.runner.PytRunner._model
_model
Definition: backend/pyt/runner.py:38
model.Attention.score_mask_value
score_mask_value
Definition: demo/Tacotron2/tacotron2/model.py:59
nvinfer1::UnaryOperation::kEXP
@ kEXP
Exponentiation.
polygraphy.backend.onnx.loader.OnnxFromPath
Definition: backend/onnx/loader.py:24
test_examples.Example.__str__
def __str__(self)
Definition: Polygraphy/tests/test_examples.py:87
SampleSSD::mParams
SampleSSDParams mParams
The parameters for the sample.
Definition: sampleSSD.cpp:88
model.Attention.memory_layer
memory_layer
Definition: demo/Tacotron2/tacotron2/model.py:53
SampleBuffer::destroy
void destroy()
Definition: sampleReformatFreeIO.cpp:174
nvinfer1::plugin::TRTException::log
virtual void log(std::ostream &logStream) const
Definition: checkMacrosPlugin.cpp:112
BatchStream::BatchStream
BatchStream(int batchSize, int maxBatches, nvinfer1::Dims dims, std::string listFile, std::vector< std::string > directories)
Definition: BatchStream.h:191
samplesCommon::HostMemory::size
std::size_t size() const noexcept override
The size in bytes of the data that was allocated.
Definition: samples/common/common.h:314
nvinfer1::ITensor::~ITensor
virtual ~ITensor()
Definition: NvInfer.h:608
nvinfer1::plugin::DetectionOutputParameters::keepTopK
int32_t keepTopK
Definition: NvInferPluginUtils.h:184
helpers.calibrator.BertCalibrator.write_histogram_cache
def write_histogram_cache(self, ptr, length)
Definition: demo/BERT/helpers/calibrator.py:110
helpers.tokenization.BasicTokenizer.__init__
def __init__(self, do_lower_case=True)
Definition: tokenization.py:218
polygraphy.backend.base.runner.BaseRunner.inference_time
inference_time
Definition: backend/base/runner.py:45
nvinfer1::DynamicPluginTensorDesc
Definition: NvInferRuntime.h:387
test_struct.TestTensorMetadata.test_str_no_shape
def test_str_no_shape(self)
Definition: common/test_struct.py:31
models.batchnorm_to_float
def batchnorm_to_float(module)
Definition: demo/Tacotron2/models.py:37
onnx_graphsurgeon.logger.logger.Logger.error
def error(self, message, mode=LogMode.EACH)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:239
nvinfer1::utility::TRTParsedArgs::values
std::vector< std::pair< int, std::vector< std::string > > > values
Definition: getOptions.h:60
SampleINT8APIParams::dataDirs
std::vector< std::string > dataDirs
Definition: sampleINT8API.cpp:63
sample::BuildOptions::minTiming
int minTiming
Definition: sampleOptions.h:110
nmtSample::BenchmarkWriter::~BenchmarkWriter
~BenchmarkWriter() override=default
nvinfer1::plugin::NMSParameters::backgroundLabelId
int32_t backgroundLabelId
Definition: NvInferPluginUtils.h:250
polygraphy.backend.base.runner.BaseRunner
Definition: backend/base/runner.py:23
sample::anonymous_namespace{sampleInference.cpp}::Iteration::wait
void wait(EventType e, StreamType s)
Definition: sampleInference.cpp:413
bert::QKVToContextInterleavedPlugin::QKVToContextInterleavedPlugin
QKVToContextInterleavedPlugin()=delete
builder_varseqlen.load_onnx_weights_and_quant
def load_onnx_weights_and_quant(path, config)
Definition: builder_varseqlen.py:456
polygraphy.tools.args.tf.loader.TfLoaderArgs.parse
def parse(self, args)
Definition: tools/args/tf/loader.py:72
tools.test_args.TestParseMeta.test_parse_shape_dtype_auto
def test_parse_shape_dtype_auto(self)
Definition: test_args.py:34
pytorch_quantization.nn.modules.quant_rnn.QuantRNNBase._all_weights
_all_weights
Definition: modules/quant_rnn.py:74
polygraphy.backend.base.runner.BaseRunner.name
name
Definition: backend/base/runner.py:44
nvinfer1::IRefitter::getDynamicRangeMax
virtual float getDynamicRangeMax(const char *tensorName) const =0
Get maximum of dynamic range.
infer
Definition: infer.py:1
onnx_graphsurgeon.importers.base_importer
Definition: base_importer.py:1
MaskRCNNUtils::BBoxInfo::prob
float prob
Definition: sampleUffMaskRCNN.cpp:66
polygraphy.tools.args.comparator.ComparatorCompareArgs.no_shape_check
no_shape_check
Definition: tools/args/comparator.py:104
helpers.tokenization.BertTokenizer.tokenize
def tokenize(self, text)
Definition: tokenization.py:194
polygraphy.tools.args.comparator.ComparatorCompareArgs.top_k
top_k
Definition: tools/args/comparator.py:110
half_float::detail::half2int_up
T half2int_up(uint16 value)
Convert half-precision floating point to integer using round-to-nearest-away-from-zero.
Definition: ieee_half.h:1001
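Round-to-nearest-away-from-zero differs from the round-half-to-even rule used by Python's built-in round() only at exact ties: 2.5 becomes 3 and -2.5 becomes -3. The sketch below illustrates that rounding rule on ordinary Python floats. It is an assumption-laden illustration, not the bit-level routine in ieee_half.h, which operates on the 16-bit representation; the function name is hypothetical.

```python
import math


def round_half_away_from_zero(x: float) -> int:
    """Round to the nearest integer, breaking ties away from zero.

    Operates on a Python float for illustration only; intended for
    values in the range representable by a half-precision number.
    """
    # Round the magnitude with ties going up, then restore the sign.
    return int(math.copysign(math.floor(abs(x) + 0.5), x))
```

For comparison, Python's round(2.5) yields 2, since the built-in rounds ties to the nearest even integer.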
nvinfer1::plugin::GenerateDetection::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: generateDetectionPlugin.cpp:185
inference_c.parse_args
def parse_args()
Definition: inference_c.py:39
nmtSample::BenchmarkWriter
Only measures the performance of sequence generation.
Definition: benchmarkWriter.h:32
nmtSample::BenchmarkWriter::initialize
void initialize() override
Called right before inference starts.
Definition: benchmarkWriter.cpp:39
SampleBuffer::SampleBuffer
SampleBuffer()
Definition: sampleReformatFreeIO.cpp:136
nvinfer1::plugin::ProposalLayerPluginCreator::createPlugin
IPluginV2Ext * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: proposalLayerPlugin.cpp:66
test_examples.Example.run
def run(self, command)
Definition: Polygraphy/tests/test_examples.py:65
SamplePlugin::verifyOutput
bool verifyOutput(const samplesCommon::BufferManager &buffers, const std::string &outputTensorName, int groundTruthDigit) const
Verifies that the output is correct and prints it.
Definition: samplePlugin.cpp:327
bert::Fused_multihead_attention_params::scale_softmax
uint32_t scale_softmax
Definition: fused_multihead_attention.h:81
bert::EmbLayerNormVarSeqlenPlugin::mType
nvinfer1::DataType mType
Definition: embLayerNormVarSeqlenPlugin.h:102
inference.MeasureTime.__exit__
def __exit__(self, exc_type, exc_value, exc_traceback)
Definition: Tacotron2/inference.py:175
tests.calibrator_test.TestMSECalibrator.test_two_tensor
def test_two_tensor(self, verbose)
Definition: calibrator_test.py:228
nvinfer1::plugin::CropAndResizePluginCreator::~CropAndResizePluginCreator
~CropAndResizePluginCreator() override
Definition: cropAndResizePlugin.cpp:251
nvinfer1::plugin::InstanceNormalizationPlugin::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: instanceNormalizationPlugin.cpp:244
SampleErrorRecorder::errorPair
std::pair< ErrorCode, std::string > errorPair
Definition: ErrorRecorder.h:39
tests.quant_linear_test.TestQuantLinear.test_fake_quant_per_tensor
def test_fake_quant_per_tensor(self)
Definition: quant_linear_test.py:104
polygraphy.tools.args.tf.loader
Definition: tools/args/tf/loader.py:1
SampleCharRNNBase::addReshape
nvinfer1::ITensor * addReshape(SampleUniquePtr< nvinfer1::INetworkDefinition > &network, nvinfer1::ITensor &tensor, nvinfer1::Dims dims)
Definition: sampleCharRNN.cpp:558
nvcaffeparser1::ICaffeParser::~ICaffeParser
virtual ~ICaffeParser()
Definition: NvCaffeParser.h:267
denoiser.Denoiser
Definition: denoiser.py:23
sample::Logger::getReportableSeverity
Severity getReportableSeverity() const
Definition: samples/common/logging.h:370
onnx_graphsurgeon.ir.tensor.Constant.to_variable
def to_variable(self, np.dtype dtype=None, Sequence[Union[int, str]] shape=[])
Definition: tensor.py:205
nvinfer1::IAlgorithm::getAlgorithmVariant
virtual const IAlgorithmVariant & getAlgorithmVariant() const =0
Returns the algorithm variant.
polygraphy.backend.trt.loader.BaseNetworkFromOnnx.explicit_precision
explicit_precision
Definition: backend/trt/loader.py:100
polygraphy.backend.cntk.runner.CNTKRunner.model
model
Definition: backend/cntk/runner.py:28
polygraphy.backend.common.loader.BytesFromPath.path
path
Definition: backend/common/loader.py:27
polygraphy.tools.args.trt_legacy.TrtLegacyArgs.check_registered
def check_registered(self)
Definition: tools/args/trt_legacy.py:53
sample::HostDeallocator
Definition: sampleDevice.h:297
nvinfer1::OptProfileSelector::kOPT
@ kOPT
This is used to set or get the value that is used in the optimization (kernel selection).
bert::EmbLayerNormVarSeqlenPlugin::mPosEmbDev
bert::cuda_unique_ptr< void > mPosEmbDev
Definition: embLayerNormVarSeqlenPlugin.h:92
nvinfer1::plugin::BaseCreator::mNamespace
std::string mNamespace
Definition: plugin.h:74
tests.tensor_quant_test.TestTensorQuant
Definition: tensor_quant_test.py:38
nvinfer1::plugin::NormalizePluginCreator
Definition: normalizePlugin.h:107
nvinfer1::utility::validateTRTOption
std::string validateTRTOption(const std::set< char > &seenShortNames, const std::set< std::string > &seenLongNames, const TRTOption &opt)
validateTRTOption will return a string containing an error message if options contain non-numeric cha...
Definition: getOptions.cpp:79
bert::SkipLayerNormInterleavedPlugin::mGamma
bert::WeightsWithOwnership mGamma
Definition: skipLayerNormInt8InterleavedPlugin.h:77
SampleINT8APIParams::modelFileName
std::string modelFileName
Definition: sampleINT8API.cpp:62
nvinfer1::plugin::DetectionOutput::detachFromContext
void detachFromContext() override
Detach the plugin object from its execution context.
Definition: nmsPlugin.cpp:257
nvinfer1::IInt8MinMaxCalibrator::~IInt8MinMaxCalibrator
virtual ~IInt8MinMaxCalibrator()
Definition: NvInfer.h:6101
bert::FusedMultiHeadAttentionKernelMetaInfoV2::mInterleaved
bool mInterleaved
Definition: fused_multihead_attention_v2.h:170
nvinfer1::UnaryOperation::kATAN
@ kATAN
Inverse tangent.
MNISTBatchStream
Definition: BatchStream.h:40
pytorch_quantization.quant_modules.QuantModuleReplacementHelper
Definition: quant_modules.py:36
sample::MirroredBuffer::getHostBuffer
void * getHostBuffer() const
Definition: sampleDevice.h:328
nvinfer1::plugin::SplitPluginCreator::getFieldNames
const nvinfer1::PluginFieldCollection * getFieldNames()
Return a list of fields that need to be passed to createPlugin.
Definition: split.h:152
nvinfer1::ActivationType::kTHRESHOLDED_RELU
@ kTHRESHOLDED_RELU
Thresholded ReLU activation: x>alpha ? x : 0.
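The formula in the brief above, x>alpha ? x : 0, can be sketched element-wise in plain Python. This is an illustration of the formula only, not TensorRT code; the function name `thresholded_relu` and the default `alpha` value are assumptions made for the example.

```python
from typing import List, Sequence


def thresholded_relu(values: Sequence[float], alpha: float = 1.0) -> List[float]:
    """Hypothetical helper: apply x > alpha ? x : 0 to each element.

    Assumption: alpha defaults to 1.0 purely for illustration.
    """
    # Values strictly greater than alpha pass through; all others become 0.
    return [x if x > alpha else 0.0 for x in values]


print(thresholded_relu([-1.0, 0.5, 1.0, 2.0], alpha=1.0))
```

Note that the comparison is strict, so an input exactly equal to alpha maps to 0.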
nvinfer1::plugin::BatchedNMSPlugin::enqueue
int enqueue(int batchSize, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: batchedNMSPlugin.cpp:210
polygraphy.backend.trt.loader.CreateNetwork.explicit_batch
explicit_batch
Definition: backend/trt/loader.py:72
polygraphy.tools.surgeon.surgeon
Definition: surgeon.py:1
polygraphy.tools.run.run.Run.build_script
def build_script(self, args)
Definition: run.py:157
utils.load_filepaths_and_text
def load_filepaths_and_text(dataset_path, filename, split="|")
Definition: demo/Tacotron2/common/utils.py:36
samplesCommon::CpuTimer::start
void start()
Definition: samples/common/common.h:845
UffPoolPluginV2::copyDeviceToInt8Output
void copyDeviceToInt8Output(const void *src, void *dst)
Definition: sampleUffPluginV2Ext.cpp:613
SampleMNISTAPIParams::mnistMeansProto
std::string mnistMeansProto
The proto file containing means.
Definition: sampleMNISTAPI.cpp:52
polygraphy.tools.args.comparator.ComparatorCompareArgs.parse
def parse(self, args)
Definition: tools/args/comparator.py:92
polygraphy.logger.logger.Logger.__init__
def __init__(self, severity=INFO, colors=True, letter=True, timestamp=False, line_info=False, exit_on_errors=False)
Definition: Polygraphy/polygraphy/logger/logger.py:115
helpers.tokenization.FullTokenizer
Definition: tokenization.py:160
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer._short_amax
def _short_amax(self, fmt='.4f')
Definition: tensor_quantizer.py:347
tests.quant_conv_transposed_test.TestQuantConvTranspose1D.test_no_quant
def test_no_quant(self)
Definition: quant_conv_transposed_test.py:336
nvinfer1::LayerType::kIDENTITY
@ kIDENTITY
Identity layer.
nvinfer1::plugin::MultilevelCropAndResize::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: multilevelCropAndResizePlugin.cpp:175
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer.disable_quant
def disable_quant(self)
Definition: tensor_quantizer.py:182
pytorch_quantization.nn.modules.quant_conv.QuantConvTranspose1d
Definition: quant_conv.py:269
polygraphy.tools.args.trt.loader.TrtLoaderArgs.outputs
outputs
Definition: tools/args/trt/loader.py:86
nvinfer1::ILogger::Severity::kERROR
@ kERROR
Application error has occurred.
bert::fused_multihead_attention_v2_int8_256_64_kernel_sm80_cubin_len
unsigned int fused_multihead_attention_v2_int8_256_64_kernel_sm80_cubin_len
Definition: fused_multihead_attention_v2_int8_256_64_kernel.sm80.cpp:13062
samplesCommon::BBox
Definition: samples/common/common.h:667
comparator.test_struct.TestRunResults.test_getitem
def test_getitem(self, run_results)
Definition: comparator/test_struct.py:46
nvinfer1::IPluginV2::destroy
virtual void destroy()=0
Destroy the plugin object.
nvinfer1::plugin::Region::C
int C
Definition: regionPlugin.h:97
comparator.test_data_loader.TestDataLoader.test_shape_tensor_detected
def test_shape_tensor_detected(self)
Definition: test_data_loader.py:50
tests.quant_conv_transposed_test.TestQuantConvTranspose2D.test_fake_quant_per_channel
def test_fake_quant_per_channel(self)
Definition: quant_conv_transposed_test.py:142
nvinfer1::IPlugin::getOutputDimensions
virtual Dims getOutputDimensions(int32_t index, const Dims *inputs, int32_t nbInputDims)=0
Get the dimension of an output tensor.
half_float::detail::functions::acosh
static expr acosh(float arg)
Hyperbolic area cosine implementation.
Definition: samples/common/half.h:1981
nvinfer1::BuilderFlag::kSTRICT_TYPES
@ kSTRICT_TYPES
Enables strict type constraints.
nmtSample::Encoder
encodes input sentences into output states
Definition: encoder.h:33
TLTMaskRCNNConfig::MASK_POOL_SIZE
static const int MASK_POOL_SIZE
Definition: tlt_mrcnn_config.h:31
std::numeric_limits< half_float::half >::quiet_NaN
static half_float::half quiet_NaN()
Quiet NaN.
Definition: samples/common/half.h:4252
SharedMemory
Definition: sampleMovieLensMPS.cpp:118
nvinfer1::RNNGateType::kOUTPUT
@ kOUTPUT
Output gate (o).
bert::fused_multihead_attention_fp16_64_64_kernel_sm80_cu_o_len
unsigned int fused_multihead_attention_fp16_64_64_kernel_sm80_cu_o_len
Definition: fused_multihead_attention_fp16_64_64_kernel.sm80.cpp:1417
polygraphy.backend.trt_legacy.LoadNetworkFromUff.uff_order
uff_order
Definition: backend/trt_legacy.py:83
nvinfer1::plugin::CoordConvACPlugin::iC
int iC
Definition: coordConvACPlugin.h:86
polygraphy.util.misc.pickle_save
def pickle_save(path, obj)
Definition: Polygraphy/polygraphy/util/misc.py:202
nvinfer1::TensorFormat::kNHWC8
@ kNHWC8
Deprecated name of kHWC8, provided for backwards compatibility and will be removed in TensorRT 8....
generate.x
x
Definition: 06_removing_nodes/generate.py:23
polygraphy.tools.util
Definition: tools/Polygraphy/polygraphy/tools/util/__init__.py:1
polygraphy.backend.trt.util.get_input_metadata_from_profile
def get_input_metadata_from_profile(profile, network)
Definition: trt/util.py:339
pytorch_quantization.tensor_quant.FakeAffineTensorQuantFunction.backward
def backward(ctx, grad_outputs)
Definition: tensor_quant.py:410
nvinfer1::plugin::BatchedNMSDynamicPlugin::getOutputDimensions
DimsExprs getOutputDimensions(int outputIndex, const DimsExprs *inputs, int nbInputs, IExprBuilder &exprBuilder) override
Definition: batchedNMSPlugin.cpp:128
nvinfer1::plugin::BatchedNMSBasePluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: batchedNMSPlugin.cpp:496
nmtSample::DebugUtil
container for static debug utility functions
Definition: debugUtil.h:35
mrcnn_to_trt_single.convert_model
def convert_model(inference_model, output_path, output_nodes=[], preprocessor=None, text=False, list_nodes=False)
Definition: mrcnn_to_trt_single.py:126
nvinfer1::plugin::BatchTilePlugin::destroy
void destroy() override
Destroy the plugin object.
Definition: batchTilePlugin.cpp:161
polygraphy.backend.onnx.loader.SaveOnnx.path
path
Definition: backend/onnx/loader.py:163
nvinfer1::plugin::GridAnchorGenerator::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: gridAnchorPlugin.cpp:178
nvonnxparser::IParser::parseWithWeightDescriptors
virtual bool parseWithWeightDescriptors(void const *serialized_onnx_model, size_t serialized_onnx_model_size, uint32_t weight_count, onnxTensorDescriptorV1 const *weight_descriptors)=0
Parse a serialized ONNX model into the TensorRT network with consideration of user provided weights.
polygraphy.util.misc.is_empty_shape
def is_empty_shape(shape)
Definition: Polygraphy/polygraphy/util/misc.py:168
test_examples.Example.__enter__
def __enter__(self)
Definition: Polygraphy/tests/test_examples.py:60
samplesCommon::BufferManager::kINVALID_SIZE_VALUE
static const size_t kINVALID_SIZE_VALUE
Definition: buffers.h:240
EntropyCalibratorImpl::mDeviceInput
void * mDeviceInput
Definition: EntropyCalibrator.h:92
pytorch_quantization.nn.modules.quant_pooling.QuantAvgPool3d.forward
def forward(self, input)
Definition: quant_pooling.py:112
nvinfer1::plugin::DetectionLayerPluginCreator::createPlugin
IPluginV2Ext * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: detectionLayerPlugin.cpp:62
nvinfer1::LayerType::kDECONVOLUTION
@ kDECONVOLUTION
Deconvolution layer.
SampleUffMNIST::mEngine
std::shared_ptr< nvinfer1::ICudaEngine > mEngine
The TensorRT engine used to run the network.
Definition: sampleUffMNIST.cpp:99
SampleDynamicReshape::SampleDynamicReshape
SampleDynamicReshape(const samplesCommon::OnnxSampleParams &params)
Definition: sampleDynamicReshape.cpp:51
SampleMovieLensParams::strict
bool strict
Definition: sampleMovieLens.cpp:69
bert::kSM_TURING
static constexpr int32_t kSM_TURING
Definition: qkvToContextInt8InterleavedPlugin.h:30
sample::InferenceEnvironment::context
std::vector< TrtUniquePtr< nvinfer1::IExecutionContext > > context
Definition: sampleInference.h:37
pytorch_quantization.nn._functions.quant_rnn.Recurrent
def Recurrent(inner, reverse=False)
Definition: _functions/quant_rnn.py:130
model.Tacotron2
Definition: demo/Tacotron2/tacotron2/model.py:587
TLTMaskRCNNConfig::MODEL_INPUT
static const std::string MODEL_INPUT
Definition: tlt_mrcnn_config.h:171
comparator.test_data_loader.TestDataLoader.test_non_user_provided_inputs_never_shape_tensors
def test_non_user_provided_inputs_never_shape_tensors(self)
Definition: test_data_loader.py:88
pytorch_quantization.nn.modules.quant_rnn.QuantLSTMCell.__init__
def __init__(self, input_size, hidden_size, bias=True, **kwargs)
Definition: modules/quant_rnn.py:385
nvinfer1::plugin::DetectionOutputParameters::shareLocation
bool shareLocation
Definition: NvInferPluginUtils.h:183
util.test_format.FormatTestCase
Definition: test_format.py:23
test_ir.TestGraph.test_cleanup_independent_path
def test_cleanup_independent_path(self)
Definition: test_ir.py:665
nvinfer1::IPoolingLayer
A Pooling layer in a network definition.
Definition: NvInfer.h:1731
export_tacotron2_onnx.DecoderIter.p_attention_dropout
p_attention_dropout
Definition: export_tacotron2_onnx.py:110
nvinfer1::plugin::ProposalLayerPluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: proposalLayerPlugin.cpp:51
polygraphy.backend.trt_legacy.TrtLegacyRunner.HostDeviceMem.host
host
Definition: backend/trt_legacy.py:181
bert::FusedMultiHeadAttentionKernelMetaInfoV2::mSharedMemBytes
unsigned int mSharedMemBytes
Definition: fused_multihead_attention_v2.h:167
polygraphy.logger.logger.LoggerVerbosity.__exit__
def __exit__(self, exc_type, exc_value, traceback)
Definition: Polygraphy/polygraphy/logger/logger.py:68
polygraphy.tools.util.script.Inline
Definition: script.py:27
bert::Fused_multihead_attention_params::scale_bmm2
uint32_t scale_bmm2
Definition: fused_multihead_attention.h:81
bert::fused_multihead_attention_v2_fp16_128_64_kernel_sm75_cubin
unsigned char fused_multihead_attention_v2_fp16_128_64_kernel_sm75_cubin[]
Definition: fused_multihead_attention_v2_fp16_128_64_kernel.sm75.cpp:19
bert::TFusedMultiHeadAttentionXMMAKernel::mSM
unsigned int mSM
Definition: fused_multihead_attention.h:280
samplesCommon::TimerBase::microseconds
float microseconds() const noexcept
Definition: samples/common/common.h:786
nvinfer1::plugin::SpecialSlicePluginCreator::mFC
static PluginFieldCollection mFC
Definition: specialSlicePlugin.h:117
onnx_graphsurgeon.logger.logger.Logger._severity
_severity
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:100
nvinfer1::plugin::CoordConvACPluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: coordConvACPlugin.cpp:180
nvinfer1::plugin::Region::attachToContext
void attachToContext(cudnnContext *cudnnContext, cublasContext *cublasContext, IGpuAllocator *gpuAllocator) override
Attach the plugin object to an execution context and grant the plugin the access to some context reso...
Definition: regionPlugin.cpp:463
pytorch_quantization.nn.modules.quant_rnn.GRUCell
Definition: modules/quant_rnn.py:422
polygraphy.tools.args.model.ModelArgs.__init__
def __init__(self, model_required=False, inputs="--inputs", model_type=None)
Definition: tools/Polygraphy/polygraphy/tools/args/model.py:26
nvinfer1::IPluginRegistry::getPluginCreatorList
virtual IPluginCreator *const * getPluginCreatorList(int32_t *numCreators) const noexcept=0
Return all the registered plugin creators and the number of registered plugin creators.
polygraphy.backend.tf.runner
Definition: backend/tf/runner.py:1
half_float::detail::signbit
bool signbit(half arg)
Check sign.
Definition: ieee_half.h:2760
bert::Fused_multihead_attention_params_v2::o_stride_in_bytes
int64_t o_stride_in_bytes
Definition: fused_multihead_attention_v2.h:39
builder.BertConfig.use_int8_multihead
use_int8_multihead
Definition: builder.py:102
half_float::detail::functions::expm1
static expr expm1(float arg)
Exponential implementation.
Definition: samples/common/half.h:1775
samplesCommon::UffSampleParams::uffFileName
std::string uffFileName
Filename of uff file of a network.
Definition: argsParser.h:73
nvinfer1::PluginType::kNORMALIZE
@ kNORMALIZE
Normalize plugin.
SampleUffFasterRcnn::constructNetwork
bool constructNetwork(SampleUniquePtr< nvinfer1::IBuilder > &builder, SampleUniquePtr< nvinfer1::INetworkDefinition > &network, SampleUniquePtr< nvuffparser::IUffParser > &parser)
Parses a UFF model for Faster R-CNN and creates a TensorRT network.
Definition: sampleUffFasterRCNN.cpp:243
helpers.tokenization.BertTokenizer
Definition: tokenization.py:184
tests.utils
Definition: tools/pytorch-quantization/tests/utils.py:1
UffPoolPluginV2::canBroadcastInputAcrossBatch
bool canBroadcastInputAcrossBatch(int inputIndex) const override
Definition: sampleUffPluginV2Ext.cpp:573
polygraphy.tools.args.trt.runner.TrtRunnerArgs.add_to_parser
def add_to_parser(self, parser)
Definition: tools/args/trt/runner.py:26
SampleOnnxMnistCoordConvAC::processInput
bool processInput(const samplesCommon::BufferManager &buffers)
Reads the input and stores the result in a managed buffer.
Definition: sampleOnnxMnistCoordConvAC.cpp:246
nvinfer1::plugin::Normalize
Definition: normalizePlugin.h:31
nvcaffeparser1::parseDeconvolution
nvinfer1::ILayer * parseDeconvolution(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &weightFactory, BlobNameToTensor &tensors)
Definition: parseDeconv.cpp:23
tools.test_precision.test_polygraphy_precision_worst_first_sanity
def test_polygraphy_precision_worst_first_sanity()
Definition: test_precision.py:48
evaluate-v1.normalize_answer
def normalize_answer(s)
Definition: evaluate-v1.1.py:29
nvinfer1::Dims4::Dims4
Dims4()
Construct an empty Dims4 object.
Definition: NvInfer.h:296
nvinfer1::RNNDirection::kUNIDIRECTION
@ kUNIDIRECTION
Network iterations from first input to last input.
polygraphy.tools.args.comparator.ComparatorRunArgs.add_to_script
def add_to_script(self, script, data_loader_name)
Definition: tools/args/comparator.py:49
sample::DeviceDeallocator::operator()
void operator()(void *ptr)
Definition: sampleDevice.h:283
polygraphy.backend.tf.runner.TfRunner.activate_impl
def activate_impl(self)
Definition: backend/tf/runner.py:64
nvinfer1::IHostMemory::destroy
virtual void destroy() noexcept=0
Destroy the allocated memory.
BertInferenceRunner::bert
BertInference bert
Definition: infer_c.cpp:50
nvinfer1::plugin::BatchedNMSDynamicPluginCreator
Definition: batchedNMSPlugin.h:149
nvinfer1::plugin::PluginCreatorRegistry::mRegistryList
std::unordered_set< std::string > mRegistryList
Definition: InferPlugin.cpp:144
nmtSample::LSTMEncoder::mGateBiasWeights
std::vector< nvinfer1::Weights > mGateBiasWeights
Definition: lstmEncoder.h:52
polygraphy.tools.args.comparator.ComparatorRunArgs.parse
def parse(self, args)
Definition: tools/args/comparator.py:42
samplesCommon::SampleParams::inputTensorNames
std::vector< std::string > inputTensorNames
Definition: argsParser.h:43
nvinfer1::plugin::MultilevelProposeROI::detachFromContext
void detachFromContext() override
Detach the plugin object from its execution context.
Definition: multilevelProposeROIPlugin.cpp:472
nvonnxparser::IParser::getRefitMap
virtual int getRefitMap(const char **weightNames, const char **layerNames, nvinfer1::WeightsRole *roles)=0
Get description of all ONNX weights that can be refitted.
nvinfer1::plugin::FlattenConcat::mOutputConcatAxis
int mOutputConcatAxis
Definition: flattenConcat.h:102
half_float::detail::functions
Wrapper implementing unspecialized half-precision functions.
Definition: ieee_half.h:1215
MinimumWorkspaceAlgorithmSelector::reportAlgorithms
void reportAlgorithms(const nvinfer1::IAlgorithmContext *const *algoContexts, const nvinfer1::IAlgorithm *const *algoChoices, int32_t nbAlgorithms) override
Called by TensorRT to report choices it made.
Definition: sampleAlgorithmSelector.cpp:300
nvinfer1::plugin::ReorgPluginCreator::createPlugin
IPluginV2Ext * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: reorgPlugin.cpp:205
model.Postnet.forward
def forward(self, x)
Definition: demo/Tacotron2/tacotron2/model.py:162
RefineDetectionWorkSpace::sortClassPosDims
nvinfer1::DimsHW sortClassPosDims
Definition: maskRCNNKernels.h:104
nmtSample::ngramCounts
Count_t ngramCounts(const Segment_t &segment, int maxOrder=4)
Definition: bleuScoreWriter.cpp:70
data_function.MelAudioLoader
Definition: waveglow/data_function.py:23
bert::Fused_multihead_attention_params::b
int b
Definition: fused_multihead_attention.h:79
nvinfer1::UnaryOperation::kACOSH
@ kACOSH
Inverse hyperbolic cosine.
polygraphy.tools.util.script
Definition: script.py:1
SampleMNISTAPI::SampleMNISTAPI
SampleMNISTAPI(const SampleMNISTAPIParams &params)
Definition: sampleMNISTAPI.cpp:65
nvinfer1::IConcatenationLayer
A concatenation layer in a network definition.
Definition: NvInfer.h:2248
nmtSample::DataWriter::ptr
std::shared_ptr< DataWriter > ptr
Definition: dataWriter.h:36
sample::anonymous_namespace{sampleInference.cpp}::EventType::kINPUT_S
@ kINPUT_S
FCPlugin::FCPlugin
FCPlugin(const nvinfer1::Weights *weights, int nbWeights, int nbOutputChannels)
Definition: samples/opensource/samplePlugin/fcPlugin.h:39
tests.integration_test.TestNetwork.test_backward
def test_backward(self)
Definition: integration_test.py:69
generate.reduce_prod
def reduce_prod(self, a, axes, keepdims=True)
Definition: 09_shape_operations_with_the_layer_api/generate.py:31
nvinfer1::plugin::GroupNormalizationPluginCreator::~GroupNormalizationPluginCreator
~GroupNormalizationPluginCreator() override=default
polygraphy.backend.trt_legacy.TrtLegacyRunner.tf32
tf32
Definition: backend/trt_legacy.py:218
SampleINT8APIArgs::writeNetworkTensors
bool writeNetworkTensors
Definition: sampleINT8API.cpp:650
nvinfer1::plugin::PriorBox::clone
IPluginV2Ext * clone() const override
Clone the plugin object.
Definition: priorBoxPlugin.cpp:262
anonymous_namespace{pluginLogging.h}::LOG_ERROR
LogStreamConsumer LOG_ERROR(const Logger &logger)
produces a LogStreamConsumer object that can be used to log messages of severity kERROR
Definition: pluginLogging.h:509
samplesCommon::BufferManager::getDeviceBindings
std::vector< void * > & getDeviceBindings()
Returns a vector of device buffers that you can use directly as bindings for the execute and enqueue ...
Definition: buffers.h:278
half_float::detail::enable< T, expr, half, void >::type
T type
Definition: ieee_half.h:377
MaskRCNNUtils::RawDetection::score
float score
Definition: sampleUffMaskRCNN.cpp:54
sample::dumpInputs
void dumpInputs(const nvinfer1::IExecutionContext &context, const Bindings &bindings, std::ostream &os)
Print input tensors to stream.
Definition: sampleReporting.cpp:323
pytorch_quantization.nn.modules.quant_bert.QuantBertSelfAttention.dropout
dropout
Definition: quant_bert.py:68
nvinfer1::plugin::GroupNormalizationPlugin::destroy
void destroy() override
Destroy the plugin object.
Definition: groupNormalizationPlugin.cpp:187
model.Invertible1x1Conv.infer
def infer(self, z)
Definition: demo/Tacotron2/waveglow/model.py:65
samplesCommon::GpuTimer::mStop
cudaEvent_t mStop
Definition: samples/common/common.h:835
nvinfer1::IPluginRegistry::registerCreator
virtual bool registerCreator(IPluginCreator &creator, const char *pluginNamespace) noexcept=0
Register a plugin creator.
polygraphy.tools.args.comparator.ComparatorRunArgs.warm_up
warm_up
Definition: tools/args/comparator.py:43
BertInference::mOutputDims
std::vector< int > mOutputDims
Definition: bert_infer.h:351
nvinfer1::plugin::PriorBoxPluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: priorBoxPlugin.cpp:378
tests.clip_test.TestClip.test_backward
def test_backward(self)
Definition: clip_test.py:50
nvinfer1::INetworkDefinition::addGather
virtual IGatherLayer * addGather(ITensor &data, ITensor &indices, int32_t axis)=0
Add a gather layer to the network.
nvinfer1::plugin::MultilevelCropAndResize::mFeatureLength
int mFeatureLength
Definition: multilevelCropAndResizePlugin.h:96
polygraphy.backend.tf.loader.SaveGraph.__init__
def __init__(self, graph, path=None, tensorboard_dir=None, engine_dir=None)
Definition: backend/tf/loader.py:308
nvinfer1::TripLimit::kWHILE
@ kWHILE
Tensor is a scalar of type kBOOL. Loop terminates when value is false.
tests.quant_pooling_test.TestQuantMaxPool2d.test_raise
def test_raise(self)
Definition: quant_pooling_test.py:61
sample::HostAllocator::operator()
void operator()(void **ptr, size_t size)
Definition: sampleDevice.h:291
nvinfer1::plugin::Normalize::setPluginNamespace
void setPluginNamespace(const char *pluginNamespace) override
Set the namespace that this plugin object belongs to.
Definition: normalizePlugin.cpp:166
nvinfer1::plugin::GridAnchorGenerator::deserializeToDevice
Weights deserializeToDevice(const char *&hostBuffer, size_t count)
Definition: gridAnchorPlugin.cpp:268
SampleErrorRecorder::clear
void clear() noexcept final
Clear the error stack on the error recorder.
Definition: ErrorRecorder.h:65
SampleReformatFreeIO
The SampleReformatFreeIO class implements the reformat free I/O sample.
Definition: sampleReformatFreeIO.cpp:204
nvinfer1::plugin::NormalizePluginCreator::mNbWeights
int mNbWeights
Definition: normalizePlugin.h:129
nvinfer1::plugin::MultilevelProposeROI::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: multilevelProposeROIPlugin.cpp:216
Logger::severityPrefix
static const char * severityPrefix(Severity severity)
returns an appropriate string for prefixing a log message with the given severity
Definition: pluginLogging.h:400
pytorch_quantization.calib.histogram.HistogramCalibrator.compute_amax
def compute_amax(self, str method, *, int stride=1, int start_bin=128, float percentile=99.99)
Definition: histogram.py:214
UffPoolPluginV2::configurePlugin
void configurePlugin(const PluginTensorDesc *in, int nbInput, const PluginTensorDesc *out, int nbOutput) override
Definition: sampleUffPluginV2Ext.cpp:501
nvinfer1::plugin::DetectionOutput::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: nmsPlugin.cpp:154
bert::EmbLayerNormVarSeqlenPlugin::mWordEmbDev
bert::cuda_unique_ptr< void > mWordEmbDev
Definition: embLayerNormVarSeqlenPlugin.h:90
nvinfer1::RNNGateType
RNNGateType
Identifies an individual gate within an RNN cell.
Definition: NvInfer.h:3201
nvinfer1::plugin::GroupNormalizationPlugin::desc
cudnnTensorDescriptor_t desc
Definition: groupNormalizationPlugin.h:103
nvinfer1::LayerType::kSELECT
@ kSELECT
Select layer.
bert::TFusedMultiHeadAttentionXMMAKernel::mKernelMeta
const TKernelMeta * mKernelMeta
Definition: fused_multihead_attention.h:278
nvonnxparser::ErrorCode::kSUCCESS
@ kSUCCESS
nmtSample::TextWriter::initialize
void initialize() override
it is called right before inference starts
Definition: textWriter.cpp:36
nvinfer1::plugin::Normalize::mWeights
Weights mWeights
Definition: normalizePlugin.h:96
nmtSample::LikelihoodCombinationOperator::smallerThanMinimalLikelihood
virtual float smallerThanMinimalLikelihood() const =0
bert::TFusedMultiHeadAttentionXMMAKernel::mValidSequences
std::set< int > mValidSequences
Definition: fused_multihead_attention.h:288
nvinfer1::ILoop::addIterator
virtual IIteratorLayer * addIterator(ITensor &tensor, int32_t axis=0, bool reverse=false) noexcept=0
Return layer that subscripts tensor by loop iteration.
SampleConfig::setModelDtype
void setModelDtype(const nvinfer1::DataType mdt)
Set the Model Data Type.
Definition: sampleConfig.h:84
nvinfer1::plugin::GenerateDetection::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: generateDetectionPlugin.cpp:195
model.Invertible1x1Conv.conv
conv
Definition: demo/Tacotron2/waveglow/model.py:41
half_float::detail::scalbn
half scalbn(half arg, int exp)
Multiply by power of two.
Definition: ieee_half.h:2651
nvinfer1::plugin::MultilevelProposeROI::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: multilevelProposeROIPlugin.cpp:231
builder.BertConfig
Definition: builder.py:88
polygraphy.backend.trt.util.get_layer_class_mapping
def get_layer_class_mapping()
Definition: trt/util.py:41
nvinfer1::plugin::MultilevelCropAndResize::configurePlugin
void configurePlugin(const Dims *inputDims, int nbInputs, const Dims *outputDims, int nbOutputs, const DataType *inputTypes, const DataType *outputTypes, const bool *inputIsBroadcast, const bool *outputIsBroadcast, PluginFormat floatFormat, int maxBatchSize) override
Definition: multilevelCropAndResizePlugin.cpp:274
nvinfer1::IScaleLayer
A Scale layer in a network definition.
Definition: NvInfer.h:2104
nvinfer1::plugin::NMSPluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: nmsPlugin.h:116
SampleCharRNNWeightNames::EMBED_NAME
const std::string EMBED_NAME
Definition: sampleCharRNN.cpp:79
nvcaffeparser1::parseConvolution
nvinfer1::ILayer * parseConvolution(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &weightFactory, BlobNameToTensor &tensors)
Definition: parseConv.cpp:23
nvonnxparser::IParserError::line
virtual int line() const =0
source line at which the error occurred
half_float::detail::functions::hypot
static expr hypot(float x, float y)
Hypotenuse implementation.
Definition: samples/common/half.h:1863
pytorch_quantization.nn.modules.quant_rnn.QuantLSTMCell.hidden_size
hidden_size
Definition: modules/quant_rnn.py:388
model.Tacotron2.n_mel_channels
n_mel_channels
Definition: demo/Tacotron2/tacotron2/model.py:591
polygraphy.logger.logger.Logger.once_logged
once_logged
Definition: Polygraphy/polygraphy/logger/logger.py:129
tests.quant_linear_test.TestQuantLinear.test_weight_fake_per_channel
def test_weight_fake_per_channel(self)
Definition: quant_linear_test.py:71
onnx_graphsurgeon.logger.logger.LoggerIndent.logger
logger
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:28
tests.tensor_quant_test.TestQuantDescriptor.test_from_to_yaml
def test_from_to_yaml(self)
Definition: tensor_quant_test.py:338
nvinfer1::plugin::CropAndResizePlugin::attachToContext
void attachToContext(cudnnContext *cudnnContext, cublasContext *cublasContext, IGpuAllocator *gpuAllocator) override
Attach the plugin object to an execution context and grant the plugin the access to some context reso...
Definition: cropAndResizePlugin.cpp:235
nvinfer1::plugin::ProposalLayer::check_valid_inputs
void check_valid_inputs(const nvinfer1::Dims *inputs, int nbInputDims)
Definition: proposalLayerPlugin.cpp:236
nvinfer1::plugin::NormalizePluginCreator::mFC
static PluginFieldCollection mFC
Definition: normalizePlugin.h:125
stft.STFT.hop_length
hop_length
Definition: stft.py:47
polygraphy.tools.args.tf.loader.TfLoaderArgs.tftrt
tftrt
Definition: tools/args/tf/loader.py:78
BboxInfo::BboxInfo
BboxInfo(T conf_score, int label, int bbox_idx, bool kept)
Definition: bboxUtils.h:46
nvinfer1::plugin::GridAnchorParameters::minSize
float minSize
Definition: NvInferPluginUtils.h:145
nmtSample::DataReader::DataReader
DataReader()=default
pytorch_quantization.nn.modules._utils.QuantInputMixin.init_quantizer
def init_quantizer(self, quant_desc_input)
Definition: _utils.py:112
polygraphy.backend.onnxrt.loader.SessionFromOnnxBytes._model_bytes
_model_bytes
Definition: backend/onnxrt/loader.py:30
half_float::detail::functions::log2
static expr log2(float arg)
Binary logarithm implementation.
Definition: samples/common/half.h:1827
polygraphy.tools.args.base.BaseArgs.add_to_parser
def add_to_parser(self, parser)
Definition: args/base.py:22
samplesCommon::GenericBuffer::nbBytes
size_t nbBytes() const
Returns the size (in bytes) of the buffer.
Definition: buffers.h:136
nvinfer1::plugin::RPROIPluginCreator::anchorsRatios
std::vector< float > anchorsRatios
Definition: nvFasterRCNNPlugin.h:123
nvcaffeparser1::IPluginFactory::createPlugin
virtual nvinfer1::IPlugin * createPlugin(const char *layerName, const nvinfer1::Weights *weights, int32_t nbWeights)=0
Creates a plugin.
polygraphy.tools.args.trt.loader.TrtLoaderArgs.get_trt_network_loader
def get_trt_network_loader(self)
Definition: tools/args/trt/loader.py:202
nvinfer1::plugin::InstanceNormalizationPlugin::configurePlugin
void configurePlugin(const nvinfer1::DynamicPluginTensorDesc *in, int nbInputs, const nvinfer1::DynamicPluginTensorDesc *out, int nbOutputs) override
Definition: instanceNormalizationPlugin.cpp:297
nvinfer1::CUDADriverWrapper::_cuLinkAddFile
CUresult(* _cuLinkAddFile)(CUlinkState, CUjitInputType, const char *, unsigned int, CUjit_option *, void **)
Definition: cudaDriverWrapper.h:78
onnx_graphsurgeon.ir.graph.Graph.__eq__
def __eq__(self, "Graph" other)
Definition: graph.py:151
sample::exportJSONOutput
void exportJSONOutput(const nvinfer1::IExecutionContext &context, const Bindings &bindings, const std::string &fileName)
Export output tensors to JSON file.
Definition: sampleReporting.cpp:335
nvinfer1::plugin::ResizeNearest::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: resizeNearestPlugin.cpp:165
SampleCharRNNWeightNames::RNNW_L0_NAME
const std::string RNNW_L0_NAME
Definition: sampleCharRNN.cpp:73
nvinfer1::ITensor::setBroadcastAcrossBatch
virtual void setBroadcastAcrossBatch(bool broadcastAcrossBatch)=0
Set whether to enable broadcast of tensor across the batch.
SimpleProfiler::mLayerNames
std::vector< std::string > mLayerNames
Definition: samples/common/common.h:221
nvinfer1::plugin::PriorBox::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: priorBoxPlugin.cpp:185
MultilevelProposeROIWorkSpace::sortClassScoreDims
nvinfer1::DimsHW sortClassScoreDims
Definition: maskRCNNKernels.h:167
nvinfer1::plugin::FlattenConcat::mNumInputs
int mNumInputs
Definition: flattenConcat.h:102
sample::TrtCudaBuffer::TrtCudaBuffer
TrtCudaBuffer(size_t size)
Definition: sampleDevice.h:244
tools.test_surgeon.TestSurgeonExtract.test_sanity
def test_sanity(self)
Definition: test_surgeon.py:25
generate.A
A
Definition: 07_creating_a_model_with_the_layer_api/generate.py:84
text.numbers._expand_number
def _expand_number(m)
Definition: numbers.py:64
nmtSample::LSTMEncoder::addToModel
void addToModel(nvinfer1::INetworkDefinition *network, int maxInputSequenceLength, nvinfer1::ITensor *inputEmbeddedData, nvinfer1::ITensor *actualInputSequenceLengths, nvinfer1::ITensor **inputStates, nvinfer1::ITensor **memoryStates, nvinfer1::ITensor **lastTimestepStates) override
Add the memory and last timestep states to the network. lastTimestepHiddenStates is the pointer to the...
Definition: lstmEncoder.cpp:60
nvinfer1::IActivationLayer::~IActivationLayer
virtual ~IActivationLayer()
Definition: NvInfer.h:1663
helpers.tokenization.FullTokenizer.basic_tokenizer
basic_tokenizer
Definition: tokenization.py:166
polygraphy.tools.args.tf.loader.TfLoaderArgs
Definition: tools/args/tf/loader.py:22
half_float::detail::functions::fdim
static expr fdim(float x, float y)
Positive difference implementation.
Definition: samples/common/half.h:1734
polygraphy.tools.util.misc.parse_meta
def parse_meta(meta_args, includes_shape=True, includes_dtype=True)
Definition: Polygraphy/polygraphy/tools/util/misc.py:51
pytorch_quantization.tensor_quant.ScaledQuantDescriptor.narrow_range
def narrow_range(self)
Definition: tensor_quant.py:148
tests.quant_rnn_test.TestQuantLSTMCell.test_quant_different_prec
def test_quant_different_prec(self, verbose)
Definition: quant_rnn_test.py:223
bert::EmbLayerNormVarSeqlenPluginCreator::getFieldNames
const nvinfer1::PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: embLayerNormVarSeqlenPlugin.cpp:435
nvinfer1::plugin::CropAndResizePlugin::detachFromContext
void detachFromContext() override
Detach the plugin object from its execution context.
Definition: cropAndResizePlugin.cpp:241
pytorch_quantization.calib.histogram.HistogramCalibrator.__init__
def __init__(self, num_bits, axis, unsigned, num_bins=2048, grow_method=None, skip_zeros=False)
Definition: histogram.py:44
EntropyCalibratorImpl::EntropyCalibratorImpl
EntropyCalibratorImpl(TBatchStream stream, int firstBatch, std::string networkName, const char *inputBlobName, bool readCache=true)
Definition: EntropyCalibrator.h:31
nvinfer1::plugin::FlattenConcat::mCHW
nvinfer1::Dims mCHW
Definition: flattenConcat.h:103
nvinfer1::ElementWiseOperation::kEQUAL
@ kEQUAL
Check if two elements are equal.
half_float::half::operator-=
detail::enable< half &, T >::type operator-=(T rhs)
Arithmetic assignment.
Definition: ieee_half.h:1133
onnx_graphsurgeon.logger.logger.Logger.log
def log(self, message, severity, mode=LogMode.EACH, stack_depth=2)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:155
half_float::detail::log1p
expr log1p(half arg)
Natural logarithm of (1 + arg).
Definition: ieee_half.h:2361
nmtSample::Component::getInfo
virtual std::string getInfo()=0
Get the textual description of the component.
polygraphy.tools.util.script.Script.summary
summary
Definition: script.py:143
nvonnxparser::IParser::getNbErrors
virtual int getNbErrors() const =0
Get the number of errors that occurred during prior calls to parse.
nvinfer1::IGatherLayer::~IGatherLayer
virtual ~IGatherLayer()
Definition: NvInfer.h:2709
sample::TrtCudaBuffer::operator=
TrtCudaBuffer & operator=(const TrtCudaBuffer &)=delete
onnx_graphsurgeon.ir.graph
Definition: graph.py:1
half_float::half::operator/=
detail::enable< half &, T >::type operator/=(T rhs)
Arithmetic assignment.
Definition: ieee_half.h:1145
test_ir.TestNode.test_equals
def test_equals(self)
Definition: test_ir.py:140
sample::anonymous_namespace{sampleInference.cpp}::Iteration::mBindings
Bindings & mBindings
Definition: sampleInference.cpp:457
SampleCharRNNBase::convertRNNBias
nvinfer1::Weights convertRNNBias(nvinfer1::Weights input)
Converts RNN Biases from TensorFlow's format to TensorRT's format.
Definition: sampleCharRNN.cpp:471
polygraphy.backend.trt_legacy.LoadNetworkFromCaffe.model
model
Definition: backend/trt_legacy.py:146
nvinfer1::plugin::PriorBox::copyToDevice
Weights copyToDevice(const void *hostData, size_t count)
Definition: priorBoxPlugin.cpp:214
nvinfer1::INetworkDefinition::addInput
virtual ITensor * addInput(const char *name, DataType type, Dims dimensions)=0
Add an input tensor to the network.
nvcaffeparser1::CaffeParser::mPluginFactoryV2
nvcaffeparser1::IPluginFactoryV2 * mPluginFactoryV2
Definition: caffeParser.h:88
SampleUffSSD::mEngine
std::shared_ptr< nvinfer1::ICudaEngine > mEngine
The TensorRT engine used to run the network.
Definition: sampleUffSSD.cpp:97
Logger::TestAtom::TestAtom
TestAtom(bool started, const std::string &name, const std::string &cmdline)
Definition: pluginLogging.h:295
SampleUffMNIST::SampleUniquePtr
std::unique_ptr< T, samplesCommon::InferDeleter > SampleUniquePtr
Definition: sampleUffMNIST.cpp:56
polygraphy.tools.util.script.process_args
def process_args(args, kwargs)
Definition: script.py:31
nvinfer1::plugin::RPROIParams::preNmsTop
int32_t preNmsTop
Definition: NvInferPluginUtils.h:121
polygraphy.backend.onnx.register_callback
def register_callback()
Definition: tools/Polygraphy/polygraphy/backend/onnx/__init__.py:5
Batch::Batch
Batch(ICudaEngine *engine, void *userInputPtr, void *itemInputPtr, const Args &args)
Definition: sampleMovieLensMPS.cpp:191
MNISTBatchStream::next
bool next() override
Definition: BatchStream.h:58
util.test_cuda.ResizeTestCase.old_bytes
old_bytes
Definition: test_cuda.py:28
nvinfer1::plugin::SplitPlugin::initialize
int initialize() override
Initialize the layer for execution.
anonymous_namespace{serialize.hpp}::Serializer< std::vector< T >, typename std::enable_if< std::is_arithmetic< T >::value||std::is_enum< T >::value||std::is_pod< T >::value >::type >::serialized_size
static size_t serialized_size(std::vector< T > const &value)
Definition: serialize.hpp:91
nvinfer1::plugin::SpecialSlice::mNameSpace
std::string mNameSpace
Definition: specialSlicePlugin.h:96
SampleMLP::infer
bool infer()
Runs the TensorRT inference engine for this sample.
Definition: sampleMLP.cpp:255
nmtSample::SLPEmbedder::getInputDimensionSize
int getInputDimensionSize() override
Get the upper bound for the possible values of indices.
Definition: slpEmbedder.cpp:59
bert::SkipLayerNormInterleavedPlugin::mParamsOnDevice
bool mParamsOnDevice
Definition: skipLayerNormInt8InterleavedPlugin.h:81
model.WN.forward
def forward(self, forward_input)
Definition: demo/Tacotron2/waveglow/model.py:133
nvinfer1::plugin::InstanceNormalizationPluginCreator::mNamespace
std::string mNamespace
Definition: instanceNormalizationPlugin.h:122
pytorch_quantization.tensor_quant.ScaledQuantDescriptor.amax
def amax(self)
Definition: tensor_quant.py:124
bert::fused_multihead_attention_v2_int8_256_64_kernel_cubin_len
unsigned int fused_multihead_attention_v2_int8_256_64_kernel_cubin_len
Definition: fused_multihead_attention_v2_int8_256_64_kernel.sm72.cpp:20009
nvinfer1::plugin::PriorBox::enqueue
int enqueue(int batchSize, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: priorBoxPlugin.cpp:173
backend.tf.test_loader.TestLoggerCallbacks
Definition: tf/test_loader.py:28
onnx_models.Model.load
def load(self)
Definition: onnx_models.py:42
Int8EntropyCalibrator2::readCalibrationCache
const void * readCalibrationCache(size_t &length) override
Definition: EntropyCalibrator.h:121
polygraphy.logger.logger.LoggerIndent
Definition: Polygraphy/polygraphy/logger/logger.py:43
nvinfer1::plugin::ResizeNearest::supportsFormat
bool supportsFormat(DataType type, PluginFormat format) const override
Check format support.
Definition: resizeNearestPlugin.cpp:192
export_tacotron2_onnx.DecoderIter.prenet
prenet
Definition: export_tacotron2_onnx.py:112
nvinfer1::plugin::GridAnchorGenerator::GridAnchorGenerator
GridAnchorGenerator(const GridAnchorParameters *param, int numLayers, const char *version)
Definition: gridAnchorPlugin.cpp:36
pytorch_quantization.nn.modules.quant_instancenorm.QuantInstanceNorm1d
Definition: quant_instancenorm.py:33
nmtSample::LSTMDecoder::~LSTMDecoder
~LSTMDecoder() override=default
nvinfer1::plugin::FlattenConcatPluginCreator::~FlattenConcatPluginCreator
~FlattenConcatPluginCreator() override=default
nvinfer1::plugin::RPROIParams::minBoxSize
float minBoxSize
Definition: NvInferPluginUtils.h:126
pytorch_quantization.calib.max.MaxCalibrator.compute_amax
def compute_amax(self)
Definition: max.py:88
export_onnx2trt.parse_args
def parse_args(parser)
Definition: export_onnx2trt.py:28
tests.tensor_quant_test.TestFakeTensorQuant.test_overflow_fp16
def test_overflow_fp16(self)
Definition: tensor_quant_test.py:244
samplesCommon::HostMemory::mSize
std::size_t mSize
Definition: samples/common/common.h:330
sample::SystemOptions::fallback
bool fallback
Definition: sampleOptions.h:138
nvinfer1::plugin::ResizeNearestPluginCreator::mScale
float mScale
Definition: resizeNearestPlugin.h:115
polygraphy.backend.trt.loader.NetworkFromOnnxPath.path
path
Definition: backend/trt/loader.py:150
nvinfer1::IPluginV2::setPluginNamespace
virtual void setPluginNamespace(const char *pluginNamespace)=0
Set the namespace that this plugin object belongs to.
nvinfer1::plugin::Reorg::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: reorgPlugin.cpp:51
polygraphy.backend.tf.loader.OptimizeGraph._graph
_graph
Definition: backend/tf/loader.py:36
backend.trt.test_calibrator.TestCalibrator
Definition: test_calibrator.py:38
BufferDesc::dims
int dims[4]
Definition: sampleReformatFreeIO.cpp:117
tests.quant_conv_transposed_test.TestQuantConvTranspose1D.test_against_unquantized
def test_against_unquantized(self)
Definition: quant_conv_transposed_test.py:497
nmtSample::DebugUtil::DumpTensorPlugin::configure
void configure(const nvinfer1::Dims *inputDims, int nbInputs, const nvinfer1::Dims *outputDims, int nbOutputs, int maxBatchSize) override
Definition: debugUtil.cpp:44
half_float::detail::functions::tanh
static expr tanh(float arg)
Hyperbolic tangent implementation.
Definition: samples/common/half.h:1959
bert::SkipLayerNormInterleavedPlugin::enqueue
int enqueue(const nvinfer1::PluginTensorDesc *inputDesc, const nvinfer1::PluginTensorDesc *outputDesc, const void *const *inputs, void *const *outputs, void *workspace, cudaStream_t stream) override
Execute the layer.
Definition: skipLayerNormInt8InterleavedPlugin.cpp:161
pytorch_quantization.nn.modules.quant_conv.QuantConv1d.forward
def forward(self, input)
Definition: quant_conv.py:204
polygraphy.tools.args.tf2onnx.loader.Tf2OnnxLoaderArgs
Definition: tools/args/tf2onnx/loader.py:22
polygraphy.backend.onnx.loader.ModifyOnnx._model
_model
Definition: backend/onnx/loader.py:123
SampleDynamicReshape::mPreprocessorEngine
SampleUniquePtr< nvinfer1::ICudaEngine > mPreprocessorEngine
Definition: sampleDynamicReshape.cpp:84
tests.calibrator_test.TestPercentileCalibrator.test_repr
def test_repr(self)
Definition: calibrator_test.py:294
nvinfer1::plugin::MultilevelProposeROI::initialize
int initialize() override
Initialize the layer for execution.
Definition: multilevelProposeROIPlugin.cpp:143
polygraphy.tools.precision.precision.STBisect.__init__
def __init__(self)
Definition: precision.py:254
bert::QKVToContextInterleavedPlugin::setPluginNamespace
void setPluginNamespace(const char *pluginNamespace) override
Set the namespace that this plugin object belongs to.
Definition: qkvToContextInt8InterleavedPlugin.cpp:211
samplesCommon::BufferManager::mDeviceBindings
std::vector< void * > mDeviceBindings
The vector of device buffers needed for engine execution.
Definition: buffers.h:458
pytorch_quantization.calib.histogram.HistogramCalibrator._compute_amax_mse
def _compute_amax_mse(self, stride, start_bin)
Definition: histogram.py:166
tests.quant_pooling_test.TestQuantMaxPool2d.test_input_fake_quant_disable
def test_input_fake_quant_disable(self)
Definition: quant_pooling_test.py:96
MaskRCNNConfig::POOL_SIZE
static const int POOL_SIZE
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:29
generate.min
def min(self, *args)
Definition: 08_replacing_a_subgraph/generate.py:23
polygraphy.backend.tf.loader.SessionFromGraph.config
config
Definition: backend/tf/loader.py:406
util.test_misc.FindInDictCase.index
index
Definition: test_misc.py:38
sample::anonymous_namespace{sampleInference.cpp}::EnqueueExplicit::operator()
void operator()(TrtCudaStream &stream) const
Definition: sampleInference.cpp:223
SampleCharRNNParams::seqSize
int seqSize
Definition: sampleCharRNN.cpp:118
sample::LogStreamConsumerBuffer::mPrefix
std::string mPrefix
Definition: samples/common/logging.h:103
polygraphy.util.cuda.Cuda.free
def free(self, ptr)
Definition: cuda.py:65
nvinfer1::IInt8EntropyCalibrator2::~IInt8EntropyCalibrator2
virtual ~IInt8EntropyCalibrator2()
Definition: NvInfer.h:6086
nmtSample::TextReader::TextReader
TextReader(std::shared_ptr< std::istream > textInput, Vocabulary::ptr vocabulary)
Definition: textReader.cpp:26
half_float::detail::nanh
half nanh(const char *)
Get NaN value.
Definition: ieee_half.h:2316
builder.set_tensor_name
def set_tensor_name(tensor, prefix, name)
Definition: builder.py:106
nmtSample::BeamSearchPolicy::getTailWithNoWorkRemaining
int getTailWithNoWorkRemaining()
Definition: beamSearchPolicy.cpp:119
SampleMLPParams::weightsFile
std::string weightsFile
The filename of the weights file.
Definition: sampleMLP.cpp:50
polygraphy.tools.util.script.Script
Used to generate a script that uses the Polygraphy API.
Definition: script.py:66
half_float::detail::unary_specialized::fabs
static half fabs(half arg)
Absolute value implementation.
Definition: samples/common/half.h:2524
polygraphy.tools.surgeon.subtool.sanitize.Sanitize.__init__
def __init__(self)
Definition: sanitize.py:23
SampleMNISTAPI::mEngine
std::shared_ptr< nvinfer1::ICudaEngine > mEngine
The TensorRT engine used to run the network.
Definition: sampleMNISTAPI.cpp:95
BertInference::mInputSizes
std::vector< size_t > mInputSizes
Definition: bert_infer.h:349
sample::setCudaDevice
void setCudaDevice(int device, std::ostream &os)
Definition: sampleDevice.h:354
nvonnxparser::IOnnxConfig::addVerbosity
virtual void addVerbosity()=0
Increase the verbosity level.
nvinfer1::NetworkDefinitionCreationFlag::kEXPLICIT_PRECISION
@ kEXPLICIT_PRECISION
Setting the network to be an explicit precision network has the following implications: 1) Precision ...
polygraphy.tools.args.onnx.loader.OnnxLoaderArgs.do_shape_inference
do_shape_inference
Definition: tools/args/onnx/loader.py:64
helpers.tokenization.validate_case_matches_checkpoint
def validate_case_matches_checkpoint(do_lower_case, init_checkpoint)
Definition: tokenization.py:27
polygraphy.backend.onnx.loader.OnnxFromPath.path
path
Definition: backend/onnx/loader.py:32
bert::SkipLayerNormInterleavedPlugin::getOutputDimensions
nvinfer1::DimsExprs getOutputDimensions(int outputIndex, const nvinfer1::DimsExprs *inputs, int nbInputs, nvinfer1::IExprBuilder &exprBuilder) override
Definition: skipLayerNormInt8InterleavedPlugin.cpp:110
SampleINT8API::build
sample::Logger::TestResult build()
Builds the network engine.
Definition: sampleINT8API.cpp:490
nmtSample::SLPEmbedder::getInfo
std::string getInfo() override
Get the textual description of the component.
Definition: slpEmbedder.cpp:64
nvinfer1::plugin::SpecialSlice::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: specialSlicePlugin.cpp:145
tests.quant_pooling_test.TestQuantMaxPool3d.test_raise
def test_raise(self)
Definition: quant_pooling_test.py:109
SampleMovieLens::mEngine
std::shared_ptr< nvinfer1::ICudaEngine > mEngine
The TensorRT engine used to run the network.
Definition: sampleMovieLens.cpp:151
ParserOnnxConfig::addVerbosity
virtual void addVerbosity()
Increase the verbosity level.
Definition: parserOnnxConfig.h:99
SampleCharRNNLoop::LstmIO::cell
nvinfer1::ITensor * cell
Definition: sampleCharRNN.cpp:245
polygraphy.tools.args.trt.loader.TrtLoaderArgs._outputs
_outputs
Definition: tools/args/trt/loader.py:27
nvinfer1::plugin::GenerateDetectionPluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: generateDetectionPlugin.cpp:51
polygraphy.backend.trt.loader.LoadPlugins.__call__
def __call__(self, *args, **kwargs)
Definition: backend/trt/loader.py:52
nvinfer1::plugin::RPROIPlugin::H
int H
Definition: nvFasterRCNNPlugin.h:99
TLTMaskRCNNConfig::DETECTION_MIN_CONFIDENCE
static const float DETECTION_MIN_CONFIDENCE
Definition: tlt_mrcnn_config.h:44
nvinfer1::plugin::Region::detachFromContext
void detachFromContext() override
Detach the plugin object from its execution context.
Definition: regionPlugin.cpp:466
denoiser.Denoiser.__init__
def __init__(self, waveglow, filter_length=1024, n_overlap=4, win_length=1024, mode='zeros')
Definition: denoiser.py:26
bert::anonymous_namespace{embLayerNormVarSeqlenPlugin.cpp}::EMB_LAYER_NORM_VAR_SEQLEN_NAME
static const char * EMB_LAYER_NORM_VAR_SEQLEN_NAME
Definition: embLayerNormVarSeqlenPlugin.cpp:48
inference.MeasureTime.__enter__
def __enter__(self)
Definition: Tacotron2/inference.py:170
test_ir.TensorBaseTests.test_can_copy_outputs_from_other_node
def test_can_copy_outputs_from_other_node(self)
Definition: test_ir.py:74
nvinfer1::plugin::PriorBox::attachToContext
void attachToContext(cudnnContext *cudnnContext, cublasContext *cublasContext, IGpuAllocator *gpuAllocator) override
Attach the plugin object to an execution context and grant the plugin the access to some context reso...
Definition: priorBoxPlugin.cpp:340
polygraphy.backend.trt.loader.ShapeTuple
Definition: backend/trt/loader.py:235
nvinfer1::INetworkDefinition::addLRN
virtual ILRNLayer * addLRN(ITensor &input, int32_t window, float alpha, float beta, float k)=0
Add an LRN layer to the network.
polygraphy.backend.tf.loader.UseTfTrt.max_batch_size
max_batch_size
Definition: backend/tf/loader.py:229
Int8EntropyCalibrator2::Int8EntropyCalibrator2
Int8EntropyCalibrator2(TBatchStream stream, int firstBatch, const char *networkName, const char *inputBlobName, bool readCache=true)
Definition: EntropyCalibrator.h:105
polygraphy.backend.onnxrt.runner.OnnxrtRunner.infer_impl
def infer_impl(self, feed_dict)
Definition: backend/onnxrt/runner.py:47
onnx_graphsurgeon.logger.logger.Logger
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:60
helpers.data_processing._check_is_max_context
def _check_is_max_context(doc_spans, cur_span_index, position)
Definition: data_processing.py:49
nmtSample::DataWriter::write
virtual void write(const int *hOutputData, int actualOutputSequenceLength, int actualInputSequenceLength)=0
Write the generated sequence.
SampleUffSSD::constructNetwork
bool constructNetwork(SampleUniquePtr< nvinfer1::IBuilder > &builder, SampleUniquePtr< nvinfer1::INetworkDefinition > &network, SampleUniquePtr< nvinfer1::IBuilderConfig > &config, SampleUniquePtr< nvuffparser::IUffParser > &parser)
Parses a UFF model for SSD and creates a TensorRT network.
Definition: sampleUffSSD.cpp:176
polygraphy.tools.precision.precision.STBisect
Definition: precision.py:249
model.Decoder.decoder_rnn
decoder_rnn
Definition: demo/Tacotron2/tacotron2/model.py:267
layers.TacotronSTFT.spectral_normalize
def spectral_normalize(self, magnitudes)
Definition: layers.py:70
onnx_graphsurgeon.ir.tensor.Variable.empty
def empty()
Definition: tensor.py:151
nvinfer1::plugin::GridAnchorBasePluginCreator
Definition: gridAnchorPlugin.h:102
nvinfer1::INetworkDefinition::addConstant
virtual IConstantLayer * addConstant(Dims dimensions, Weights weights)=0
Add a constant layer to the network.
SampleDynamicReshape::buildPreprocessorEngine
bool buildPreprocessorEngine(const SampleUniquePtr< nvinfer1::IBuilder > &builder)
Builds an engine for preprocessing (mPreprocessorEngine).
Definition: sampleDynamicReshape.cpp:127
polygraphy.tools.precision.precision.Precision
Definition: precision.py:291
BertInference::mDeviceBuffers
std::vector< void * > mDeviceBuffers
Definition: bert_infer.h:347
onnx_graphsurgeon.ir.graph.Graph.register
def register(opsets=None)
Definition: graph.py:69
nvinfer1::plugin::RPROIPlugin::isOutputBroadcastAcrossBatch
bool isOutputBroadcastAcrossBatch(int outputIndex, const bool *inputIsBroadcasted, int nbInputs) const override
Definition: nvFasterRCNNPlugin.cpp:262
bert::QKVToContextInterleavedPlugin::getSMVersion
int getSMVersion() const
Definition: qkvToContextInt8InterleavedPlugin.cpp:83
sample::InferenceTrace::enqEnd
float enqEnd
Definition: sampleReporting.h:93
model.Invertible1x1Conv.forward
def forward(self, z)
Definition: demo/Tacotron2/waveglow/model.py:53
PoolParameters::pW
int pW
Definition: sampleUffPluginV2Ext.cpp:183
generate.b
b
Definition: 03_isolating_a_subgraph/generate.py:31
nvinfer1::plugin::ProposalLayer::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: proposalLayerPlugin.cpp:170
nvinfer1::plugin::MultilevelProposeROI::~MultilevelProposeROI
~MultilevelProposeROI() override=default
anonymous_namespace{batchedNMSPlugin.cpp}::NMS_PLUGIN_NAMES
const char * NMS_PLUGIN_NAMES[]
Definition: batchedNMSPlugin.cpp:35
pytorch_quantization.optim.helper.quant_weight_inplace
def quant_weight_inplace(model)
Definition: helper.py:112
tools.common.run_polygraphy_surgeon
def run_polygraphy_surgeon(additional_opts=[], disable_verbose=False)
Definition: tools/common.py:56
sample::Bindings::getBindings
std::unordered_map< std::string, int > getBindings(bool(*predicate)(const Binding &b)) const
Definition: sampleUtils.h:436
UffPoolPluginV2::supportsFormatCombination
bool supportsFormatCombination(int pos, const PluginTensorDesc *inOut, int nbInputs, int nbOutputs) const override
The combination of kLINEAR + kINT8/kHALF/kFLOAT is supported.
Definition: sampleUffPluginV2Ext.cpp:521
tests.quant_modules_test.TestQuantModuleReplace.test_initialize_deactivate
def test_initialize_deactivate(self)
Definition: quant_modules_test.py:73
bert::fused_multihead_attention_v2_int8_256_64_kernel_sm80_cubin
unsigned char fused_multihead_attention_v2_int8_256_64_kernel_sm80_cubin[]
Definition: fused_multihead_attention_v2_int8_256_64_kernel.sm80.cpp:19
builder_varseqlen.BertConfig.intermediate_size
intermediate_size
Definition: builder_varseqlen.py:95
anonymous_namespace{resizeNearestPlugin.cpp}::RESIZE_PLUGIN_VERSION
const char * RESIZE_PLUGIN_VERSION
Definition: resizeNearestPlugin.cpp:32
SampleUffFasterRcnn::infer
bool infer()
Runs the TensorRT inference engine for this sample.
Definition: sampleUffFasterRCNN.cpp:316
nvinfer1::INetworkDefinition::addTopK
virtual ITopKLayer * addTopK(ITensor &input, TopKOperation op, int32_t k, uint32_t reduceAxes)=0
Add a TopK layer to the network.
nvinfer1::INetworkDefinition::addPluginV2
virtual IPluginV2Layer * addPluginV2(ITensor *const *inputs, int32_t nbInputs, IPluginV2 &plugin)=0
Add a plugin layer to the network using the IPluginV2 interface.
sample::anonymous_namespace{sampleInference.cpp}::EnqueueImplicit
Functor to enqueue inference with implicit batch.
Definition: sampleInference.cpp:191
nvinfer1::plugin::RPROIParams::poolingH
int32_t poolingH
Definition: NvInferPluginUtils.h:118
nvinfer1::ISoftMaxLayer::~ISoftMaxLayer
virtual ~ISoftMaxLayer()
Definition: NvInfer.h:2196
LogStreamConsumerBuffer::mOutput
std::ostream & mOutput
Definition: pluginLogging.h:98
data_function.MelAudioLoader.get_mel_audio_pair
def get_mel_audio_pair(self, filename)
Definition: waveglow/data_function.py:41
polygraphy.tools.precision.precision.STCheckerBase.run
def run(self, args)
Definition: precision.py:54
polygraphy.comparator.struct.RunResults
Definition: comparator/struct.py:49
onnx_graphsurgeon.importers.onnx_importer.OnnxImporter.import_graph
Graph import_graph(onnx.GraphProto onnx_graph, "OrderedDict[str, Tensor]" tensor_map=None, opset=None)
Definition: onnx_importer.py:155
sample::Bindings
Definition: sampleUtils.h:304
builder_varseqlen.BertConfig.head_size
head_size
Definition: builder_varseqlen.py:97
SampleUffFasterRcnnParams::inputNodeName
std::string inputNodeName
Definition: sampleUffFasterRCNN.cpp:62
polygraphy.logger.logger.Logger.timestamp
timestamp
Definition: Polygraphy/polygraphy/logger/logger.py:132
tests.quant_conv_test.TestQuantConv2D.test_against_unquantized
def test_against_unquantized(self)
Definition: quant_conv_test.py:210
nvinfer1::UnaryOperation
UnaryOperation
Enumerates the unary operations that may be performed by a Unary layer.
Definition: NvInfer.h:3478
model.LocationLayer.location_dense
location_dense
Definition: demo/Tacotron2/tacotron2/model.py:37
RefineDetectionWorkSpace::argMaxLabelDims
nvinfer1::DimsHW argMaxLabelDims
Definition: maskRCNNKernels.h:99
sample::anonymous_namespace{sampleInference.cpp}::Iteration::getTrace
InferenceTrace getTrace(const TimePoint &cpuStart, const TrtCudaEvent &gpuStart, bool skipTransfers)
Definition: sampleInference.cpp:418
pytorch_quantization.nn.modules.quant_bert.QuantBertSelfAttention.matmul_v_input_quantizer
matmul_v_input_quantizer
Definition: quant_bert.py:75
polygraphy.tools.args.trt.runner.TrtRunnerArgs.add_to_script
def add_to_script(self, script, data_loader_name)
Definition: tools/args/trt/runner.py:51
samplesCommon::ManagedBuffer
The ManagedBuffer class groups together a pair of corresponding device and host buffers.
Definition: buffers.h:222
polygraphy.tools.args.comparator.ComparatorRunArgs._iters
_iters
Definition: tools/args/comparator.py:25
polygraphy.backend.trt_legacy.ConvertToUff.preprocessor
preprocessor
Definition: backend/trt_legacy.py:52
nvonnxparser::IOnnxConfig::getFullTextFileName
virtual const char * getFullTextFileName() const =0
Get the file name of the network description as a text file, including the weights.
nvinfer1::plugin::CoordConvACPluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: coordConvACPlugin.cpp:190
polygraphy.backend.onnx.loader.OnnxFromTfGraph.__call__
def __call__(self)
Definition: backend/onnx/loader.py:76
sample::ReportingOptions::refit
bool refit
Definition: sampleOptions.h:174
nmtSample::SLPProjection::mResizedKernelWeights
std::vector< float > mResizedKernelWeights
Definition: slpProjection.h:55
polygraphy.backend.tf.loader.GraphFromCkpt.__call__
def __call__(self)
Definition: backend/tf/loader.py:179
polygraphy.backend.tf.loader.OptimizeGraph.__call__
def __call__(self)
Definition: backend/tf/loader.py:66
nvinfer1::plugin::GroupNormalizationPlugin::mPluginNamespace
const char * mPluginNamespace
Definition: groupNormalizationPlugin.h:95
Logger::setReportableSeverity
void setReportableSeverity(Severity severity)
Method for controlling the verbosity of logging output.
Definition: pluginLogging.h:275
samplesCommon::Args::useDLACore
int32_t useDLACore
Definition: argsParser.h:84
nvinfer1::IPluginV2::getPluginType
virtual const char * getPluginType() const =0
Return the plugin type.
tests.tensor_quantizer_test.TestTensorQuantizer.test_state_loading
def test_state_loading(self)
Definition: tensor_quantizer_test.py:139
nvinfer1::plugin::NMSParameters::keepTopK
int32_t keepTopK
Definition: NvInferPluginUtils.h:250
test_ir.TestGraph.test_cleanup_remove_unused_node_outputs
def test_cleanup_remove_unused_node_outputs(self)
Definition: test_ir.py:638
data_function.TextMelLoader.__getitem__
def __getitem__(self, index)
Definition: tacotron2/data_function.py:76
nvinfer1::BuilderFlag::kDISABLE_TIMING_CACHE
@ kDISABLE_TIMING_CACHE
Disable reuse of timing information across identical layers.
MNISTBatchStream::mDims
Dims mDims
Definition: BatchStream.h:149
nvinfer1::plugin::NormalizePluginCreator::mEps
float mEps
Definition: normalizePlugin.h:128
polygraphy.comparator.data_loader.DataLoader.input_metadata
input_metadata
Definition: comparator/data_loader.py:91
nvcaffeparser1
The TensorRT Caffe parser API namespace.
SampleUffFasterRcnn::processInput
bool processInput(const samplesCommon::BufferManager &buffers)
Reads the input and mean data, preprocesses, and stores the result in a managed buffer.
Definition: sampleUffFasterRCNN.cpp:378
samplesCommon::readPPMFile
void readPPMFile(const std::string &filename, samplesCommon::PPM< C, H, W > &ppm)
Definition: samples/common/common.h:673
MaskRCNNUtils::RawDetection::x2
float x2
Definition: sampleUffMaskRCNN.cpp:54
nvinfer1::plugin::BatchedNMSDynamicPlugin::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: batchedNMSPlugin.cpp:266
nvinfer1::plugin::CropAndResizePlugin::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: cropAndResizePlugin.cpp:113
polygraphy.backend.trt.loader.ShapeTuple.__str__
def __str__(self)
Definition: backend/trt/loader.py:250
MinimumWorkspaceAlgorithmSelector
Selects Algorithms with minimum workspace requirements.
Definition: sampleAlgorithmSelector.cpp:275
model.Tacotron2.forward
def forward(self, inputs)
Definition: demo/Tacotron2/tacotron2/model.py:648
nvinfer1::LoopOutput::kLAST_VALUE
@ kLAST_VALUE
Output value is value of tensor for last iteration.
sample::anonymous_namespace{sampleInference.cpp}::MultiStream
std::array< TrtCudaStream, static_cast< int >(StreamType::kNUM)> MultiStream
Definition: sampleInference.cpp:271
comparator.test_postprocess.TestTopK
Definition: test_postprocess.py:20
polygraphy.backend.base.runner.BaseRunner.infer_impl
def infer_impl(self)
Definition: backend/base/runner.py:96
nvinfer1::plugin::SpecialSlice::~SpecialSlice
~SpecialSlice() override=default
polygraphy.tools.args.tf.loader.TfLoaderArgs.ckpt
ckpt
Definition: tools/args/tf/loader.py:73
nvcaffeparser1::parseInnerProduct
nvinfer1::ILayer * parseInnerProduct(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &weightFactory, BlobNameToTensor &tensors)
Definition: parseInnerProduct.cpp:23
polygraphy.tools.args.trt.runner.TrtRunnerArgs.__init__
def __init__(self, write=True)
Definition: tools/args/trt/runner.py:22
SampleCharRNNParams::loadEngine
std::string loadEngine
Definition: sampleCharRNN.cpp:125
tools.test_run.TestTrt.test_trt_explicit_profile_implicit_runtime_shape
def test_trt_explicit_profile_implicit_runtime_shape(self)
Definition: test_run.py:117
polygraphy.backend.trt.loader.ModifyNetwork.exclude_outputs
exclude_outputs
Definition: backend/trt/loader.py:205
nvinfer1::plugin::MultilevelProposeROI::mImageSize
nvinfer1::Dims mImageSize
Definition: multilevelProposeROIPlugin.h:113
SampleReformatFreeIO::mTensorFormat
TensorFormat mTensorFormat
Definition: sampleReformatFreeIO.cpp:268
nvinfer1::utility::TRTParsedArgs
TRTParsedArgs is returned by getOptions after it has parsed a command line argument list (argv).
Definition: getOptions.h:57
nvinfer1::MatrixOperation::kNONE
@ kNONE
Treat x as a matrix if it has two dimensions, or as a collection of matrices if x has more than two d...
polygraphy.tools.args.onnxrt.runner.OnnxrtRunnerArgs.check_registered
def check_registered(self)
Definition: tools/args/onnxrt/runner.py:28
nvinfer1::plugin::PyramidROIAlign::check_valid_inputs
void check_valid_inputs(const nvinfer1::Dims *inputs, int nbInputDims)
Definition: pyramidROIAlignPlugin.cpp:144
polygraphy.tools.args.tf.loader.TfLoaderArgs.check_registered
def check_registered(self)
Definition: tools/args/tf/loader.py:66
half_float::detail::functions::cosh
static expr cosh(float arg)
Hyperbolic cosine implementation.
Definition: samples/common/half.h:1951
SampleConfig::mImageFilename
std::string mImageFilename
Definition: sampleConfig.h:40
tests.fixtures.models.QuantLeNet.fc2
fc2
Definition: tools/pytorch-quantization/tests/fixtures/models.py:50
polygraphy.backend.tf.runner.TfRunner
Definition: backend/tf/runner.py:30
backend.trt.test_calibrator.identity_builder_network
def identity_builder_network()
Definition: test_calibrator.py:32
nmtSample::MultiplicativeAlignment::~MultiplicativeAlignment
~MultiplicativeAlignment() override=default
FCPlugin::mBiasWeights
nvinfer1::Weights mBiasWeights
Definition: samples/opensource/samplePlugin/fcPlugin.h:309
nmtSample::LSTMDecoder::mNumLayers
int mNumLayers
Definition: lstmDecoder.h:50
polygraphy.comparator.data_loader.DataLoaderCache.data_loader
data_loader
Definition: comparator/data_loader.py:200
sample::anonymous_namespace{sampleInference.cpp}::Iteration::sync
float sync(const TimePoint &cpuStart, const TrtCudaEvent &gpuStart, std::vector< InferenceTrace > &trace, bool skipTransfers)
Definition: sampleInference.cpp:336
polygraphy.backend.trt.loader.EngineFromBytes._serialized_engine
_serialized_engine
Definition: backend/trt/loader.py:469
nvinfer1::plugin::MultilevelProposeROI::mFGThreshold
float mFGThreshold
Definition: multilevelProposeROIPlugin.h:98
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer.forward
def forward(self, inputs)
Definition: tensor_quantizer.py:317
pytorch_quantization.nn.modules.quant_rnn.QuantLSTMCell.bias
bias
Definition: modules/quant_rnn.py:389
builder_varseqlen.main
def main()
Definition: builder_varseqlen.py:581
bert::fused_multihead_attention_fp16_96_64_kernel_sm75_cu_o
unsigned char fused_multihead_attention_fp16_96_64_kernel_sm75_cu_o[]
Definition: fused_multihead_attention_fp16_96_64_kernel.sm75.cpp:19
helpers.tokenization._is_punctuation
def _is_punctuation(char)
Definition: tokenization.py:416
sample::anonymous_namespace{sampleInference.cpp}::Enqueue::mContext
nvinfer1::IExecutionContext & mContext
Definition: sampleInference.cpp:183
sample::Bindings::dumpInputs
void dumpInputs(const nvinfer1::IExecutionContext &context, std::ostream &os) const
Definition: sampleUtils.h:383
nvinfer1::MatrixOperation::kVECTOR
@ kVECTOR
Treat x as a vector if it has one dimension, or as a collection of vectors if x has more than one dim...
half_float::detail::functions::write
static std::basic_ostream< charT, traits > & write(std::basic_ostream< charT, traits > &out, float arg)
Output implementation.
Definition: ieee_half.h:1245
nvinfer1::plugin::GridAnchorParameters
The Anchor Generator plugin layer generates the prior boxes of designated sizes and aspect ratios acr...
Definition: NvInferPluginUtils.h:143
nvinfer1::plugin::BatchedNMSDynamicPlugin::scoresSize
int scoresSize
Definition: batchedNMSPlugin.h:114
nvcaffeparser1::CaffeParser::mPluginNamespace
std::string mPluginNamespace
Definition: caffeParser.h:92
tests.quant_conv_test.TestQuantConv3D.test_against_unquantized
def test_against_unquantized(self)
Definition: quant_conv_test.py:532
polygraphy.backend.tf.loader.GraphFromFrozen
Definition: backend/tf/loader.py:138
BertInference::mTimes
std::vector< std::vector< float > > mTimes
Definition: bert_infer.h:353
polygraphy.backend.pyt.runner.PytRunner.__init__
def __init__(self, model, input_metadata, output_names, name=None)
Definition: backend/pyt/runner.py:22
polygraphy.tools.registry.MissingTool
Definition: registry.py:30
helpers.calibrator.BertCalibrator.get_quantile
def get_quantile(self)
Definition: demo/BERT/helpers/calibrator.py:101
nvinfer1::plugin::FlattenConcatPluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: flattenConcat.cpp:301
sample::Logger::log
void log(Severity severity, const char *msg) override
Implementation of the nvinfer1::ILogger::log() virtual method.
Definition: samples/common/logging.h:244
polygraphy.comparator.data_loader.DataLoaderCache.save_inputs_path
save_inputs_path
Definition: comparator/data_loader.py:202
polygraphy.tools.args.comparator.ComparatorRunArgs.save_inputs
save_inputs
Definition: tools/args/comparator.py:45
BatchStream::getBatchesRead
int getBatchesRead() const
Definition: frcnnUtils.h:309
nvinfer1::CUDADriverWrapper::cuModuleLoadData
CUresult cuModuleLoadData(CUmodule *module, const void *image) const
Definition: cudaDriverWrapper.cpp:96
Args::help
bool help
Definition: demo/BERT/infer_c/common.h:28
nvcaffeparser1::CaffeWeightFactory::getDataType
nvinfer1::DataType getDataType() const
Definition: caffeWeightFactory.cpp:68
SampleMovieLens::build
bool build()
Builds the network engine.
Definition: sampleMovieLens.cpp:164
SampleINT8APIParams::referenceFileName
std::string referenceFileName
Definition: sampleINT8API.cpp:66
test_ir.TestNode.test_can_copy_inputs_from_other_node
def test_can_copy_inputs_from_other_node(self)
Definition: test_ir.py:177
nvinfer1::WeightsRole::kBIAS
@ kBIAS
Bias for IConvolutionLayer, IDeconvolutionLayer, or IFullyConnectedLayer.
polygraphy.tools.util.script.Script.format_str
def format_str(base_str, *args, **kwargs)
Definition: script.py:68
SampleConfig::setVerbosityLevel
virtual void setVerbosityLevel(Verbosity v)
Definition: sampleConfig.h:125
nvinfer1::IAlgorithmVariant::getImplementation
virtual int64_t getImplementation() const =0
Return implementation of the algorithm.
SampleUffSSD::mInputDims
nvinfer1::Dims mInputDims
The dimensions of the input to the network.
Definition: sampleUffSSD.cpp:93
sample::Binding::fill
void fill()
Definition: sampleUtils.h:232
sample::AllOptions::parse
void parse(Arguments &arguments) override
Definition: sampleOptions.cpp:734
polygraphy.tools.args
Definition: tools/Polygraphy/polygraphy/tools/args/__init__.py:1
test_ir.TestGraph.test_tensors_does_not_modify_tensors
def test_tensors_does_not_modify_tensors(self)
Definition: test_ir.py:563
nvinfer1::utility::TRTOption::helpText
std::string helpText
Text to show when printing out the command usage.
Definition: getOptions.h:39
nmtSample::BLEUScoreWriter::initialize
void initialize() override
Called right before inference starts.
Definition: bleuScoreWriter.cpp:183
SampleCharRNNWeightNames
Definition: sampleCharRNN.cpp:71
bert::EmbLayerNormVarSeqlenPlugin
Definition: embLayerNormVarSeqlenPlugin.h:43
polygraphy.backend.trt.loader.SaveEngine._engine
_engine
Definition: backend/trt/loader.py:501
pytorch_quantization.nn.modules.quant_rnn.QuantRNNBase.forward
def forward(self, input, hx=None)
Definition: modules/quant_rnn.py:185
test_func.TestExtend.test_extend_1_args_1_rv
def test_extend_1_args_1_rv(self)
Definition: test_func.py:80
nvinfer1::EnumMax< LayerType >
constexpr int32_t EnumMax< LayerType >()
Maximum number of elements in LayerType enum.
Definition: NvInfer.h:484
nvinfer1::plugin::MultilevelProposeROIPluginCreator::mFC
static PluginFieldCollection mFC
Definition: multilevelProposeROIPlugin.h:138
infer.DeviceBuffer.__init__
def __init__(self, shape, dtype=trt.int32)
Definition: infer.py:33
evaluate-v1.f1_score
def f1_score(prediction, ground_truth)
Definition: evaluate-v1.1.py:47
sample::runInference
void runInference(const InferenceOptions &inference, InferenceEnvironment &iEnv, int device, std::vector< InferenceTrace > &trace)
Run inference and collect timing.
Definition: sampleInference.cpp:557
polygraphy.tools.precision.precision.STLinear.find
def find(self)
Definition: precision.py:238
bert::SkipLayerNormInterleavedPluginCreator::mNamespace
std::string mNamespace
Definition: skipLayerNormInt8InterleavedPlugin.h:116
bert::fused_multihead_attention_v2_int8_128_64_kernel_cubin
unsigned char fused_multihead_attention_v2_int8_128_64_kernel_cubin[]
Definition: fused_multihead_attention_v2_int8_128_64_kernel.sm72.cpp:19
pytorch_quantization.nn.modules._utils.QuantInputMixin._input_quantizer
_input_quantizer
Definition: _utils.py:130
polygraphy.backend.cntk.runner.CNTKRunner.infer_impl
def infer_impl(self, feed_dict)
Definition: backend/cntk/runner.py:39
nvinfer1::plugin::CropAndResizePlugin::mCropHeight
size_t mCropHeight
Definition: cropAndResizePlugin.h:101
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer.step_size
def step_size(self)
Definition: tensor_quantizer.py:132
samplesCommon::PPM::h
int h
Definition: samples/common/common.h:655
backend.tf.test_runner.TestTfRunner.test_save_timeline
def test_save_timeline(self)
Definition: tf/test_runner.py:42
polygraphy.logger.logger.LoggerVerbosity.severity
severity
Definition: Polygraphy/polygraphy/logger/logger.py:62
nvinfer1::plugin::Reorg::canBroadcastInputAcrossBatch
bool canBroadcastInputAcrossBatch(int inputIndex) const override
Definition: reorgPlugin.cpp:148
nmtSample::BLEUScoreWriter::BLEUScoreWriter
BLEUScoreWriter(std::shared_ptr< std::istream > referenceTextInput, Vocabulary::ptr vocabulary, int maxOrder=4)
Definition: bleuScoreWriter.cpp:150
test_api.TestApi.imported_graph
imported_graph
Definition: test_api.py:27
bert::TFusedMultiHeadAttentionXMMAKernel::FusedMultiHeadAttentionKernelInfo
Definition: fused_multihead_attention.h:282
nvinfer1::ILoop::addRecurrence
virtual IRecurrenceLayer * addRecurrence(ITensor &initialValue) noexcept=0
Create a recurrence layer for this loop with initialValue as its first input.
SampleUffMNIST::teardown
bool teardown()
Used to clean up any state created in the sample class.
Definition: sampleUffMNIST.cpp:321
nvinfer1::SliceMode::kWRAP
@ kWRAP
Coordinates wrap around periodically.
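A one-dimensional sketch may help picture periodic wrap-around. It is plain Python, not the TensorRT slice layer. The helper `slice_wrap` and its `start`, `size`, and `stride` parameters are illustrative names, and reducing each coordinate modulo the input length is an assumption about what "wrap" means here.

```python
def slice_wrap(data, start, size, stride=1):
    """Take `size` elements from `data`, wrapping out-of-range coordinates.

    Each output position i reads the coordinate start + i * stride, reduced
    modulo the input length so it always lands inside `data`.
    """
    n = len(data)
    if n == 0:
        raise ValueError("data must not be empty")
    return [data[(start + i * stride) % n] for i in range(size)]


if __name__ == "__main__":
    print(slice_wrap([10, 20, 30, 40], start=2, size=6))
```

Reading six elements from a four-element input starting at coordinate 2 visits coordinates 2, 3, 0, 1, 2, 3, so the tail of the input is followed by its head.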
onnx_graphsurgeon.logger.logger.Logger.root_dir
root_dir
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:102
samplesCommon::getMaxValue
float getMaxValue(const float *buffer, int64_t size)
Definition: samples/common/common.h:501
nvinfer1::ErrorCode::kUNSUPPORTED_STATE
@ kUNSUPPORTED_STATE
An error occurred due to the network not being supported on the device due to constraints of the hard...
polygraphy.tools.args.logger.LoggerArgs
Definition: Polygraphy/polygraphy/tools/args/logger.py:22
layers.TacotronSTFT.n_mel_channels
n_mel_channels
Definition: layers.py:60
onnx_graphsurgeon.exporters.base_exporter.BaseExporter.export_graph
def export_graph(Graph graph)
Definition: base_exporter.py:21
nvinfer1::IParametricReLULayer
Layer that represents a parametric ReLU operation.
Definition: NvInfer.h:4320
tests.quant_rnn_test.TestQuantLSTMCell.test_against_unquantized
def test_against_unquantized(self, verbose)
Definition: quant_rnn_test.py:133
nvonnxparser::ErrorCode::kMEM_ALLOC_FAILED
@ kMEM_ALLOC_FAILED
sample::Bindings::getDeviceBuffers
void ** getDeviceBuffers()
Definition: sampleUtils.h:334
chptToBin.convert_rnn_bias
def convert_rnn_bias(weights, dimensions, forget_bias=1.0)
Definition: chptToBin.py:285
nvinfer1::plugin::PriorBox::isOutputBroadcastAcrossBatch
bool isOutputBroadcastAcrossBatch(int outputIndex, const bool *inputIsBroadcasted, int nbInputs) const override
Definition: priorBoxPlugin.cpp:302
nvinfer1::TensorFormat::kHWC
@ kHWC
Non-vectorized channel-last format.
inference.MeasureTime.cpu_run
cpu_run
Definition: Tacotron2/inference.py:168
UffPoolPluginV2::mSrcDescriptor
cudnnTensorDescriptor_t mSrcDescriptor
Definition: sampleUffPluginV2Ext.cpp:632
polygraphy.backend.trt.loader.CreateNetwork.__init__
def __init__(self, explicit_precision=None, explicit_batch=None)
Definition: backend/trt/loader.py:61
pytorch_quantization.nn.modules.quant_rnn.GRUCell.__init__
def __init__(self, input_size, hidden_size, bias=True)
Definition: modules/quant_rnn.py:426
nvinfer1::plugin::Region::clone
IPluginV2Ext * clone() const override
Clone the plugin object.
Definition: regionPlugin.cpp:403
nvinfer1::plugin::BatchedNMSPlugin::supportsFormat
bool supportsFormat(DataType type, PluginFormat format) const override
Check format support.
Definition: batchedNMSPlugin.cpp:331
tests.functional_test.TestClip.test_broadcast
def test_broadcast(self)
Definition: functional_test.py:59
nvinfer1::IPluginV2Ext::getTensorRTVersion
int32_t getTensorRTVersion() const
Return the API version with which this plugin was built.
Definition: NvInferRuntimeCommon.h:692
half_float::detail::operator>
enable< bool, T, U >::type operator>(T x, U y)
Comparison for greater than.
Definition: ieee_half.h:2148
sample::MirroredBuffer::mDeviceBuffer
TrtDeviceBuffer mDeviceBuffer
Definition: sampleDevice.h:351
nvinfer1::plugin::ProposalPluginCreator::ProposalPluginCreator
ProposalPluginCreator()
Definition: proposalPlugin.cpp:317
SampleCharRNNParams::vocabSize
int vocabSize
Definition: sampleCharRNN.cpp:120
sample::Bindings::getOutputBindings
std::unordered_map< std::string, int > getOutputBindings() const
Definition: sampleUtils.h:424
nvinfer1::ICudaEngine::getBindingDataType
virtual DataType getBindingDataType(int32_t bindingIndex) const noexcept=0
Determine the required data type for a buffer from its binding index.
chptToBin.convert_rnn_kernel
def convert_rnn_kernel(weights, dimensions, is_decoder_rnn=False)
Definition: chptToBin.py:199
polygraphy.tools.inspect.inspect
Definition: inspect.py:1
sample::anonymous_namespace{sampleInference.cpp}::Enqueue::Enqueue
Enqueue(nvinfer1::IExecutionContext &context, void **buffers)
Definition: sampleInference.cpp:177
nvinfer1::plugin::MultilevelCropAndResize::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: multilevelCropAndResizePlugin.cpp:111
test_ir.TestGraph.test_get_used_node_ids
def test_get_used_node_ids(self, graph)
Definition: test_ir.py:600
anonymous_namespace{priorBoxPlugin.cpp}::PRIOR_BOX_PLUGIN_VERSION
const char * PRIOR_BOX_PLUGIN_VERSION
Definition: priorBoxPlugin.cpp:32
polygraphy.backend.trt.util.get_input_metadata
def get_input_metadata(network)
Definition: trt/util.py:80
export_tacotron2_onnx.DecoderIter
Definition: export_tacotron2_onnx.py:103
samplesCommon::readReferenceFile
bool readReferenceFile(const std::string &fileName, std::vector< std::string > &refVector)
Definition: samples/common/common.h:408
nmtSample::Alignment::getSourceStatesSize
virtual int getSourceStatesSize()=0
Get the size of the source states.
SampleUffFasterRcnn::SampleUniquePtr
std::unique_ptr< T, samplesCommon::InferDeleter > SampleUniquePtr
Definition: sampleUffFasterRCNN.cpp:96
pytorch_quantization.nn._functions.quant_rnn.GRUCell
def GRUCell(input, hidden, w_ih, w_hh, b_ih=None, b_hh=None)
Definition: _functions/quant_rnn.py:74
MaskRCNNUtils::RawDetection
Definition: sampleUffMaskRCNN.cpp:52
SampleConfig::mTF32
bool mTF32
Definition: sampleConfig.h:52
bert::anonymous_namespace{qkvToContextInt8InterleavedPlugin.cpp}::QKV_TO_CONTEXT_INTERLEAVED_PLUGIN_VERSION
static const char * QKV_TO_CONTEXT_INTERLEAVED_PLUGIN_VERSION
Definition: qkvToContextInt8InterleavedPlugin.cpp:39
half_float::detail::fma
expr fma(half x, half y, half z)
Fused multiply add.
Definition: ieee_half.h:2275
SampleUffFasterRcnnParams::calBatchSize
int calBatchSize
Definition: sampleUffFasterRCNN.cpp:79
inference.MeasureTime.measurements
measurements
Definition: Tacotron2/inference.py:166
nvcaffeparser1::ICaffeParser::setErrorRecorder
virtual void setErrorRecorder(nvinfer1::IErrorRecorder *recorder)=0
Set the ErrorRecorder for this interface.
model.LocationLayer.forward
def forward(self, attention_weights_cat)
Definition: demo/Tacotron2/tacotron2/model.py:41
uint64_t
text.cmudict.CMUDict.__len__
def __len__(self)
Definition: cmudict.py:47
polygraphy.util.cuda.Cuda.htod
def htod(self, dst, src, nbytes, stream=None)
Definition: cuda.py:69
pytorch_quantization.nn._functions.quant_rnn.AutogradRNN
def AutogradRNN(mode, input_size, hidden_size, num_layers=1, batch_first=False, dropout=0, train=True, bidirectional=False, variable_length=False, dropout_state=None, flat_weight=None, input_quantizers=None, weight_quantizers=None)
Definition: _functions/quant_rnn.py:236
polygraphy.comparator.postprocess.PostprocessFunc
Definition: postprocess.py:20
TLTMaskRCNNConfig::RPN_ANCHOR_STRIDE
static const int RPN_ANCHOR_STRIDE
Definition: tlt_mrcnn_config.h:73
generate.a
a
Definition: 03_isolating_a_subgraph/generate.py:30
sample::anonymous_namespace{sampleInference.cpp}::EventType::kINPUT_E
@ kINPUT_E
sample::anonymous_namespace{sampleInference.cpp}::inferenceLoop
void inferenceLoop(IterationStreams &iStreams, const TimePoint &cpuStart, const TrtCudaEvent &gpuStart, int iterations, float maxDurationMs, float warmupMs, std::vector< InferenceTrace > &trace, bool skipTransfers)
Definition: sampleInference.cpp:476
nmtSample::MultiplicativeAlignment::mKernelWeights
nvinfer1::Weights mKernelWeights
Definition: multiplicativeAlignment.h:52
export_tacotron2_onnx.Postnet.__init__
def __init__(self, tacotron2)
Definition: export_tacotron2_onnx.py:77
nvinfer1::plugin::SplitPluginCreator::getPluginName
const char * getPluginName() const
Return the plugin name.
Definition: split.h:142
SampleConfig::mModelDtype
nvinfer1::DataType mModelDtype
Definition: sampleConfig.h:51
samplesCommon::Args::runInFp16
bool runInFp16
Definition: argsParser.h:82
AlgorithmCacheReader::reportAlgorithms
void reportAlgorithms(const nvinfer1::IAlgorithmContext *const *algoContexts, const nvinfer1::IAlgorithm *const *algoChoices, int32_t nbAlgorithms) override
Called by TensorRT to report choices it made.
Definition: sampleAlgorithmSelector.cpp:168
SampleINT8::mEngine
std::shared_ptr< nvinfer1::ICudaEngine > mEngine
The TensorRT engine used to run the network.
Definition: sampleINT8.cpp:98
polygraphy.backend.trt.loader.CreateConfig.__call__
def __call__(self, builder, network)
Definition: backend/trt/loader.py:334
nmtSample::DebugUtil::DumpTensorPlugin
Definition: debugUtil.h:38
anonymous_namespace{serialize.hpp}::Serializer< T, typename std::enable_if< std::is_arithmetic< T >::value||std::is_enum< T >::value||std::is_pod< T >::value >::type >::deserialize
static void deserialize(void const **buffer, size_t *buffer_size, T *value)
Definition: serialize.hpp:56
model.WN.end
end
Definition: demo/Tacotron2/waveglow/model.py:108
nvinfer1::plugin::SpecialSlicePluginCreator::~SpecialSlicePluginCreator
~SpecialSlicePluginCreator() override=default
nvinfer1::ILayer::setInput
virtual void setInput(int32_t index, ITensor &tensor)=0
Replace an input of this layer with a specific tensor.
tests.quant_conv_transposed_test.TestQuantConvTranspose2D
Definition: quant_conv_transposed_test.py:48
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer.fake_quant
def fake_quant(self)
Definition: tensor_quantizer.py:143
nvinfer1::plugin::PyramidROIAlign::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: pyramidROIAlignPlugin.cpp:201
MaskRCNNConfig::DETECTION_NMS_THRESHOLD
static const float DETECTION_NMS_THRESHOLD
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:47
nvonnxparser::IOnnxConfig::getTextFileName
virtual const char * getTextFileName() const =0
Returns the File Name of the Network Description as a Text File.
nvonnxparser::IOnnxConfig::setModelDtype
virtual void setModelDtype(const nvinfer1::DataType)=0
Set the Model Data Type.
RefineNMSParameters::scoreThreshold
float scoreThreshold
Definition: maskRCNNKernels.h:87
nmtSample::SoftmaxLikelihood::SoftmaxLikelihoodCombinationOperator::SoftmaxLikelihoodCombinationOperator
SoftmaxLikelihoodCombinationOperator()=default
nvinfer1::plugin::DetectionLayerPluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: detectionLayerPlugin.cpp:57
half_float::detail::nearbyint
half nearbyint(half arg)
Nearest integer using half's internal rounding mode.
Definition: ieee_half.h:2586
generate.y
y
Definition: 06_removing_nodes/generate.py:30
nvinfer1::IPluginCreator::getPluginVersion
virtual const char * getPluginVersion() const =0
Return the plugin version.
nvinfer1::plugin::BatchTilePluginCreator
Definition: batchTilePlugin.h:84
nvinfer1::ITensor::getLocation
virtual TensorLocation getLocation() const =0
Get the storage location of a tensor.
nvcaffeparser1::parseClip
nvinfer1::ILayer * parseClip(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &, BlobNameToTensor &tensors)
Definition: parseClip.cpp:23
nvinfer1::plugin::SplitPluginCreator::setPluginNamespace
void setPluginNamespace(const char *libNamespace)
Set the namespace of the plugin creator based on the plugin library it belongs to.
Definition: split.h:169
polygraphy.backend.tf.loader.UseTfTrt.is_dynamic_op
is_dynamic_op
Definition: backend/tf/loader.py:230
nvinfer1::DimsHW::h
int32_t h() const
Get the height.
Definition: NvInfer.h:134
SampleMNIST::infer
bool infer()
Runs the TensorRT inference engine for this sample.
Definition: sampleMNIST.cpp:291
test_examples.Artifact
Definition: onnx-graphsurgeon/tests/test_examples.py:33
nvinfer1::plugin::CropAndResizePluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: cropAndResizePlugin.h:124
nvinfer1::plugin::ProposalPlugin::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: proposalPlugin.cpp:165
nvinfer1::plugin::LReLU::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: lReluPlugin.cpp:51
sample::printEpilog
void printEpilog(std::vector< InferenceTime > timings, float walltimeMs, float percentile, int queries, std::ostream &os)
Definition: sampleReporting.cpp:112
onnx_graphsurgeon.logger.logger.Logger.ultra_verbose
def ultra_verbose(self, message, mode=LogMode.EACH)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:219
nvinfer1::plugin::BatchTilePlugin::mNamespace
std::string mNamespace
Definition: batchTilePlugin.h:81
nvonnxparser::ErrorCode::kINVALID_GRAPH
@ kINVALID_GRAPH
polygraphy.tools.args.model.ModelArgs._inputs
_inputs
Definition: tools/Polygraphy/polygraphy/tools/args/model.py:28
half_float::detail::functions::isgreater
static bool isgreater(half x, half y)
Comparison implementation.
Definition: samples/common/half.h:2408
nvcaffeparser1::WeightType::kNVSCALE
@ kNVSCALE
nvinfer1::NetworkDefinitionCreationFlag
NetworkDefinitionCreationFlag
List of immutable network properties expressed at network creation time. NetworkDefinitionCreationFla...
Definition: NvInfer.h:6891
test_ir.TestConstant
Definition: test_ir.py:121
nvinfer1::plugin::DetectionOutputParameters::inputOrder
int32_t inputOrder[3]
Definition: NvInferPluginUtils.h:187
preprocess_audio2mel.audio2mel
def audio2mel(dataset_path, audiopaths_and_text, melpaths_and_text, args)
Definition: preprocess_audio2mel.py:56
nvinfer1::plugin::MultilevelCropAndResize::detachFromContext
void detachFromContext() override
Detach the plugin object from its execution context.
Definition: multilevelCropAndResizePlugin.cpp:300
polygraphy.comparator.comparator.Comparator.run
def run(runners, data_loader=None, warm_up=None, use_subprocess=None, subprocess_timeout=None, subprocess_polling_interval=None, save_inputs_path=None)
Definition: comparator/comparator.py:33
nmtSample::DebugUtil::DumpTensorPlugin::DumpTensorPlugin
DumpTensorPlugin(std::shared_ptr< std::ostream > out)
Definition: debugUtil.cpp:28
half_float::detail::islessequal
bool islessequal(half x, half y)
Comparison for less equal.
Definition: ieee_half.h:2806
UffPoolPluginV2::copyDeviceInputToFP32
void copyDeviceInputToFP32(const void *src, void *&dst)
Definition: sampleUffPluginV2Ext.cpp:594
polygraphy.tools.args.onnx.loader.OnnxLoaderArgs.outputs
outputs
Definition: tools/args/onnx/loader.py:67
polygraphy.util.cuda.Stream.__init__
def __init__(self)
Definition: cuda.py:91
nvinfer1::plugin::MultilevelCropAndResizePluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: multilevelCropAndResizePlugin.h:125
nmtSample::Context::~Context
~Context() override=default
pytorch_quantization.tensor_quant.TensorQuantFunction.forward
def forward(ctx, inputs, amax, num_bits=8, unsigned=False, narrow_range=True)
Definition: tensor_quant.py:248
nvinfer1::plugin::SpecialSlicePluginCreator::createPlugin
IPluginV2Ext * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: specialSlicePlugin.cpp:57
nvinfer1::plugin::SpecialSlice::configurePlugin
void configurePlugin(const Dims *inputDims, int nbInputs, const Dims *outputDims, int nbOutputs, const DataType *inputTypes, const DataType *outputTypes, const bool *inputIsBroadcast, const bool *outputIsBroadcast, PluginFormat floatFormat, int maxBatchSize) override
Definition: specialSlicePlugin.cpp:194
bert::TFusedMHAKernelFactory::mKernels
std::unordered_map< uint64_t, const std::unique_ptr< TFusedMHAKernelList > > mKernels
Definition: fused_multihead_attention.h:327
polygraphy.util.cuda.try_get_stream_handle
def try_get_stream_handle(stream)
Definition: cuda.py:107
nvinfer1::plugin::BatchedNMSPlugin::getWorkspaceSize
size_t getWorkspaceSize(int maxBatchSize) const override
Definition: batchedNMSPlugin.cpp:197
comparator.test_comparator.TestComparator.name
name
Definition: test_comparator.py:102
nvinfer1::RNNGateType::kINPUT
@ kINPUT
Input gate (i).
MultilevelProposeROIWorkSpace::sortNMSMarkDims
nvinfer1::DimsHW sortNMSMarkDims
Definition: maskRCNNKernels.h:172
pytorch_quantization.tensor_quant.ScaledQuantDescriptor.__str__
def __str__(self)
Definition: tensor_quant.py:152
nvcaffeparser1::ICaffeParser::setPluginNamespace
virtual void setPluginNamespace(const char *libNamespace)=0
Set the namespace used to lookup and create plugins in the network.
nvinfer1::plugin::ResizeNearest::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: resizeNearestPlugin.cpp:131
half_float::detail::log
expr log(half arg)
Natural logarithm.
Definition: ieee_half.h:2347
nvinfer1::plugin::MultilevelCropAndResize::mInputWidth
int mInputWidth
Definition: multilevelCropAndResizePlugin.h:100
nvinfer1::INetworkDefinition::addSoftMax
virtual ISoftMaxLayer * addSoftMax(ITensor &input)=0
Add a SoftMax layer to the network.
polygraphy.backend.onnx.util.check_model
def check_model(model)
Definition: onnx/util.py:24
nvinfer1::plugin::Quadruple
The Permute plugin layer permutes the input tensor by changing the memory order of the data.
Definition: NvInferPluginUtils.h:66
half_float::detail::lgamma
expr lgamma(half arg)
Natural logarithm of gamma function.
Definition: ieee_half.h:2533
polygraphy.backend.onnx.loader.OnnxFromTfGraph._graph
_graph
Definition: backend/onnx/loader.py:67
polygraphy.tools.surgeon.subtool.insert.Insert.run
def run(self, args)
Definition: insert.py:44
bert::fused_multihead_attention_v2_fp16_256_64_kernel_sm80_cubin_len
unsigned int fused_multihead_attention_v2_fp16_256_64_kernel_sm80_cubin_len
Definition: fused_multihead_attention_v2_fp16_256_64_kernel.sm80.cpp:5427
nmtSample::LimitedSamplesDataReader::gOriginalDataReader
DataReader::ptr gOriginalDataReader
Definition: limitedSamplesDataReader.h:42
export_tacotron2_onnx.Postnet
Definition: export_tacotron2_onnx.py:76
nvinfer1::plugin::ResizeNearest::attachToContext
void attachToContext(cudnnContext *cudnnContext, cublasContext *cublasContext, IGpuAllocator *gpuAllocator) override
Attach the plugin object to an execution context and grant the plugin the access to some context reso...
Definition: resizeNearestPlugin.cpp:252
nvinfer1::ICudaEngine::~ICudaEngine
virtual ~ICudaEngine()
Definition: NvInferRuntime.h:1325
ProposalWorkSpace
Definition: maskRCNNKernels.h:119
perf_varseqlen.main
def main()
Definition: perf_varseqlen.py:39
polygraphy.util.misc.FreeOnException
Definition: Polygraphy/polygraphy/util/misc.py:432
sample::gLogError
LogStreamConsumer gLogError
Definition: logger.cpp:26
test_infer.parse_args
def parse_args(parser)
Definition: test_infer.py:36
pytorch_quantization.tensor_quant.fake_tensor_quant
fake_tensor_quant
Definition: tensor_quant.py:427
AlgorithmCacheWriter::selectAlgorithms
int32_t selectAlgorithms(const nvinfer1::IAlgorithmContext &context, const nvinfer1::IAlgorithm *const *choices, int32_t nbChoices, int32_t *selection) override
Return value in [0, nbChoices] for a valid algorithm.
Definition: sampleAlgorithmSelector.cpp:61
nvonnxparser::IOnnxConfig::reduceVerbosity
virtual void reduceVerbosity()=0
Decrease verbosity level.
nvinfer1::ResizeMode::kNEAREST
@ kNEAREST
ND (0 < N <= 8) nearest neighbor resizing.
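Nearest-neighbor resizing along a single axis can be illustrated as below. This is a plain-Python sketch, not the TensorRT resize layer. The helper `resize_nearest_1d` is hypothetical, and the mapping from output index to input index (floor of `i * in_len / out_len`) is one common convention chosen as an assumption; other coordinate conventions give different results at the boundaries.

```python
def resize_nearest_1d(values, out_len):
    """Resize a 1-D sequence to out_len elements by nearest-neighbor lookup.

    Output index i copies the input element at floor(i * in_len / out_len),
    clamped to the last valid input index.
    """
    in_len = len(values)
    if in_len == 0 or out_len < 0:
        raise ValueError("need a non-empty input and a non-negative out_len")
    return [values[min(in_len - 1, (i * in_len) // out_len)]
            for i in range(out_len)]


if __name__ == "__main__":
    print(resize_nearest_1d([1, 2, 3], 6))
```

Under this convention, doubling the length repeats each input element twice, and halving it keeps every other element.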
onnx_graphsurgeon.ir.graph.NodeIDAdder.__exit__
def __exit__(self, exc_type, exc_value, traceback)
Definition: graph.py:54
nvinfer1::plugin::ProposalPluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: proposalPlugin.h:144
polygraphy.tools.args.trt_legacy.TrtLegacyArgs.tf_loader_args
tf_loader_args
Definition: tools/args/trt_legacy.py:48
SampleMovieLens::SampleMovieLens
SampleMovieLens(const SampleMovieLensParams &params)
Definition: sampleMovieLens.cpp:89
polygraphy.tools.args.data_loader
Definition: tools/args/data_loader.py:1
nvinfer1::ErrorCode::kINVALID_CONFIG
@ kINVALID_CONFIG
An error occurred when comparing the state of an argument relative to other arguments.
pytorch_quantization.nn.modules.quant_conv.QuantConv2d.__init__
def __init__(self, in_channels, out_channels, kernel_size, stride=1, padding=0, dilation=1, groups=1, bias=True, padding_mode='zeros', **kwargs)
Definition: quant_conv.py:95
nvuffparser::UffInputOrder::kNC
@ kNC
NC order.
nvcaffeparser1::parseTanH
nvinfer1::ILayer * parseTanH(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &, BlobNameToTensor &tensors)
Definition: parseTanH.cpp:23
nvinfer1::plugin::GenerateDetection::mImageSize
nvinfer1::Dims mImageSize
Definition: generateDetectionPlugin.h:107
nvinfer1::plugin::PriorBoxParameters::numMinSize
int32_t numMinSize
Definition: NvInferPluginUtils.h:94
pytorch_quantization.nn.modules.quant_bert.QuantBertSelfAttention.all_head_size
all_head_size
Definition: quant_bert.py:61
samplesCommon::SampleParams::fp16
bool fp16
Allow running the network in FP16 mode.
Definition: argsParser.h:41
SampleDynamicReshape::validateOutput
bool validateOutput(int digit)
Checks whether the model prediction (in mOutput) is correct.
Definition: sampleDynamicReshape.cpp:405
UffPoolPluginV2::enqueue
int enqueue(int batchSize, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: sampleUffPluginV2Ext.cpp:424
nvinfer1::plugin::reduced_divisor::reduced_divisor
__host__ __forceinline__ reduced_divisor(int _y)
Definition: reducedMathPlugin.h:58
polygraphy.backend.tf.loader.UseTfTrt._graph
_graph
Definition: backend/tf/loader.py:225
polygraphy.tools.args.comparator.ComparatorCompareArgs.load_results
load_results
Definition: tools/args/comparator.py:108
nvinfer1::plugin::MultilevelProposeROI::attachToContext
void attachToContext(cudnnContext *cudnnContext, cublasContext *cublasContext, IGpuAllocator *gpuAllocator) override
Attach the plugin object to an execution context and grant the plugin the access to some context reso...
Definition: multilevelProposeROIPlugin.cpp:466
nvinfer1::plugin::MultilevelCropAndResize::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: multilevelCropAndResizePlugin.cpp:101
nvinfer1::FillOperation::kLINSPACE
@ kLINSPACE
Generate evenly spaced numbers over a specified interval.
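The general idea of evenly spaced generation can be sketched as follows. The helper `linspace` and its `start`/`stop`/`num` parameters are assumptions chosen for illustration; how the fill layer itself parameterizes the interval is not stated in this entry.

```python
def linspace(start, stop, num):
    """Return `num` evenly spaced floats from `start` to `stop`, endpoints included."""
    if num < 0:
        raise ValueError("num must be non-negative")
    if num == 0:
        return []
    if num == 1:
        # A single sample has no spacing; return the start of the interval.
        return [float(start)]
    step = (stop - start) / (num - 1)
    return [start + i * step for i in range(num)]


points = linspace(0.0, 1.0, 5)
```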
polygraphy.backend.pyt.runner.PytRunner.output_names
output_names
Definition: backend/pyt/runner.py:40
nvinfer1::plugin::FlattenConcat::serializeFromDevice
void serializeFromDevice(char *&hostBuffer, Weights deviceWeights) const
pytorch_quantization.nn.modules.quant_conv.QuantConvTranspose3d.__init__
def __init__(self, in_channels, out_channels, kernel_size, stride=1, padding=0, output_padding=0, groups=1, bias=True, dilation=1, padding_mode='zeros', **kwargs)
Definition: quant_conv.py:355
SampleFasterRCNN::verifyOutput
bool verifyOutput(const samplesCommon::BufferManager &buffers)
Filters output detections, handles post-processing of bounding boxes, and verifies results.
Definition: sampleFasterRCNN.cpp:303
nvinfer1::plugin::CropAndResizePluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: cropAndResizePlugin.cpp:258
pytorch_quantization.nn.modules.quant_rnn.QuantRNNBase._apply
def _apply(self, fn)
Definition: modules/quant_rnn.py:142
nvinfer1::plugin::BatchedNMSDynamicPlugin::getWorkspaceSize
size_t getWorkspaceSize(const PluginTensorDesc *inputs, int nbInputs, const PluginTensorDesc *outputs, int nbOutputs) const override
Definition: batchedNMSPlugin.cpp:203
sample::InferenceOptions::parse
void parse(Arguments &arguments) override
Definition: sampleOptions.cpp:662
LogStreamConsumerBuffer
Definition: pluginLogging.h:32
nvinfer1::IUnaryLayer
Layer that represents a unary operation.
Definition: NvInfer.h:3517
nvonnxparser::IParser::~IParser
virtual ~IParser()
Definition: NvOnnxParser.h:226
samplesCommon::BufferManager::getDeviceBindings
const std::vector< void * > & getDeviceBindings() const
Returns a vector of device buffers.
Definition: buffers.h:286
nvinfer1::plugin::SplitPlugin::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: split.h:58
SampleMLP::SampleMLP
SampleMLP(const SampleMLPParams &params)
Definition: sampleMLP.cpp:63
samplesCommon::BufferManager::memcpyBuffers
void memcpyBuffers(const bool copyInput, const bool deviceToHost, const bool async, const cudaStream_t &stream=0)
Definition: buffers.h:435
sample::anonymous_namespace{sampleInference.cpp}::Iteration::setInputData
void setInputData()
Definition: sampleInference.cpp:371
TLTMaskRCNNConfig::MIN_LEVEL
static const int MIN_LEVEL
Definition: tlt_mrcnn_config.h:60
nmtSample::BenchmarkWriter::getInfo
std::string getInfo() override
get the textual description of the component
Definition: benchmarkWriter.cpp:54
nvinfer1::plugin::DetectionLayer
Definition: detectionLayerPlugin.h:35
tests.quant_conv_transposed_test.TestQuantConvTranspose2D.test_fake_quant_per_tensor
def test_fake_quant_per_tensor(self)
Definition: quant_conv_transposed_test.py:126
nvuffparser::UffInputOrder::kNHWC
@ kNHWC
NHWC order.
nvinfer1::ILoop
Helper for creating a recurrent subgraph.
Definition: NvInfer.h:4645
nvinfer1::ErrorCode::kSUCCESS
@ kSUCCESS
Execution completed successfully.
nvinfer1::UnaryOperation::kCOSH
@ kCOSH
Hyperbolic cosine.
nvcaffeparser1::checkBlobs
bool checkBlobs(const trtcaffe::LayerParameter &msg, int bottoms, int tops)
Definition: opParsers.h:32
polygraphy.backend.tf.runner.TfRunner.deactivate_impl
def deactivate_impl(self)
Definition: backend/tf/runner.py:73
samplesCommon::CaffeSampleParams::prototxtFileName
std::string prototxtFileName
Filename of prototxt design file of a network.
Definition: argsParser.h:53
nmtSample::PinnedHostBuffer::~PinnedHostBuffer
virtual ~PinnedHostBuffer()
Definition: pinnedHostBuffer.h:43
tests.quant_pooling_test.TestQuantAvgPool2d.test_input_fake_quant
def test_input_fake_quant(self)
Definition: quant_pooling_test.py:159
sample::ReportingOptions::output
bool output
Definition: sampleOptions.h:175
SampleConfig::mDebugBuilder
bool mDebugBuilder
Definition: sampleConfig.h:55
backend.onnx.test_loader.TestExportOnnxFromTf.test_opset
def test_opset(self)
Definition: onnx/test_loader.py:47
polygraphy.comparator.struct.IterationResult.__init__
def __init__(self, outputs=None, runtime=None, runner_name=None)
Definition: comparator/struct.py:23
SamplePlugin::SampleUniquePtr
std::unique_ptr< T, samplesCommon::InferDeleter > SampleUniquePtr
Definition: samplePlugin.cpp:58
backend.trt.test_calibrator.TestCalibrator.test_calibrator_with_path_name_cache
def test_calibrator_with_path_name_cache(self, identity_builder_network)
Definition: test_calibrator.py:90
nvonnxparser::IParserError::desc
virtual const char * desc() const =0
description of the error
bert::QKVToContextInterleavedPluginCreator::mPluginAttributes
static std::vector< nvinfer1::PluginField > mPluginAttributes
Definition: qkvToContextInt8InterleavedPlugin.h:124
sample::defaultAvgRuns
constexpr int defaultAvgRuns
Definition: sampleOptions.h:52
tests.quant_instancenorm_test.TestQuantInstanceNorm3D.test_no_quant
def test_no_quant(self)
Definition: quant_instancenorm_test.py:151
nvinfer1::PaddingMode
PaddingMode
Enumerates the modes of padding to perform in convolution, deconvolution and pooling layer,...
Definition: NvInfer.h:1170
pytorch_quantization.nn.modules.quant_bert.QuantBertSelfAttention.transpose_for_scores
def transpose_for_scores(self, x)
Definition: quant_bert.py:78
BertInferenceRunner::BertInferenceRunner
BertInferenceRunner(const std::string &enginePath, const int maxBatchSize, const int maxSeqLength, const bool enableGraph)
Definition: infer_c.cpp:25
pytorch_quantization.nn.modules.quant_pooling
Definition: quant_pooling.py:1
polygraphy.backend.tf.loader.SaveGraph._graph
_graph
Definition: backend/tf/loader.py:322
bert::sMhaKernelMetaInfosV2
static const struct bert::FusedMultiHeadAttentionKernelMetaInfoV2 sMhaKernelMetaInfosV2[]
nvcaffeparser1::CaffeWeightFactory::getBlobsSize
int getBlobsSize(const std::string &layerName)
Definition: caffeWeightFactory.cpp:93
onnx_graphsurgeon.logger.logger.Logger.once_logged
once_logged
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:103
nvinfer1::plugin::ProposalPlugin::getOutputDataType
DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: proposalPlugin.cpp:276
test_ir.TestGraph.test_io_cannot_be_sync_list_on_init
def test_io_cannot_be_sync_list_on_init(self)
Definition: test_ir.py:776
polygraphy.comparator.struct.RunResults.__init__
def __init__(self)
Definition: comparator/struct.py:50
polygraphy.tools.args.trt.runner.TrtRunnerArgs.check_registered
def check_registered(self)
Definition: tools/args/trt/runner.py:42
sample::ModelOptions::uffInputs
UffInput uffInputs
Definition: sampleOptions.h:99
RefineDetectionWorkSpace::totalSize
size_t totalSize
Definition: maskRCNNKernels.h:116
tracer.avgData
def avgData(data, avg, times)
Definition: tracer.py:74
nvinfer1::plugin::InstanceNormalizationPlugin::_epsilon
float _epsilon
Definition: instanceNormalizationPlugin.h:90
samplesCommon::BufferManager
The BufferManager class handles host and device buffer allocation and deallocation.
Definition: buffers.h:237
Args::warmUpRuns
int warmUpRuns
Definition: demo/BERT/infer_c/common.h:33
tests.tensor_quantizer_test.TestTensorQuantizer.test_per_tensor_scale
def test_per_tensor_scale(self)
Definition: tensor_quantizer_test.py:58
model
nvinfer1::utils::transposeSubBuffers
bool transposeSubBuffers(void *input, DataType type, int32_t num, int32_t height, int32_t width)
Transpose num sub-buffers of height * width.
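A minimal sketch of what transposing consecutive sub-buffers means, assuming a flat row-major buffer. The function `transpose_sub_buffers` below is a hypothetical pure-Python stand-in that returns a new list; the C++ utility works on a raw `void *` buffer and its in-place behavior and return value are not modeled here.

```python
def transpose_sub_buffers(flat, num, height, width):
    """Transpose `num` consecutive row-major (height x width) blocks of `flat`.

    Each block becomes a row-major (width x height) block; block order is kept.
    """
    block = height * width
    if len(flat) != num * block:
        raise ValueError("buffer length does not match num * height * width")
    out = []
    for n in range(num):
        base = n * block
        # Walk the source block column by column to emit the transposed rows.
        for col in range(width):
            for row in range(height):
                out.append(flat[base + row * width + col])
    return out


result = transpose_sub_buffers(list(range(1, 13)), num=2, height=2, width=3)
```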
bert::fused_multihead_attention_int8_128_64_kernel_sm75_cu_o
unsigned char fused_multihead_attention_int8_128_64_kernel_sm75_cu_o[]
Definition: fused_multihead_attention_int8_128_64_kernel.sm75.cpp:20
polygraphy.logger.logger.Logger._severity
_severity
Definition: Polygraphy/polygraphy/logger/logger.py:126
pytorch_quantization.nn.functional.clip
clip
Definition: functional.py:63
nvcaffeparser1::CaffeParser::setPluginFactoryV2
void setPluginFactoryV2(nvcaffeparser1::IPluginFactoryV2 *factory) override
Set the IPluginFactoryV2 used to create the user defined pluginV2 objects.
Definition: caffeParser.h:55
nvinfer1::plugin::RPROIPlugin::attachToContext
void attachToContext(cudnnContext *cudnnContext, cublasContext *cublasContext, IGpuAllocator *gpuAllocator) override
Attach the plugin object to an execution context and grant the plugin the access to some context reso...
Definition: nvFasterRCNNPlugin.cpp:304
nvinfer1::IRuntime
Allows a serialized functionally unsafe engine to be deserialized.
Definition: NvInferRuntime.h:726
onnx_graphsurgeon.ir.graph.NodeIDAdder
Definition: graph.py:44
sample::defaultWorkspace
constexpr int defaultWorkspace
Definition: sampleOptions.h:36
TLTMaskRCNNConfig::DETECTION_NMS_THRESHOLD
static const float DETECTION_NMS_THRESHOLD
Definition: tlt_mrcnn_config.h:47
nmtSample::DebugUtil::DumpTensorPlugin::~DumpTensorPlugin
~DumpTensorPlugin() override=default
polygraphy.util.cuda.DeviceBuffer.copy_to
def copy_to(self, host_buffer, stream=None)
Definition: cuda.py:171
SampleMaskRCNN::build
bool build()
Definition: sampleUffMaskRCNN.cpp:352
model.Prenet.forward
def forward(self, x)
Definition: demo/Tacotron2/tacotron2/model.py:116
polygraphy.backend.onnx.loader.ModifyOnnx.do_shape_inference
do_shape_inference
Definition: backend/onnx/loader.py:124
nvinfer1::plugin::CoordConvACPlugin::clone
IPluginV2Ext * clone() const override
Clone the plugin object.
Definition: coordConvACPlugin.cpp:144
AlgorithmCacheReader::AlgorithmCacheItem::formats
std::vector< std::pair< int32_t, int32_t > > formats
Definition: sampleAlgorithmSelector.cpp:241
onnx_graphsurgeon.ir.tensor.Tensor.i
def i(self, tensor_idx=0, producer_idx=0)
Definition: tensor.py:91
SampleMaskRCNN::SampleMaskRCNN
SampleMaskRCNN(const SampleMaskRCNNParams &params)
Definition: sampleUffMaskRCNN.cpp:316
nmtSample::BLEUScoreWriter::write
void write(const int *hOutputData, int actualOutputSequenceLength, int actualInputSequenceLength) override
write the generated sequence
Definition: bleuScoreWriter.cpp:163
polygraphy.backend.onnx.loader.OnnxFromPath.__init__
def __init__(self, path)
Definition: backend/onnx/loader.py:25
nvinfer1::EnumMax< TopKOperation >
constexpr int32_t EnumMax< TopKOperation >()
Maximum number of elements in TopKOperation enum.
Definition: NvInfer.h:4063
nmtSample::BLEUScoreWriter
Evaluates the BLEU score and does nothing else.
Definition: bleuScoreWriter.h:34
sample::argsToArgumentsMap
Arguments argsToArgumentsMap(int argc, char *argv[])
Definition: sampleOptions.cpp:387
nvinfer1::IHostMemory
Class to handle library allocated memory that is accessible to the user.
Definition: NvInferRuntime.h:95
nvinfer1::plugin::CodeTypeSSD
CodeTypeSSD
The type of encoding used for decoding the bounding boxes and loc_data.
Definition: NvInferPluginUtils.h:155
polygraphy.tools.registry
Definition: registry.py:1
nvinfer1::plugin::ProposalPlugin::mAnchorRatioNum
size_t mAnchorRatioNum
Definition: proposalPlugin.h:122
polygraphy.logger.logger.Logger.super_verbose
def super_verbose(self, message, mode=LogMode.EACH)
Definition: Polygraphy/polygraphy/logger/logger.py:254
sample::BuildOptions
Definition: sampleOptions.h:106
nvinfer1::cuErrCheck_
void cuErrCheck_(CUresult stat, const CUDADriverWrapper &wrap, const char *file, int line)
Definition: cudaDriverWrapper.h:88
tests.quant_rnn_test.TestEpilogue.test_save_vectors
def test_save_vectors(self, verbose)
Definition: quant_rnn_test.py:515
pytorch_quantization.nn.modules.quant_pooling.QuantAdaptiveAvgPool1d.forward
def forward(self, input)
Definition: quant_pooling.py:124
nvinfer1::IPluginV2Ext::configureWithFormat
void configureWithFormat(const Dims *, int32_t, const Dims *, int32_t, DataType, PluginFormat, int32_t)
Derived classes should not implement this.
Definition: NvInferRuntimeCommon.h:700
SampleINT8API::mInOut
std::map< std::string, std::string > mInOut
Input and output mapping of the network.
Definition: sampleINT8API.cpp:108
model.Tacotron2.infer
def infer(self, inputs, input_lengths)
Definition: demo/Tacotron2/tacotron2/model.py:667
stft.STFT
Definition: stft.py:42
polygraphy.tools.inspect.subtool
Definition: tools/Polygraphy/polygraphy/tools/inspect/subtool/__init__.py:1
nvinfer1::IPlugin::~IPlugin
virtual ~IPlugin()
Definition: NvInferRuntime.h:212
nvinfer1::IInt8EntropyCalibrator2::getAlgorithm
CalibrationAlgoType getAlgorithm() override
Signal that this is the entropy calibrator 2.
Definition: NvInfer.h:6084
pytorch_quantization.tensor_quant.FakeTensorQuantFunction.backward
def backward(ctx, grad_outputs)
Definition: tensor_quant.py:312
polygraphy.tools.run.run
Definition: run.py:1
model.Tacotron2.encoder
encoder
Definition: demo/Tacotron2/tacotron2/model.py:597
nvinfer1::plugin::NormalizePluginCreator::mAcrossSpatial
bool mAcrossSpatial
Definition: normalizePlugin.h:126
polygraphy.backend.trt_legacy.LoadNetworkFromUff
Definition: backend/trt_legacy.py:80
TLTMaskRCNNConfig::MAX_LEVEL
static const int MAX_LEVEL
Definition: tlt_mrcnn_config.h:61
nvinfer1::plugin::BatchedNMSBasePluginCreator
Definition: batchedNMSPlugin.h:121
polygraphy.tools.surgeon.subtool.sanitize.Sanitize
Definition: sanitize.py:19
sample::InferenceTime::latency
float latency() const
Definition: sampleReporting.h:59
nmtSample::LimitedSamplesDataReader::getInfo
std::string getInfo() override
get the textual description of the component
Definition: limitedSamplesDataReader.cpp:47
tools.test_args.TestParseMeta.test_parse_shape_with_dim_param_double_quote
def test_parse_shape_with_dim_param_double_quote(self)
Definition: test_args.py:47
SampleReformatFreeIO::mOutputDims
nvinfer1::Dims mOutputDims
The dimensions of the output to the network.
Definition: sampleReformatFreeIO.cpp:266
model.Decoder.p_attention_dropout
p_attention_dropout
Definition: demo/Tacotron2/tacotron2/model.py:250
sample::InferenceTime::in
float in
Definition: sampleReporting.h:53
bert::SkipLayerNormInterleavedPluginCreator::createPlugin
nvinfer1::IPluginV2 * createPlugin(const char *name, const nvinfer1::PluginFieldCollection *fc) override
Return a plugin object.
Definition: skipLayerNormInt8InterleavedPlugin.cpp:290
polygraphy.backend.trt_legacy.TrtLegacyRunner.layerwise
layerwise
Definition: backend/trt_legacy.py:223
SampleConfig::mMaxWorkspaceSize
int64_t mMaxWorkspaceSize
Definition: sampleConfig.h:46
bert::QKVToContextInterleavedPluginCreator::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin creator object.
Definition: qkvToContextInt8InterleavedPlugin.cpp:358
nvinfer1::plugin::ResizeNearest::mInputDims
Dims mInputDims
Definition: resizeNearestPlugin.h:91
nvinfer1::plugin::Reorg::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: reorgPlugin.cpp:78
half_float::detail::remquo
expr remquo(half x, half y, int *quo)
Remainder of division.
Definition: ieee_half.h:2264
nvinfer1::plugin::ResizeNearest::destroy
void destroy() override
Destroy the plugin object.
Definition: resizeNearestPlugin.cpp:121
helpers.tokenization.convert_by_vocab
def convert_by_vocab(vocab, items)
Definition: tokenization.py:135
nmtSample::TextReader::mInput
std::shared_ptr< std::istream > mInput
Definition: textReader.h:45
nvinfer1::plugin::BatchedNMSBasePluginCreator::mPluginName
std::string mPluginName
Definition: batchedNMSPlugin.h:136
pytorch_quantization.nn.modules.quant_pooling.QuantAvgPool2d.__init__
def __init__(self, kernel_size, stride=None, padding=0, ceil_mode=False, count_include_pad=True, divisor_override=None, **kwargs)
Definition: quant_pooling.py:91
data_function.TextMelCollate.n_frames_per_step
n_frames_per_step
Definition: tacotron2/data_function.py:87
SampleUffPluginV2Ext::build
bool build()
Creates the network, configures the builder and creates the network engine.
Definition: sampleUffPluginV2Ext.cpp:202
bert::Fused_multihead_attention_params_v2::packed_mask_ptr
void * packed_mask_ptr
Definition: fused_multihead_attention_v2.h:30
onnx_graphsurgeon.ir.tensor.Tensor.to_variable
def to_variable(self, np.dtype dtype=None, Sequence[Union[int, str]] shape=[])
Definition: tensor.py:74
tests.calibrator_test.TestEntropyCalibrator.test_unsigned
def test_unsigned(self, verbose)
Definition: calibrator_test.py:157
polygraphy.backend.trt_legacy.LoadNetworkFromUff.uff_loader
uff_loader
Definition: backend/trt_legacy.py:82
tools.test_precision.test_polygraphy_precision_bisect_sanity
def test_polygraphy_precision_bisect_sanity()
Definition: test_precision.py:34
polygraphy.backend.common.loader
Definition: backend/common/loader.py:1
pytorch_quantization.nn.modules._utils.QuantInputMixin.default_quant_desc_input
default_quant_desc_input
Definition: _utils.py:100
bert::fused_multihead_attention_int8_128_64_kernel_sm80_cu_o
unsigned char fused_multihead_attention_int8_128_64_kernel_sm80_cu_o[]
Definition: fused_multihead_attention_int8_128_64_kernel.sm80.cpp:20
bert::EmbLayerNormVarSeqlenPlugin::mPosEmb
bert::WeightsWithOwnership mPosEmb
Definition: embLayerNormVarSeqlenPlugin.h:101
nvinfer1::ICudaEngine::getNbBindings
virtual int32_t getNbBindings() const noexcept=0
Get the number of binding indices.
nmtSample::LSTMEncoder::LSTMEncoder
LSTMEncoder(ComponentWeights::ptr weights)
Definition: lstmEncoder.cpp:26
SampleMNISTAPI::build
bool build()
Function builds the network engine.
Definition: sampleMNISTAPI.cpp:127
nvinfer1::plugin::MultilevelCropAndResize::destroy
void destroy() override
Destroy the plugin object.
Definition: multilevelCropAndResizePlugin.cpp:115
onnx_graphsurgeon.ir.graph.Graph.__name__
__name__
Definition: graph.py:119
nvinfer1::plugin::BatchTilePlugin::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: batchTilePlugin.cpp:68
polygraphy.tools.util.script.Script.__str__
def __str__(self)
Definition: script.py:231
SampleINT8::mParams
SampleINT8Params mParams
The parameters for the sample.
Definition: sampleINT8.cpp:94
nvinfer1::RNNInputMode::kLINEAR
@ kLINEAR
Perform the normal matrix multiplication in the first recurrent layer.
polygraphy.util.cuda.wrapper
def wrapper()
Definition: cuda.py:84
sample::TrtCudaGraph::operator=
TrtCudaGraph & operator=(const TrtCudaGraph &)=delete
nvinfer1::IPlugin::getNbOutputs
virtual int32_t getNbOutputs() const =0
Get the number of outputs from the layer.
BatchStream::getFileBatch
float * getFileBatch()
Definition: BatchStream.h:285
BatchStream::mLabels
std::vector< float > mLabels
Labels for the batch.
Definition: BatchStream.h:380
SampleCharRNNWeightNames::FCW_NAME
const std::string FCW_NAME
Definition: sampleCharRNN.cpp:77
nmtSample::LSTMEncoder::getInfo
std::string getInfo() override
get the textual description of the component
Definition: lstmEncoder.cpp:114
test_ir.toposort_linear_graph
def toposort_linear_graph()
Definition: test_ir.py:376
onnx_graphsurgeon.exporters.onnx_exporter.OnnxExporter.export_tensor_proto
onnx.TensorProto export_tensor_proto(Constant tensor)
Definition: onnx_exporter.py:32
option::name
const char * name
Definition: getopt.h:76
TLTMaskRCNNConfig::IMAGE_SHAPE
static const nvinfer1::Dims3 IMAGE_SHAPE
Definition: tlt_mrcnn_config.h:27
nvinfer1::plugin::BasePlugin::mNamespace
std::string mNamespace
Definition: plugin.h:57
nvcaffeparser1::CaffeParser::setPluginNamespace
void setPluginNamespace(const char *libNamespace) override
Set the namespace used to lookup and create plugins in the network.
Definition: caffeParser.h:56
MaskRCNNConfig
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:24
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer._if_clip
_if_clip
Definition: tensor_quantizer.py:85
SampleINT8APIParams::dlaCore
int dlaCore
Definition: sampleINT8API.cpp:59
sample::SystemOptions::DLACore
int DLACore
Definition: sampleOptions.h:137
nvinfer1::plugin::ProposalLayerPluginCreator::mPreNMSTopK
int mPreNMSTopK
Definition: proposalLayerPlugin.h:135
sample::BuildOptions::load
bool load
Definition: sampleOptions.h:118
sample::anonymous_namespace{sampleInference.cpp}::Iteration::query
void query(bool skipTransfers)
Definition: sampleInference.cpp:303
tools.common.run_subtool
def run_subtool(subtool, additional_opts, disable_verbose=False)
Definition: tools/common.py:32
SampleMovieLensParams::userToExpectedItemProbMap
std::map< int32_t, std::vector< std::pair< int32_t, float > > > userToExpectedItemProbMap
Definition: sampleMovieLens.cpp:74
onnx_graphsurgeon.ir.graph.Graph.node
node
Definition: graph.py:264
sample::InferenceEnvironment
Definition: sampleInference.h:33
polygraphy.util.format.FormatManager._generate_permutations
def _generate_permutations()
Definition: format.py:34
LogStreamConsumerBuffer::~LogStreamConsumerBuffer
~LogStreamConsumerBuffer()
Definition: pluginLogging.h:47
nvinfer1::plugin::SpecialSlice::initialize
int initialize() override
Initialize the layer for execution.
Definition: specialSlicePlugin.cpp:127
bert::EmbLayerNormVarSeqlenPlugin::configurePlugin
void configurePlugin(const nvinfer1::DynamicPluginTensorDesc *in, int nbInputs, const nvinfer1::DynamicPluginTensorDesc *out, int nbOutputs) override
Definition: embLayerNormVarSeqlenPlugin.cpp:214
helpers.data_processing
Definition: data_processing.py:1
pytorch_quantization.nn.modules.quant_bert.QuantBertSelfAttention.num_attention_heads
num_attention_heads
Definition: quant_bert.py:59
bert::Fused_multihead_attention_params_v2::interleaved
bool interleaved
Definition: fused_multihead_attention_v2.h:68
helpers.tokenization.BasicTokenizer._run_split_on_punc
def _run_split_on_punc(self, text)
Definition: tokenization.py:261
nvinfer1::PluginFieldType::kFLOAT64
@ kFLOAT64
FP64 field type.
polygraphy.backend.trt.loader.LoadPlugins.__init__
def __init__(self, obj=None, plugins=None)
Definition: backend/trt/loader.py:28
ParserOnnxConfig::mModelDtype
nvinfer1::DataType mModelDtype
Definition: parserOnnxConfig.h:47
text.numbers.normalize_numbers
def normalize_numbers(text)
Definition: numbers.py:79
polygraphy.backend.trt.loader.CreateConfig.tf32
tf32
Definition: backend/trt/loader.py:322
nvinfer1::IExprBuilder::constant
virtual const IDimensionExpr * constant(int32_t value)=0
Return pointer to IDimensionExpr for given value.
half_float::half::operator--
half & operator--()
Prefix decrement.
Definition: samples/common/half.h:1530
nvinfer1::plugin::CropAndResizePlugin::mInputWidth
size_t mInputWidth
Definition: cropAndResizePlugin.h:101
ParserOnnxConfig::setModelDtype
virtual void setModelDtype(const nvinfer1::DataType modelDtype)
Set the Model Data Type.
Definition: parserOnnxConfig.h:77
polygraphy.tools.surgeon.subtool.extract
Definition: extract.py:1
backend.onnx.test_loader.TestLoggerCallbacks.test_set_severity
def test_set_severity(self, sev)
Definition: onnx/test_loader.py:31
pytorch_quantization.nn.modules.quant_rnn.QuantLSTMCell.weight_ih
weight_ih
Definition: modules/quant_rnn.py:390
test_func.TestExtend.test_override_rv
def test_override_rv(self)
Definition: test_func.py:20
polygraphy.backend.onnxrt.runner.OnnxrtRunner.get_input_metadata
def get_input_metadata(self)
Definition: backend/onnxrt/runner.py:59
model.Decoder.linear_projection
linear_projection
Definition: demo/Tacotron2/tacotron2/model.py:271
polygraphy.backend.onnx
Definition: tools/Polygraphy/polygraphy/backend/onnx/__init__.py:1
nvinfer1::plugin::read
T read(const char *&buffer)
Definition: plugin.h:87
half_float::detail::half2float
T half2float(uint16 value)
Convert half-precision to floating point.
Definition: ieee_half.h:946
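The conversion can be illustrated by decoding the IEEE 754 binary16 bit layout by hand (1 sign bit, 5 exponent bits with bias 15, 10 mantissa bits). This is a sketch of the format, not the header's implementation; `half_bits_to_float` is a hypothetical name.

```python
def half_bits_to_float(bits):
    """Decode a 16-bit IEEE 754 binary16 pattern (given as an int) to a float."""
    if not 0 <= bits <= 0xFFFF:
        raise ValueError("bits must fit in 16 bits")
    sign = -1.0 if (bits >> 15) & 1 else 1.0
    exponent = (bits >> 10) & 0x1F
    mantissa = bits & 0x3FF
    if exponent == 0:
        # Zero and subnormals: no implicit leading 1, fixed scale of 2**-24.
        return sign * mantissa * 2.0 ** -24
    if exponent == 0x1F:
        # All-ones exponent encodes infinity (mantissa 0) or NaN (otherwise).
        return sign * float("inf") if mantissa == 0 else float("nan")
    # Normal numbers: implicit leading 1 and exponent bias of 15.
    return sign * (1.0 + mantissa / 1024.0) * 2.0 ** (exponent - 15)


one = half_bits_to_float(0x3C00)
```

Every binary16 value is exactly representable as a Python float, so the decoding loses no precision.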
nmtSample::SLPProjection::mKernelWeights
nvinfer1::Weights mKernelWeights
Definition: slpProjection.h:52
half_float::detail::erf
expr erf(half arg)
Error function.
Definition: ieee_half.h:2519
polygraphy.util.misc.unpack_args
def unpack_args(args, num)
Definition: Polygraphy/polygraphy/util/misc.py:115
nvuffparser::FieldType::kCHAR
@ kCHAR
char field type. String for length>1.
sample::anonymous_namespace{sampleInference.cpp}::Iteration::mGraph
TrtCudaGraph mGraph
Definition: sampleInference.cpp:459
bert::QKVToContextInterleavedPluginCreator
Definition: qkvToContextInt8InterleavedPlugin.h:103
tests.calibrator_test.TestEntropyCalibrator
Definition: calibrator_test.py:140
SampleDynamicReshape::mPredictionInputDims
nvinfer1::Dims mPredictionInputDims
The dimensions of the input of the MNIST model.
Definition: sampleDynamicReshape.cpp:80
polygraphy.tools.surgeon.subtool.base.BaseSurgeonSubtool.add_parser_args
def add_parser_args(self, parser, gs=False, output=False)
Definition: surgeon/subtool/base.py:36
nvonnxparser::IOnnxConfig::Verbosity
int Verbosity
Defines Verbosity level.
Definition: NvOnnxConfig.h:55
SampleConfig::getEngineFileName
const char * getEngineFileName() const
Definition: sampleConfig.h:129
profiler.addReference
def addReference(row, reference)
Definition: profiler.py:108
polygraphy.tools.args.tf.loader.TfLoaderArgs._enable_outputs
_enable_outputs
Definition: tools/args/tf/loader.py:26
nvinfer1::plugin::CoordConvACPlugin::iH
int iH
Definition: coordConvACPlugin.h:86
nvinfer1::plugin::BatchedNMSDynamicPlugin::numPriors
int numPriors
Definition: batchedNMSPlugin.h:115
nvinfer1::plugin::MultilevelProposeROIPluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: multilevelProposeROIPlugin.cpp:59
inference.load_and_setup_model
def load_and_setup_model(model_name, parser, checkpoint, fp16_run, cpu_run, forward_is_infer=False)
Definition: Tacotron2/inference.py:100
data_function.TextMelLoader.get_text
def get_text(self, text)
Definition: tacotron2/data_function.py:72
sample::Logger::TestResult::kPASSED
@ kPASSED
The test passed.
samplesCommon::getW
int getW(const Dims &d)
Definition: samples/common/common.h:891
tests.quant_pooling_test.TestQuantAdaptiveAvgPool3d
Definition: quant_pooling_test.py:287
nvcaffeparser1::ICaffeParser::parseBinaryProto
virtual IBinaryProtoBlob * parseBinaryProto(const char *fileName)=0
Parse and extract data stored in binaryproto file.
onnx_graphsurgeon.ir.graph.Graph.__init__
def __init__(self, Sequence[Node] nodes=None, Sequence[Tensor] inputs=None, Sequence[Tensor] outputs=None, name=None, doc_string=None, opset=None)
Definition: graph.py:105
nvinfer1::plugin::Normalize::getOutputDataType
DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: normalizePlugin.cpp:177
std::numeric_limits< half_float::half >::round_error
static half_float::half round_error()
Maximum rounding error.
Definition: samples/common/half.h:4240
tests.functional_test.TestClip
Definition: functional_test.py:35
test_exporters.TestOnnxExporter.test_export_variable_tensor
def test_export_variable_tensor(self)
Definition: test_exporters.py:63
sample::anonymous_namespace{sampleEngines.cpp}::RndInt8Calibrator::mErr
std::ostream & mErr
Definition: sampleEngines.cpp:188
nvinfer1::plugin::ResizeNearestPluginCreator::deserializePlugin
IPluginV2Ext * deserializePlugin(const char *name, const void *data, size_t length) override
Called during deserialization of plugin layer.
Definition: resizeNearestPlugin.cpp:77
BertInference::reportTiming
void reportTiming(int batchIndex, int batchSize)
Definition: bert_infer.h:305
helpers.tokenization.convert_tokens_to_ids
def convert_tokens_to_ids(vocab, tokens)
Definition: tokenization.py:143
nvinfer1::IAlgorithmSelector
Interface implemented by application for selecting and reporting algorithms of a layer provided by th...
Definition: NvInfer.h:6305
nmtSample::DeviceBuffer
Definition: deviceBuffer.h:27
nvinfer1::plugin::CropAndResizePlugin::configurePlugin
void configurePlugin(const Dims *inputDims, int nbInputs, const Dims *outputDims, int nbOutputs, const DataType *inputTypes, const DataType *outputTypes, const bool *inputIsBroadcast, const bool *outputIsBroadcast, PluginFormat floatFormat, int maxBatchSize) override
Definition: cropAndResizePlugin.cpp:219
onnx_graphsurgeon.importers.onnx_importer.OnnxImporter.import_node
Node import_node(onnx.NodeProto onnx_node, "OrderedDict[str, Tensor]" tensor_map, "OrderedDict[str, Tensor]" subgraph_tensor_map)
Definition: onnx_importer.py:89
nvinfer1::plugin::SplitPlugin::_x_stride
int _x_stride
Definition: split.h:43
nvinfer1::plugin::Normalize::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: normalizePlugin.cpp:171
pytorch_quantization.nn.modules._utils.QuantInputMixin.set_default_quant_desc_input
def set_default_quant_desc_input(cls, value)
Definition: _utils.py:103
nvinfer1::plugin::GroupNormalizationPlugin::mNamespace
std::string mNamespace
Definition: groupNormalizationPlugin.h:96
nvonnxparser::IOnnxConfig::setModelFileName
virtual void setModelFileName(const char *onnxFilename)=0
Set the Model File Name.
nvinfer1::plugin::MultilevelProposeROI::canBroadcastInputAcrossBatch
bool canBroadcastInputAcrossBatch(int inputIndex) const override
Definition: multilevelProposeROIPlugin.cpp:443
onnx_graphsurgeon.ir.node.Node.__setattr__
def __setattr__(self, name, value)
Definition: node.py:85
polygraphy.tools.precision.precision.STCheckerBase.mark_layers
def mark_layers(self, indices)
Definition: precision.py:102
tools.test_run.TestTf
Definition: test_run.py:144
Logger::reportTestResult
static void reportTestResult(const TestAtom &testAtom, TestResult result)
method that implements logging test results
Definition: pluginLogging.h:439
nvinfer1::plugin::GridAnchorRectPluginCreator
Definition: gridAnchorPlugin.h:134
tests.quant_conv_test.TestQuantConv1D.test_against_unquantized
def test_against_unquantized(self)
Definition: quant_conv_test.py:433
samplesCommon::Args::help
bool help
Definition: argsParser.h:83
nvinfer1::plugin::GenerateDetection::mBackgroundLabel
int mBackgroundLabel
Definition: generateDetectionPlugin.h:94
test_ir.TensorBaseTests.test_set_outputs_updates_old_outputs
def test_set_outputs_updates_old_outputs(self)
Definition: test_ir.py:61
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer._load_from_state_dict
def _load_from_state_dict(self, state_dict, prefix, *args, **kwargs)
Definition: tensor_quantizer.py:379
polygraphy.tools.args.tf.config.TfConfigArgs.allow_growth
allow_growth
Definition: tools/Polygraphy/polygraphy/tools/args/tf/config.py:31
half_float::half::operator=
half & operator=(detail::expr rhs)
Assignment operator.
Definition: samples/common/half.h:1431
SamplePlugin::build
bool build()
Builds the network engine.
Definition: samplePlugin.cpp:127
sample::BuildOptions::workspace
int workspace
Definition: sampleOptions.h:109
pytorch_quantization.nn._functions.quant_rnn.RNNReLUCell
def RNNReLUCell(input, hidden, w_ih, w_hh, b_ih=None, b_hh=None)
Definition: _functions/quant_rnn.py:36
polygraphy.common.func.make_iterable
def make_iterable(obj)
Definition: func.py:21
nvinfer1::plugin::GridAnchorGenerator::isOutputBroadcastAcrossBatch
bool isOutputBroadcastAcrossBatch(int outputIndex, const bool *inputIsBroadcasted, int nbInputs) const override
Definition: gridAnchorPlugin.cpp:309
samplesCommon::TypedHostMemory::destroy
void destroy() noexcept override
Destroy the allocated memory.
Definition: samples/common/common.h:343
perf_varseqlen.DeviceBuffer.binding
def binding(self)
Definition: perf_varseqlen.py:32
nvinfer1::plugin::Normalize::C
int C
Definition: normalizePlugin.h:101
SampleCharRNNBase::mWeightMap
std::map< std::string, nvinfer1::Weights > mWeightMap
Definition: sampleCharRNN.cpp:185
SampleMovieLensParams
The SampleMovieLensParams structure groups the additional parameters required by the MovieLens sample...
Definition: sampleMovieLens.cpp:62
SampleReformatFreeIO::build
bool build(int dataWidth)
Builds the network engine.
Definition: sampleReformatFreeIO.cpp:283
SampleINT8APIArgs::modelFileName
std::string modelFileName
Definition: sampleINT8API.cpp:651
pytorch_quantization.calib.calibrator._Calibrator._unsigned
_unsigned
Definition: tools/pytorch-quantization/pytorch_quantization/calib/calibrator.py:34
polygraphy.backend.trt.loader.ModifyNetwork.__init__
def __init__(self, network, outputs=None, exclude_outputs=None)
Definition: backend/trt/loader.py:184
SampleConfig::getVerbosityLevel
Verbosity getVerbosityLevel() const
Get the Verbosity Level.
Definition: sampleConfig.h:113
pytorch_quantization.nn._functions
Definition: tools/pytorch-quantization/pytorch_quantization/nn/_functions/__init__.py:1
nvinfer1::plugin::DetectionLayerPluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: detectionLayerPlugin.cpp:47
nvinfer1::WeightsRole::kSHIFT
@ kSHIFT
shift part of IScaleLayer
SampleINT8::mInputDims
nvinfer1::Dims mInputDims
The dimensions of the input to the network.
Definition: sampleINT8.cpp:96
test_ir.TestNodeIO.test_setitem
def test_setitem(self, field_names)
Definition: test_ir.py:309
profiler.refFeatures
def refFeatures(names)
Definition: profiler.py:89
nmtSample::LSTMDecoder::getStateSizes
std::vector< nvinfer1::Dims > getStateSizes() override
Get the sizes of the hidden state vectors, returned as a vector.
Definition: lstmDecoder.cpp:133
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer._scale_amax
_scale_amax
Definition: tensor_quantizer.py:77
SampleINT8API::mOutputDims
nvinfer1::Dims mOutputDims
The dimensions of the output of the network.
Definition: sampleINT8API.cpp:112
nvinfer1::plugin::PriorBoxPluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: priorBoxPlugin.cpp:368
nvinfer1::IPluginV2DynamicExt
Definition: NvInferRuntime.h:417
replace
Definition: replace.py:1
nvinfer1::IPlugin::getWorkspaceSize
virtual size_t getWorkspaceSize(int32_t maxBatchSize) const =0
Find the workspace size required by the layer.
nvinfer1::MatrixOperation
MatrixOperation
Enumerates the operations that may be performed on a tensor by IMatrixMultiplyLayer before multiplica...
Definition: NvInfer.h:4132
SampleCharRNNBase::~SampleCharRNNBase
virtual ~SampleCharRNNBase()=default
nvinfer1::plugin::DetectionOutputParameters::nmsThreshold
float nmsThreshold
Definition: NvInferPluginUtils.h:185
pytorch_quantization.nn.modules.quant_pooling.QuantAvgPool1d.forward
def forward(self, input)
Definition: quant_pooling.py:85
nvinfer1::plugin::NMSPluginCreator::NMSPluginCreator
NMSPluginCreator()
Definition: nmsPlugin.cpp:260
PoolParameters::mQ
int mQ
Definition: sampleUffPluginV2Ext.cpp:177
nvinfer1::plugin::ProposalLayerPluginCreator::mKeepTopK
int mKeepTopK
Definition: proposalLayerPlugin.h:136
nvinfer1::EnumMax< NetworkDefinitionCreationFlag >
constexpr int32_t EnumMax< NetworkDefinitionCreationFlag >()
Maximum number of elements in NetworkDefinitionCreationFlag enum.
Definition: NvInfer.h:6916
onnx_graphsurgeon.ir.graph.Graph._get_used_node_ids
def _get_used_node_ids(self)
Definition: graph.py:182
nvinfer1::plugin::CublasError
Definition: checkMacrosPlugin.h:119
sample::Binding::dump
void dump(std::ostream &os, const std::string separator=" ") const
Definition: sampleUtils.h:268
nvinfer1::IInt8EntropyCalibrator
Entropy calibrator.
Definition: NvInfer.h:6063
nvinfer1::plugin::gLogger
ILogger * gLogger
Definition: checkMacrosPlugin.cpp:28
nvcaffeparser1::CaffeWeightFactory::getAllWeights
std::vector< nvinfer1::Weights > getAllWeights(const std::string &layerName)
Definition: caffeWeightFactory.cpp:131
nvcaffeparser1::CaffeWeightFactory::mRef
std::unique_ptr< trtcaffe::NetParameter > mRef
Definition: caffeWeightFactory.h:59
nvinfer1::plugin::BatchedNMSPlugin::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: batchedNMSPlugin.cpp:248
nvinfer1::IPluginV2DynamicExt::clone
IPluginV2DynamicExt * clone() const =0
Clone the plugin object.
samplesCommon::BufferManager::size
size_t size(const std::string &tensorName) const
Returns the size of the host and device buffers that correspond to tensorName.
Definition: buffers.h:313
xy_t
Definition: maskRCNNKernels.h:255
half_float::detail::unary_specialized
Wrapper for unary half-precision functions needing specialization for individual argument types.
Definition: ieee_half.h:1050
nvinfer1::plugin::MultilevelCropAndResize::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: multilevelCropAndResizePlugin.cpp:217
model.Decoder.gate_threshold
gate_threshold
Definition: demo/Tacotron2/tacotron2/model.py:249
nvcaffeparser1::CaffeParser
Definition: caffeParser.h:32
nvinfer1::plugin
Definition: NvInferPluginUtils.h:58
tests.utils.copy_state_and_quantize_fused
def copy_state_and_quantize_fused(dst, src, num_bits)
Definition: tools/pytorch-quantization/tests/utils.py:52
test_util.test_combine_dicts_second_overwrites_first
def test_combine_dicts_second_overwrites_first()
Definition: onnx-graphsurgeon/tests/test_util.py:20
inference_varseqlen.parse_args
def parse_args()
Definition: inference_varseqlen.py:39
samplesCommon::GenericBuffer::mSize
size_t mSize
Definition: buffers.h:172
nvinfer1::IPluginV2IOExt::DataType
const Dims DataType
Definition: NvInferRuntimeCommon.h:792
test_ir.TensorBaseTests.test_equals
def test_equals(self)
Definition: test_ir.py:52
UffPoolPluginV2::setPluginNamespace
void setPluginNamespace(const char *libNamespace) override
Set the namespace that this plugin object belongs to.
Definition: sampleUffPluginV2Ext.cpp:558
polygraphy.util.misc.FreeOnException.objs
objs
Definition: Polygraphy/polygraphy/util/misc.py:442
bert::threadsPerCta384
constexpr size_t threadsPerCta384
Definition: embLayerNormVarSeqlenPlugin.cpp:34
stft.STFT.forward
def forward(self, input_data)
Definition: stft.py:139
onnx_graphsurgeon.ir.node.Node.i
def i(self, tensor_idx=0, producer_idx=0)
Definition: node.py:44
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer.enable_quant
def enable_quant(self)
Definition: tensor_quantizer.py:186
builder_varseqlen.BertConfig.use_int8
use_int8
Definition: builder_varseqlen.py:99
half_float::detail::round
half round(half arg)
Nearest integer.
Definition: ieee_half.h:2572
polygraphy.tools.args.comparator.ComparatorCompareArgs.add_to_script
def add_to_script(self, script, results_name)
Definition: tools/args/comparator.py:115
nvuffparser::IPluginFactory
Plugin factory used to configure plugins.
Definition: NvUffParser.h:98
perf
Definition: perf.py:1
nmtSample::SequenceProperties::getStartSequenceId
virtual int getStartSequenceId()=0
polygraphy.tools.args.trt.loader.TrtLoaderArgs.workspace
workspace
Definition: tools/args/trt/loader.py:97
SampleUffFasterRcnnParams::inputWidth
int inputWidth
Definition: sampleUffFasterRCNN.cpp:70
sample::ReportingOptions::exportOutput
std::string exportOutput
Definition: sampleOptions.h:178
SampleDynamicReshape::mPredictionContext
SampleUniquePtr< nvinfer1::IExecutionContext > mPredictionContext
Definition: sampleDynamicReshape.cpp:86
nvinfer1::TensorFormat::kNCHW
@ kNCHW
Deprecated name of kLINEAR, provided for backwards compatibility and will be removed in TensorRT 8....
pytorch_quantization.nn.modules.quant_rnn.QuantRNNBase.all_weights
def all_weights(self)
Definition: modules/quant_rnn.py:265
test_ir.TestNodeIO.test_clear
def test_clear(self, field_names)
Definition: test_ir.py:286
pytorch_quantization.nn.modules.quant_rnn.QuantRNNBase.dropout_state
dropout_state
Definition: modules/quant_rnn.py:52
nvinfer1::ITensor::setDimensions
virtual void setDimensions(Dims dimensions)=0
Set the dimensions of a tensor.
bert::EmbLayerNormVarSeqlenPlugin::mTokEmb
bert::WeightsWithOwnership mTokEmb
Definition: embLayerNormVarSeqlenPlugin.h:100
bert::EmbLayerNormVarSeqlenPlugin::mTokEmbDev
bert::cuda_unique_ptr< void > mTokEmbDev
Definition: embLayerNormVarSeqlenPlugin.h:91
parserutils::volume
int64_t volume(const nvinfer1::Dims &d)
Definition: parserUtils.h:47
sample::anonymous_namespace{sampleReporting.cpp}::findMedian
float findMedian(const std::vector< InferenceTime > &timings, const T &toFloat)
Find median in a sorted sequence of timings.
Definition: sampleReporting.cpp:55
nvinfer1::PluginFieldType::kFLOAT16
@ kFLOAT16
FP16 field type.
nvinfer1::plugin::LReluPluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: lReluPlugin.cpp:150
nvinfer1::PoolingType
PoolingType
The type of pooling to perform in a pooling layer.
Definition: NvInfer.h:1706
nvinfer1::plugin::Reorg::mPluginNamespace
std::string mPluginNamespace
Definition: reorgPlugin.h:89
polygraphy.backend.tf.loader.ModifyGraph.__call__
def __call__(self)
Definition: backend/tf/loader.py:290
sample::anonymous_namespace{sampleOptions.cpp}::insertShapesInference
void insertShapesInference(std::unordered_map< std::string, std::vector< int >> &shapes, const std::string &name, const std::vector< int > &dims)
Definition: sampleOptions.cpp:223
polygraphy.tools.args.model.ModelArgs.model_type
model_type
Definition: tools/Polygraphy/polygraphy/tools/args/model.py:94
nvinfer1::plugin::ProposalPlugin::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: proposalPlugin.cpp:150
nvinfer1::plugin::CropAndResizePluginCreator::mFC
static PluginFieldCollection mFC
Definition: cropAndResizePlugin.h:123
polygraphy.tools.args.trt_legacy
Definition: tools/args/trt_legacy.py:1
nvinfer1::plugin::GroupNormalizationPluginCreator::deserializePlugin
IPluginV2DynamicExt * deserializePlugin(const char *name, const void *serialData, size_t serialLength) override
Called during deserialization of plugin layer.
Definition: groupNormalizationPlugin.cpp:309
polygraphy.tools.precision.precision.STLinear
Definition: precision.py:229
nvcaffeparser1::IBinaryProtoBlob::getDataType
virtual nvinfer1::DataType getDataType()=0
tests.quant_conv_test.TestQuantConv2D.test_fake_quant_per_channel_bias
def test_fake_quant_per_channel_bias(self)
Definition: quant_conv_test.py:191
nvcaffeparser1::parseAbsVal
nvinfer1::ILayer * parseAbsVal(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &, BlobNameToTensor &tensors)
Definition: parseAbsVal.cpp:23
bert::FusedMultiHeadAttentionKernelMetaInfoV2::mCubin
const unsigned char * mCubin
Definition: fused_multihead_attention_v2.h:164
polygraphy.backend.tf.loader.GraphFromKeras.path
path
Definition: backend/tf/loader.py:119
polygraphy.backend.tf.loader.GraphFromKeras
Definition: backend/tf/loader.py:111
builder_varseqlen.set_output_range
def set_output_range(layer, maxval, out_idx=0)
Definition: builder_varseqlen.py:117
SampleMovieLens
The SampleMovieLens class implements the MovieLens sample.
Definition: sampleMovieLens.cpp:83
polygraphy.tools.precision.precision.STWorstFirst.add_parser_args
def add_parser_args(self, parser)
Definition: precision.py:175
polygraphy.tools.surgeon.subtool.base.Config
Definition: surgeon/subtool/base.py:60
nvinfer1::plugin::Reorg::setPluginNamespace
void setPluginNamespace(const char *pluginNamespace) override
Set the namespace that this plugin object belongs to.
Definition: reorgPlugin.cpp:121
pytorch_quantization.nn.modules.quant_instancenorm.QuantInstanceNorm2d.forward
def forward(self, input)
Definition: quant_instancenorm.py:60
bert::xmmasM256
constexpr size_t xmmasM256
Definition: embLayerNormVarSeqlenPlugin.cpp:38
nvcaffeparser1::parseBNLL
nvinfer1::ILayer * parseBNLL(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &, BlobNameToTensor &tensors)
Definition: parseBNLL.cpp:23
tools.test_run.TestGen.test_polygraphy_run_gen_script
def test_polygraphy_run_gen_script(self)
Definition: test_run.py:35
nvinfer1::plugin::reduced_divisor::shift_coeff
unsigned int shift_coeff
Definition: reducedMathPlugin.h:97
nvinfer1::plugin::RPROIPluginCreator::params
RPROIParams params
Definition: nvFasterRCNNPlugin.h:122
polygraphy.util.format.FormatManager
Definition: format.py:32
SampleReformatFreeIO::teardown
bool teardown()
Used to clean up any state created in the sample class.
Definition: sampleReformatFreeIO.cpp:432
onnx_graphsurgeon.logger.logger.Logger.severity
def severity(self)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:112
profiler.mergeRow
def mergeRow(reference, profile, diff)
Definition: profiler.py:121
SampleDynamicReshape::loadPGMFile
Dims loadPGMFile(const std::string &fileName)
Loads a PGM file into mInput and returns the dimensions of the loaded image.
Definition: sampleDynamicReshape.cpp:371
nvonnxparser::IParser::parse
virtual bool parse(void const *serialized_onnx_model, size_t serialized_onnx_model_size)=0
Parse a serialized ONNX model into the TensorRT network.
test_ir.TestConstant.test_can_get_shape
def test_can_get_shape(self)
Definition: test_ir.py:127
SampleSSD::infer
bool infer()
Runs the TensorRT inference engine for this sample.
Definition: sampleSSD.cpp:222
test_struct.TestTensorMetadata.test_str_no_meta
def test_str_no_meta(self)
Definition: common/test_struct.py:36
test_ir.TestGraph.test_layer_with_attrs
def test_layer_with_attrs(self)
Definition: test_ir.py:495
onnx_graphsurgeon.exporters.onnx_exporter.OnnxExporter.export_graph
onnx.GraphProto export_graph(Graph graph, do_type_check=True)
Definition: onnx_exporter.py:68
nvinfer1::LayerType::kSLICE
@ kSLICE
Slice layer.
nvinfer1::plugin::SplitPluginCreator
Definition: split.h:135
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer._scale
_scale
Definition: tensor_quantizer.py:82
nvinfer1::plugin::SpecialSlice::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: specialSlicePlugin.cpp:109
nvinfer1::ElementWiseOperation::kSUM
@ kSUM
Sum of the two elements.
SampleOnnxMnistCoordConvAC::constructNetwork
bool constructNetwork(SampleUniquePtr< nvinfer1::IBuilder > &builder, SampleUniquePtr< nvinfer1::INetworkDefinition > &network, SampleUniquePtr< nvinfer1::IBuilderConfig > &config, SampleUniquePtr< nvonnxparser::IParser > &parser)
Parses an ONNX model for MNIST and creates a TensorRT network.
Definition: sampleOnnxMnistCoordConvAC.cpp:171
nvinfer1::LoopOutput::kCONCATENATE
@ kCONCATENATE
Output value is concatenation of values of tensor for each iteration, in forward order.
IBatchStream::getBatchesRead
virtual int getBatchesRead() const =0
UffPoolPluginV2::mPoolingDesc
cudnnPoolingDescriptor_t mPoolingDesc
Definition: sampleUffPluginV2Ext.cpp:633
polygraphy.tools.precision.precision.STCheckerBase
Definition: precision.py:31
nvinfer1::plugin::BatchedNMSDynamicPlugin::destroy
void destroy() override
Destroy the plugin object.
Definition: batchedNMSPlugin.cpp:380
helpers.tokenization.WordpieceTokenizer.vocab
vocab
Definition: tokenization.py:334
polygraphy.tools.args.comparator.ComparatorRunArgs.save_results
save_results
Definition: tools/args/comparator.py:46
nvinfer1::plugin::MultilevelProposeROIPluginCreator::MultilevelProposeROIPluginCreator
MultilevelProposeROIPluginCreator()
Definition: multilevelProposeROIPlugin.cpp:41
nvinfer1::plugin::GridAnchorGenerator::mPluginName
std::string mPluginName
Definition: gridAnchorPlugin.h:86
nvinfer1::DimsHW
Descriptor for two-dimensional spatial data.
Definition: NvInfer.h:95
anonymous_namespace{detectionLayerPlugin.cpp}::DETECTIONLAYER_PLUGIN_VERSION
const char * DETECTIONLAYER_PLUGIN_VERSION
Definition: detectionLayerPlugin.cpp:28
Exception
nvinfer1::IIdentityLayer
A layer that represents the identity function.
Definition: NvInfer.h:4257
polygraphy.tools.args.model.ModelArgs._model_required
_model_required
Definition: tools/Polygraphy/polygraphy/tools/args/model.py:27
nvinfer1::plugin::ProposalPlugin::mPreNmsTopN
size_t mPreNmsTopN
Definition: proposalPlugin.h:117
nvonnxparser::IParser::getError
virtual IParserError const * getError(int index) const =0
Get an error that occurred during prior calls to parse.
nvinfer1::plugin::InstanceNormalizationPlugin::destroy
void destroy() override
Destroy the plugin object.
Definition: instanceNormalizationPlugin.cpp:249
sample::LayerProfile
Layer profile information.
Definition: sampleReporting.h:158
nvinfer1::plugin::PyramidROIAlign::mROICount
int mROICount
Definition: pyramidROIAlignPlugin.h:97
polygraphy.tools.args.tf.loader.TfLoaderArgs.save_pb
save_pb
Definition: tools/args/tf/loader.py:75
polygraphy.tools.args.data_loader.DataLoaderArgs.register
def register(self, maker)
Definition: tools/args/data_loader.py:42
polygraphy.tools.args.model.ModelArgs
Definition: tools/Polygraphy/polygraphy/tools/args/model.py:25
nvinfer1::plugin::SpecialSlice::canBroadcastInputAcrossBatch
bool canBroadcastInputAcrossBatch(int inputIndex) const override
Definition: specialSlicePlugin.cpp:188
SharedMemory::~SharedMemory
~SharedMemory()
Definition: sampleMovieLensMPS.cpp:126
samplesCommon::Args::runInInt8
bool runInInt8
Definition: argsParser.h:81
nvinfer1::plugin::RPROIPlugin::C
int C
Definition: nvFasterRCNNPlugin.h:99
nvcaffeparser1::BinaryProtoBlob::getDataType
nvinfer1::DataType getDataType() override
Definition: binaryProtoBlob.h:41
nvcaffeparser1::parsePooling
nvinfer1::ILayer * parsePooling(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &, BlobNameToTensor &tensors)
Definition: parsePooling.cpp:23
nvinfer1::plugin::GenerateDetection::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: generateDetectionPlugin.cpp:255
bert::fused_multihead_attention_v2_int8_384_64_kernel_sm75_cubin
unsigned char fused_multihead_attention_v2_int8_384_64_kernel_sm75_cubin[]
Definition: fused_multihead_attention_v2_int8_384_64_kernel.sm75.cpp:19
nvinfer1::plugin::CropAndResizePlugin::canBroadcastInputAcrossBatch
bool canBroadcastInputAcrossBatch(int inputIndex) const override
Definition: cropAndResizePlugin.cpp:214
polygraphy.comparator.comparator.Comparator.postprocess
def postprocess(run_results, postprocess_func)
Definition: comparator/comparator.py:188
layers.ConvNorm.forward
def forward(self, signal)
Definition: layers.py:53
polygraphy.backend.trt.util.check_onnx_parser_errors
def check_onnx_parser_errors(parser)
Definition: trt/util.py:34
BatchStream::getFileLabels
float * getFileLabels()
Definition: BatchStream.h:290
nvinfer1::TacticSources
uint32_t TacticSources
Represents a collection of one or more TacticSource values combined using bitwise-OR operations.
Definition: NvInfer.h:6457
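The bitwise-OR combination described for TacticSources can be sketched as follows. This is a minimal illustration, not TensorRT code: DemoTacticSource, DemoTacticSources, toMask and hasSource are hypothetical names, and the sketch assumes each enumerator denotes a bit position that is shifted into a uint32_t mask.

```cpp
#include <cstdint>

// Hypothetical stand-in enum; the real nvinfer1::TacticSource enumerators
// are not reproduced here.
enum class DemoTacticSource : int32_t
{
    kFIRST = 0,
    kSECOND = 1
};

// Mirrors the idea of a uint32_t typedef holding a set of sources.
using DemoTacticSources = uint32_t;

// Assumption: each enumerator is a bit position in the mask.
constexpr DemoTacticSources toMask(DemoTacticSource s)
{
    return 1U << static_cast<uint32_t>(s);
}

// Returns true when the given source's bit is set in the mask.
constexpr bool hasSource(DemoTacticSources mask, DemoTacticSource s)
{
    return (mask & toMask(s)) != 0U;
}
```

Several sources would then be combined as toMask(a) | toMask(b), and a single source tested with hasSource.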
layers.ConvNorm.conv
conv
Definition: layers.py:43
bert::fused_multihead_attention_v2_int8_128_64_kernel_sm75_cubin_len
unsigned int fused_multihead_attention_v2_int8_128_64_kernel_sm75_cubin_len
Definition: fused_multihead_attention_v2_int8_128_64_kernel.sm75.cpp:15845
samplesCommon::isDebug
bool isDebug()
Definition: samples/common/common.h:370
nvinfer1::plugin::DetectionOutputParameters
The DetectionOutput plugin layer generates the detection output based on location and confidence pred...
Definition: NvInferPluginUtils.h:181
sample::defaultBatch
constexpr int defaultBatch
Definition: sampleOptions.h:44
backend.trt.test_runner.TestLoggerCallbacks.test_set_severity
def test_set_severity(self, sev)
Definition: trt/test_runner.py:31
nmtSample::DebugUtil::DumpTensorPlugin::mOut
std::shared_ptr< std::ostream > mOut
Definition: debugUtil.h:68
ParserOnnxConfig::getTextFileName
virtual const char * getTextFileName() const
Returns the File Name of the Network Description as a Text File.
Definition: parserOnnxConfig.h:112
nvinfer1::IPluginV2::enqueue
virtual int32_t enqueue(int32_t batchSize, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream)=0
Execute the layer.
nvinfer1::plugin::Normalize::destroy
void destroy() override
Destroy the plugin object.
Definition: normalizePlugin.cpp:239
pytorch_quantization.nn.modules.quant_pooling.QuantAdaptiveAvgPool3d
Definition: quant_pooling.py:141
nvinfer1::CalibrationAlgoType::kENTROPY_CALIBRATION_2
@ kENTROPY_CALIBRATION_2
nvinfer1::plugin::ProposalLayer::getWorkspaceSize
size_t getWorkspaceSize(int maxBatchSize) const override
Definition: proposalLayerPlugin.cpp:248
nmtSample::PinnedHostBuffer::ptr
std::shared_ptr< PinnedHostBuffer< T > > ptr
Definition: pinnedHostBuffer.h:35
nvinfer1::DataType
DataType
The type of weights and tensors.
Definition: NvInferRuntimeCommon.h:136
nvinfer1::IRefitter::~IRefitter
virtual ~IRefitter()
Definition: NvInferRuntime.h:885
builder_varseqlen.BertConfig
Definition: builder_varseqlen.py:89
nvinfer1::PluginFieldCollection
Definition: NvInferRuntimeCommon.h:876
SampleUffFasterRcnnParams::uffFileName
std::string uffFileName
The file name of the UFF model to use.
Definition: sampleUffFasterRCNN.cpp:61
bert::EmbLayerNormVarSeqlenPluginCreator::EmbLayerNormVarSeqlenPluginCreator
EmbLayerNormVarSeqlenPluginCreator()
Definition: embLayerNormVarSeqlenPlugin.cpp:419
nvinfer1::IPluginV2
Plugin class for user-implemented layers.
Definition: NvInferRuntimeCommon.h:400
SampleUffFasterRcnnParams::loadEngine
std::string loadEngine
Definition: sampleUffFasterRCNN.cpp:86
nvinfer1::plugin::PyramidROIAlign::clone
IPluginV2Ext * clone() const override
Clone the plugin object.
Definition: pyramidROIAlignPlugin.cpp:127
nvinfer1::IRefitter::getErrorRecorder
virtual IErrorRecorder * getErrorRecorder() const =0
Get the ErrorRecorder assigned to this interface.
sample::anonymous_namespace{sampleInference.cpp}::Iteration::createEnqueueFunction
void createEnqueueFunction(const InferenceOptions &inference, nvinfer1::IExecutionContext &context, Bindings &bindings)
Definition: sampleInference.cpp:434
nvinfer1::ActivationType::kELU
@ kELU
Elu activation: x>=0 ? x : alpha * (exp(x) - 1).
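As a rough host-side illustration of the kELU expression above, the sketch below evaluates it for a single scalar. The function name eluReference is hypothetical and not part of TensorRT; it only mirrors the formula given in the enumerator description.

```cpp
#include <cmath>

// Hypothetical reference helper (not a TensorRT API): evaluates
// x >= 0 ? x : alpha * (exp(x) - 1) for one value.
float eluReference(float x, float alpha)
{
    return x >= 0.0f ? x : alpha * (std::exp(x) - 1.0f);
}
```

Non-negative inputs pass through unchanged, while negative inputs approach -alpha as x decreases.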
model.WaveGlow.n_early_every
n_early_every
Definition: demo/Tacotron2/waveglow/model.py:167
polygraphy.backend.trt.loader.LoadPlugins
Definition: backend/trt/loader.py:27
SampleAlgorithmSelector::constructNetwork
bool constructNetwork(SampleUniquePtr< nvcaffeparser1::ICaffeParser > &parser, SampleUniquePtr< nvinfer1::INetworkDefinition > &network)
Uses a Caffe parser to create the MNIST network and marks the output layers.
Definition: sampleAlgorithmSelector.cpp:510
onnx_graphsurgeon.util.misc.SynchronizedList._add_to_elem
def _add_to_elem(self, elem)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/util/misc.py:76
Logger::getReportableSeverity
Severity getReportableSeverity() const
Definition: pluginLogging.h:391
polygraphy.backend.trt_legacy.TrtLegacyRunner.HostDeviceMem.__str__
def __str__(self)
Definition: backend/trt_legacy.py:184
pytorch_quantization.nn.modules.quant_rnn.QuantRNNCell
Definition: modules/quant_rnn.py:336
LogStreamConsumer::mShouldLog
bool mShouldLog
Definition: pluginLogging.h:180
builder_varseqlen.BertConfig.interleaved
interleaved
Definition: builder_varseqlen.py:101
nvinfer1::plugin::GridAnchorBasePluginCreator::createPlugin
IPluginV2Ext * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: gridAnchorPlugin.cpp:379
SampleINT8APIParams::imageFileName
std::string imageFileName
Definition: sampleINT8API.cpp:65
SampleCharRNNBase::infer
bool infer()
Runs the TensorRT inference engine for this sample.
Definition: sampleCharRNN.cpp:809
polygraphy.backend.tf.loader.CreateConfig.__init__
def __init__(self, gpu_memory_fraction=None, allow_growth=None, use_xla=None)
Definition: backend/tf/loader.py:358
data_function.TextMelCollate
Definition: tacotron2/data_function.py:83
bert::Fused_multihead_attention_params_v2::d
int d
Definition: fused_multihead_attention_v2.h:56
BatchStream::getBatchSize
int getBatchSize() const override
Definition: BatchStream.h:274
nvinfer1::plugin::SplitPlugin::_axis
int _axis
Definition: split.h:40
tests.quant_instancenorm_test.TestQuantInstanceNorm1D.test_no_quant
def test_no_quant(self)
Definition: quant_instancenorm_test.py:48
sample::LogStreamConsumerBase::mBuffer
LogStreamConsumerBuffer mBuffer
Definition: samples/common/logging.h:120
tools.test_surgeon.TestSurgeonExtract.test_sanity_dim_param
def test_sanity_dim_param(self)
Definition: test_surgeon.py:39
nvinfer1::plugin::DetectionOutputParameters::confSigmoid
bool confSigmoid
Definition: NvInferPluginUtils.h:188
SampleINT8API::readPerTensorDynamicRangeValues
bool readPerTensorDynamicRangeValues()
Populate per tensor dynamic range values.
Definition: sampleINT8API.cpp:184
bert::SkipLayerNormInterleavedPlugin::initialize
int initialize() override
Initialize the layer for execution.
Definition: skipLayerNormInt8InterleavedPlugin.cpp:223
nvinfer1::IAlgorithmIOInfo
Carries information about input or output of the algorithm. IAlgorithmIOInfo for all the input and ou...
Definition: NvInfer.h:6169
nvinfer1::ITensor::isNetworkOutput
virtual bool isNetworkOutput() const =0
Whether the tensor is a network output.
tools.test_run.TestOnnxTf
Definition: test_run.py:176
model.WN.__init__
def __init__(self, n_in_channels, n_mel_channels, n_layers, n_channels, kernel_size)
Definition: demo/Tacotron2/waveglow/model.py:89
polygraphy.tools.precision.precision
Definition: precision.py:1
nvinfer1::plugin::BatchedNMSPlugin::mNamespace
std::string mNamespace
Definition: batchedNMSPlugin.h:71
nvcaffeparser1::IBinaryProtoBlob::destroy
virtual void destroy()=0
SampleFasterRCNNParams::nmsMaxOut
int nmsMaxOut
The maximum number of detections post-NMS.
Definition: sampleFasterRCNN.cpp:48
sample::Profiler::mUpdatesCount
int mUpdatesCount
Definition: sampleReporting.h:190
nvinfer1::LayerType::kSOFTMAX
@ kSOFTMAX
SoftMax layer.
inference.parse_args
def parse_args()
Definition: BERT/inference.py:39
nvinfer1::plugin::ResizeNearest::detachFromContext
void detachFromContext() override
Detach the plugin object from its execution context.
Definition: resizeNearestPlugin.cpp:258
UffPoolPluginV2::initialize
int initialize() override
Initialize the layer for execution.
Definition: sampleUffPluginV2Ext.cpp:400
sample::Binding::isInput
bool isInput
Definition: sampleUtils.h:217
nvuffparser::FieldType::kDIMS
@ kDIMS
nvinfer1::Dims field type.
train.reduce_tensor
def reduce_tensor(tensor, num_gpus)
Definition: train.py:151
nvinfer1::plugin::Normalize::enqueue
int enqueue(int batchSize, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: normalizePlugin.cpp:107
test_infer_trt.print_stats
def print_stats(measurements_all)
Definition: test_infer_trt.py:100
bert::TFusedMultiHeadAttentionXMMAKernel::mDataType
Data_type mDataType
Definition: fused_multihead_attention.h:277
MNISTBatchStream::readLabelsFile
void readLabelsFile(const std::string &labelsFilePath)
Definition: BatchStream.h:127
nvinfer1::plugin::MultilevelProposeROI::generate_pyramid_anchors
void generate_pyramid_anchors(const nvinfer1::Dims &image_size)
Definition: multilevelProposeROIPlugin.cpp:345
ILogger
bert::FusedMultiHeadAttentionKernelMetaInfoV2::mD
unsigned int mD
Definition: fused_multihead_attention_v2.h:162
export_waveglow_onnx.convert_WN_1d_to_2d_
def convert_WN_1d_to_2d_(WN)
Definition: export_waveglow_onnx.py:66
bert::SkipLayerNormInterleavedPlugin::clone
nvinfer1::IPluginV2DynamicExt * clone() const override
Clone the plugin object.
Definition: skipLayerNormInt8InterleavedPlugin.cpp:100
sample::InferenceTrace::computeEnd
float computeEnd
Definition: sampleReporting.h:97
nvinfer1::IDimensionExpr::~IDimensionExpr
virtual ~IDimensionExpr()
Definition: NvInferRuntime.h:336
SampleAlgorithmSelector::mParams
samplesCommon::CaffeSampleParams mParams
The parameters for the sample.
Definition: sampleAlgorithmSelector.cpp:359
ParserOnnxConfig::ParserOnnxConfig
ParserOnnxConfig()
Definition: parserOnnxConfig.h:52
helpers.tokenization.BasicTokenizer.do_lower_case
do_lower_case
Definition: tokenization.py:224
pytorch_quantization.nn.modules.quant_rnn.QuantRNNCellBase.check_forward_input
def check_forward_input(self, input)
Definition: modules/quant_rnn.py:318
tests.calibrator_test.TestMSECalibrator.test_unsigned_one_tensor
def test_unsigned_one_tensor(self, verbose)
Definition: calibrator_test.py:213
nvinfer1::ILRNLayer
A LRN layer in a network definition.
Definition: NvInfer.h:1993
nvinfer1::plugin::BasePlugin::setPluginNamespace
void setPluginNamespace(const char *libNamespace) override
Set the namespace that this plugin object belongs to.
Definition: plugin.h:47
nmtSample::DebugUtil::DumpTensorPlugin::getWorkspaceSize
size_t getWorkspaceSize(int maxBatchSize) const override
Definition: debugUtil.cpp:81
MaskRCNNUtils::resizePPM
void resizePPM(const PPM< T > &src, PPM< T > &dst, int target_height, int target_width, int channel)
Definition: sampleUffMaskRCNN.cpp:105
tests.quant_conv_test.TestQuantConv2D.test_unused_kwargs
def test_unused_kwargs(self)
Definition: quant_conv_test.py:263
export_tacotron2_onnx.Encoder
Definition: export_tacotron2_onnx.py:63
evaluate-v1.evaluate
def evaluate(dataset, predictions, f1_acc)
Definition: evaluate-v1.1.py:71
pytorch_quantization.nn
Definition: tools/pytorch-quantization/pytorch_quantization/nn/__init__.py:1
SampleCharRNNBase::copyEmbeddingToInput
void copyEmbeddingToInput(samplesCommon::BufferManager &buffers, const char &c)
Looks up the embedding tensor for a given char and copies it to input buffer.
Definition: sampleCharRNN.cpp:895
MaskRCNNUtils::RawDetection::x1
float x1
Definition: sampleUffMaskRCNN.cpp:54
nmtSample::BLEUScoreWriter::mSmooth
bool mSmooth
Definition: bleuScoreWriter.h:57
backend.trt.test_loader.TestModifyNetwork.test_exclude_outputs_with_mark_layerwise
def test_exclude_outputs_with_mark_layerwise(self, identity_identity_network)
Definition: trt/test_loader.py:137
classification_flow.build_sensitivity_profile
def build_sensitivity_profile(model, criterion, data_loader_test)
Definition: classification_flow.py:421
nvinfer1::plugin::ProposalPlugin::ProposalPlugin
ProposalPlugin()=delete
UffPoolPluginV2::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: sampleUffPluginV2Ext.cpp:411
bert::TFusedMHAKernelFactory::Get
static TFusedMHAKernelFactory< TFusedMHAKernelList > & Get()
Definition: fused_multihead_attention.h:313
pytorch_quantization.tensor_quant.FakeAffineTensorQuantFunction
Definition: tensor_quant.py:366
half_float::detail::functions::atan
static expr atan(float arg)
Arc tangent implementation.
Definition: samples/common/half.h:1926
sample::anonymous_namespace{sampleInference.cpp}::EnqueueExplicit::EnqueueExplicit
EnqueueExplicit(nvinfer1::IExecutionContext &context, void **buffers)
Definition: sampleInference.cpp:218
nvinfer1::plugin::SpecialSlicePluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: specialSlicePlugin.cpp:52
sample::Bindings::fill
void fill(int binding)
Definition: sampleUtils.h:366
nvinfer1::plugin::BatchTilePlugin::BatchTilePlugin
BatchTilePlugin()=delete
SampleINT8::processInput
bool processInput(const samplesCommon::BufferManager &buffers, const float *data)
Reads the input and stores it in a managed buffer.
Definition: sampleINT8.cpp:369
pytorch_quantization.nn.modules.quant_rnn.QuantRNNCellBase.check_forward_hidden
def check_forward_hidden(self, input, hx, hidden_label='')
Definition: modules/quant_rnn.py:324
half_float::detail::fmin
half fmin(half x, half y)
Minimum of half expressions.
Definition: ieee_half.h:2299
bert::get_size_in_bytes
static size_t get_size_in_bytes(size_t n, Data_type dtype)
Definition: fused_multihead_attention.h:30
nvinfer1::plugin::CoordConvACPlugin::oC
int oC
Definition: coordConvACPlugin.h:87
SampleINT8API::mPerTensorDynamicRangeMap
std::unordered_map< std::string, float > mPerTensorDynamicRangeMap
Mapping from tensor name to max absolute dynamic range values.
Definition: sampleINT8API.cpp:115
nvinfer1::ITripLimitLayer
Definition: NvInfer.h:4616
SampleMaskRCNN::mParams
SampleMaskRCNNParams mParams
Definition: sampleUffMaskRCNN.cpp:330
polygraphy.backend.cntk.runner.CNTKRunner.cntk_model
cntk_model
Definition: backend/cntk/runner.py:32
pytorch_quantization.tensor_quant.ScaledQuantDescriptor.to_yaml
def to_yaml(self)
Definition: tensor_quant.py:200
nvinfer1::plugin::PyramidROIAlign::destroy
void destroy() override
Destroy the plugin object.
Definition: pyramidROIAlignPlugin.cpp:102
sample::Logger::TestAtom::TestAtom
TestAtom(bool started, const std::string &name, const std::string &cmdline)
Definition: samples/common/logging.h:274
bert::FusedMultiHeadAttentionXMMAKernelV2::hashID
uint64_t hashID(unsigned int s, bool interleaved, bool unroll) const
Definition: fused_multihead_attention_v2.h:433
nvinfer1::RNNOperation
RNNOperation
Enumerates the RNN operations that may be performed by an RNN layer.
Definition: NvInfer.h:2791
samplesCommon::BBox::y1
float y1
Definition: samples/common/common.h:669
polygraphy.backend.base.runner.BaseRunner.__enter__
def __enter__(self)
Definition: backend/base/runner.py:66
nvinfer1::plugin::PriorBox
Definition: priorBoxPlugin.h:32
SampleSSD::SampleUniquePtr
std::unique_ptr< T, samplesCommon::InferDeleter > SampleUniquePtr
Definition: sampleSSD.cpp:63
nvcaffeparser1::CaffeParser::setProtobufBufferSize
void setProtobufBufferSize(size_t size) override
Set buffer size for the parsing and storage of the learned model.
Definition: caffeParser.h:47
polygraphy.tools.args.data_loader.DataLoaderArgs.get_data_loader
def get_data_loader(self)
Definition: tools/args/data_loader.py:96
samplesCommon::TimerBase::stop
virtual void stop()
Definition: samples/common/common.h:785
nvinfer1::plugin::InstanceNormalizationPlugin::_y_desc
cudnnTensorDescriptor_t _y_desc
Definition: instanceNormalizationPlugin.h:98
SampleSSD::processInput
bool processInput(const samplesCommon::BufferManager &buffers)
Reads the input and mean data, preprocesses, and stores the result in a managed buffer.
Definition: sampleSSD.cpp:276
SampleINT8Params::calBatchSize
int calBatchSize
The calibration batch size.
Definition: sampleINT8.cpp:50
bert::EmbLayerNormVarSeqlenPlugin::getOutputDimensions
nvinfer1::DimsExprs getOutputDimensions(int outputIndex, const nvinfer1::DimsExprs *inputs, int nbInputs, nvinfer1::IExprBuilder &exprBuilder) override
Definition: embLayerNormVarSeqlenPlugin.cpp:131
nvinfer1::plugin::DetectionOutput::supportsFormat
bool supportsFormat(DataType type, PluginFormat format) const override
Check format support.
Definition: nmsPlugin.cpp:142
nvinfer1::ActivationType::kTANH
@ kTANH
TanH activation.
samplesCommon::SampleParams::batchSize
int32_t batchSize
Number of inputs in a batch.
Definition: argsParser.h:38
onnx_graphsurgeon.ir.node
Definition: node.py:1
anonymous_namespace{proposalLayerPlugin.cpp}::PROPOSALLAYER_PLUGIN_VERSION
const char * PROPOSALLAYER_PLUGIN_VERSION
Definition: proposalLayerPlugin.cpp:32
polygraphy.tools.args.comparator.ComparatorCompareArgs.runners
runners
Definition: tools/args/comparator.py:112
tests.quant_conv_test.TestQuantConv1D
Definition: quant_conv_test.py:267
nvcaffeparser1::BlobNameToTensor::find
nvinfer1::ITensor * find(const char *name) const override
Given a blob name, returns a pointer to an ITensor object.
Definition: blobNameToTensor.h:36
sample::anonymous_namespace{sampleInference.cpp}::EnqueueTimes
std::array< TimePoint, 2 > EnqueueTimes
Definition: sampleInference.cpp:275
BatchStream::skip
void skip(int skipCount) override
Definition: BatchStream.h:243
util.test_misc.test_unique_list
def test_unique_list(case)
Definition: test_misc.py:100
SharedMemory::SharedMemory
SharedMemory(const char *modelStreamFd)
Definition: sampleMovieLensMPS.cpp:121
model.Decoder.gate_layer
gate_layer
Definition: demo/Tacotron2/tacotron2/model.py:275
nvinfer1::plugin::GroupNormalizationPlugin::getWorkspaceSize
size_t getWorkspaceSize(const nvinfer1::PluginTensorDesc *inputs, int nbInputs, const nvinfer1::PluginTensorDesc *outputs, int nbOutputs) const override
Definition: groupNormalizationPlugin.cpp:235
bert::fused_multihead_attention_fp16_384_64_kernel_sm75_cu_o_len
unsigned int fused_multihead_attention_fp16_384_64_kernel_sm75_cu_o_len
Definition: fused_multihead_attention_fp16_384_64_kernel.sm75.cpp:1819
backend.onnx.test_loader.TestModifyOnnx.test_custom_outputs
def test_custom_outputs(self)
Definition: onnx/test_loader.py:60
test_func.TestExtend.test_extend_can_modify_rv
def test_extend_can_modify_rv(self)
Definition: test_func.py:106
builder_varseqlen.BertConfig.hidden_size
hidden_size
Definition: builder_varseqlen.py:94
polygraphy.backend.tf.runner.TfRunner.run_metadata
run_metadata
Definition: backend/tf/runner.py:56
sample::Logger::severityPrefix
static const char * severityPrefix(Severity severity)
Returns an appropriate string for prefixing a log message with the given severity.
Definition: samples/common/logging.h:379
polygraphy.tools.args.trt.loader.TrtLoaderArgs.trt_max_shapes
trt_max_shapes
Definition: tools/args/trt/loader.py:93
SampleCharRNNBase::constructNetwork
void constructNetwork(SampleUniquePtr< nvinfer1::IBuilder > &builder, SampleUniquePtr< nvinfer1::INetworkDefinition > &network, SampleUniquePtr< nvinfer1::IBuilderConfig > &config)
Create full model using the TensorRT network definition API and build the engine.
Definition: sampleCharRNN.cpp:761
AlgorithmCacheReader::AlgorithmCacheItem::implementation
int64_t implementation
Definition: sampleAlgorithmSelector.cpp:237
nvinfer1::IPaddingLayer
Layer that represents a padding operation.
Definition: NvInfer.h:3638
sample::LogStreamConsumerBuffer::LogStreamConsumerBuffer
LogStreamConsumerBuffer(std::ostream &stream, const std::string &prefix, bool shouldLog)
Definition: samples/common/logging.h:37
nvinfer1::plugin::MultilevelProposeROI::mFeatureCnt
int mFeatureCnt
Definition: multilevelProposeROIPlugin.h:97
nvinfer1::plugin::MultilevelCropAndResize::clone
IPluginV2Ext * clone() const override
Clone the plugin object.
Definition: multilevelCropAndResizePlugin.cpp:140
nvinfer1::PluginType::kYOLOREGION
@ kYOLOREGION
YOLO Region Plugin.
polygraphy.backend.trt.loader.LoadPlugins.obj
obj
Definition: backend/trt/loader.py:48
nvinfer1::plugin::CoordConvACPluginCreator::CoordConvACPluginCreator
CoordConvACPluginCreator()
Definition: coordConvACPlugin.cpp:178
polygraphy.tools.args.logger
Definition: Polygraphy/polygraphy/tools/args/logger.py:1
sample::anonymous_namespace{sampleInference.cpp}::SyncStruct
Threads synchronization structure.
Definition: sampleInference.cpp:166
half_float::detail::sqrt
expr sqrt(half arg)
Square root.
Definition: ieee_half.h:2379
backend.test_tensorrt_legacy.test_uff_identity
def test_uff_identity(fp16, tf32)
Definition: test_tensorrt_legacy.py:26
nvinfer1::plugin::ProposalPluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: proposalPlugin.cpp:334
sample
Definition: logger.cpp:20
tools.test_run.TestOnnxrt
Definition: test_run.py:181
nvinfer1::plugin::Normalize::detachFromContext
void detachFromContext() override
Detach the plugin object from its execution context.
Definition: normalizePlugin.cpp:227
nvinfer1::CUDADriverWrapper::_cuLaunchCooperativeKernel
CUresult(* _cuLaunchCooperativeKernel)(CUfunction, unsigned int, unsigned int, unsigned int, unsigned int, unsigned int, unsigned int, unsigned int, CUstream, void **)
Definition: cudaDriverWrapper.h:81
bert::fused_multihead_attention_v2_fp16_64_64_kernel_sm80_cubin
unsigned char fused_multihead_attention_v2_fp16_64_64_kernel_sm80_cubin[]
Definition: fused_multihead_attention_v2_fp16_64_64_kernel.sm80.cpp:19
nvinfer1::plugin::DetectionLayer::enqueue
int enqueue(int batch_size, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: detectionLayerPlugin.cpp:254
stft.STFT.__init__
def __init__(self, filter_length=800, hop_length=200, win_length=800, window='hann')
Definition: stft.py:44
bert::FusedMultiHeadAttentionKernelMetaInfoV1::mS
unsigned int mS
Definition: fused_multihead_attention.h:118
SampleOnnxMNIST::infer
bool infer()
Runs the TensorRT inference engine for this sample.
Definition: sampleOnnxMNIST.cpp:196
nvinfer1::TensorFormat::kCHW4
@ kCHW4
Four wide channel vectorized row major format.
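As a rough illustration of what a four-wide channel-vectorized layout means, the sketch below computes the linear element offset of coordinate (n, c, h, w), assuming the tensor is stored like a C array of shape [N][ceil(C/4)][H][W][4]. The helper name chw4_offset is hypothetical and is not part of the TensorRT API.

```python
def chw4_offset(n, c, h, w, C, H, W):
    """Linear element offset of (n, c, h, w) in an assumed
    [N][ceil(C/4)][H][W][4] layout (hypothetical helper)."""
    c_groups = (C + 3) // 4  # channel count rounded up to groups of 4
    # Subscript [n][c // 4][h][w][c % 4], flattened in row-major order.
    return (((n * c_groups + c // 4) * H + h) * W + w) * 4 + c % 4
```

Under this assumption, channels 0..3 of one pixel sit next to each other in memory, and the next pixel along W starts four elements later.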
tests.conftest.pytest_addoption
def pytest_addoption(parser)
Definition: conftest.py:22
FCPlugin::terminate
virtual void terminate() override
Release resources acquired during plugin layer initialization.
Definition: samples/opensource/samplePlugin/fcPlugin.h:146
bert::FusedMultiHeadAttentionXMMAKernelV2::hashID
virtual uint64_t hashID(const KernelMeta &kernelMeta) const
Definition: fused_multihead_attention_v2.h:438
onnx_graphsurgeon.exporters.base_exporter.BaseExporter
Definition: base_exporter.py:19
polygraphy.tools.args.trt_legacy.TrtLegacyArgs.batch_size
batch_size
Definition: tools/args/trt_legacy.py:61
helpers.tokenization.WordpieceTokenizer.max_input_chars_per_word
max_input_chars_per_word
Definition: tokenization.py:336
polygraphy.tools.base.tool.Tool.subscribe_args
def subscribe_args(self, maker)
Definition: tool.py:35
anonymous_namespace{serialize.hpp}::Serializer< std::vector< T >, typename std::enable_if< std::is_arithmetic< T >::value||std::is_enum< T >::value||std::is_pod< T >::value >::type >::serialize
static void serialize(void **buffer, std::vector< T > const &value)
Definition: serialize.hpp:95
sample::InferenceTrace::~InferenceTrace
~InferenceTrace()=default
tools.test_run.TestTrt.test_trt_int8_calibration_cache
def test_trt_int8_calibration_cache(self)
Definition: test_run.py:131
nvinfer1::plugin::NormalizePluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: normalizePlugin.cpp:272
SampleConfig::mTopK
uint64_t mTopK
Definition: sampleConfig.h:57
nmtSample::BeamSearchPolicy::backtrack
void backtrack(int lastTimestepId, int sampleId, int lastTimestepRayId, int *hOutputData, int lastTimestepWriteId) const
Definition: beamSearchPolicy.cpp:153
denoiser.Denoiser.forward
def forward(self, audio, strength=0.1)
Definition: denoiser.py:47
nmtSample::BLEUScoreWriter::finalize
void finalize() override
Called right after inference ends.
Definition: bleuScoreWriter.cpp:185
nmtSample::Decoder::ptr
std::shared_ptr< Decoder > ptr
Definition: decoder.h:36
nvinfer1::LayerType::kTRIP_LIMIT
@ kTRIP_LIMIT
Loop Trip limit layer.
tests.integration_test.TestNetwork
Definition: integration_test.py:43
sample::ReportingOptions::parse
void parse(Arguments &arguments) override
Definition: sampleOptions.cpp:708
half_float::half::operator--
half operator--(int)
Postfix decrement.
Definition: samples/common/half.h:1546
polygraphy.util.cuda.DeviceBuffer.copy_from
def copy_from(self, host_buffer, stream=None)
Definition: cuda.py:160
copyright-scan.main
def main()
Definition: copyright-scan.py:165
model.Decoder.get_go_frame
def get_go_frame(self, memory)
Definition: demo/Tacotron2/tacotron2/model.py:286
UffPoolPluginV2::clone
IPluginV2Ext * clone() const override
Clone the plugin object.
Definition: sampleUffPluginV2Ext.cpp:552
nvinfer1::plugin::BatchTilePluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: batchTilePlugin.cpp:189
ParserOnnxConfig::mPrintLayercInfo
bool mPrintLayercInfo
Definition: parserOnnxConfig.h:49
SampleINT8APIArgs::referenceFileName
std::string referenceFileName
Definition: sampleINT8API.cpp:653
nvinfer1::plugin::PyramidROIAlign::PyramidROIAlign
PyramidROIAlign(int pooled_size)
Definition: pyramidROIAlignPlugin.cpp:78
nmtSample::SLPEmbedder::addToModel
void addToModel(nvinfer1::INetworkDefinition *network, nvinfer1::ITensor *input, nvinfer1::ITensor **output) override
Add the embedding vector calculation to the network.
Definition: slpEmbedder.cpp:41
nmtSample::Encoder::getStateSizes
virtual std::vector< nvinfer1::Dims > getStateSizes()=0
Get the sizes of the hidden state vectors, returned as a vector.
ParserOnnxConfig::getPrintLayerInfo
virtual bool getPrintLayerInfo() const
Get whether the layer information will be printed.
Definition: parserOnnxConfig.h:128
tests.fixtures.models.LeNet.conv1
conv1
Definition: tools/pytorch-quantization/tests/fixtures/models.py:29
Args::randomSeed
int randomSeed
Definition: demo/BERT/infer_c/common.h:34
nmtSample::Attention::Attention
Attention()=default
nmtSample::SLPAttention::mInputChannelCount
int mInputChannelCount
Definition: slpAttention.h:49
nvinfer1::CUDADriverWrapper::cuModuleGetFunction
CUresult cuModuleGetFunction(CUfunction *hfunc, CUmodule hmod, const char *name) const
Definition: cudaDriverWrapper.cpp:107
polygraphy.comparator.comparator.Comparator.default_comparisons
def default_comparisons(run_results)
Definition: comparator/comparator.py:208
SampleMLP::constructNetwork
bool constructNetwork(SampleUniquePtr< nvinfer1::IBuilder > &builder, SampleUniquePtr< nvinfer1::INetworkDefinition > &network, SampleUniquePtr< nvinfer1::IBuilderConfig > &config)
Uses the API to create the MLP Network.
Definition: sampleMLP.cpp:188
nvinfer1::plugin::DetectionOutput::numPriors
int numPriors
Definition: nmsPlugin.h:90
nmtSample::Segment_t
std::vector< std::string > Segment_t
Definition: bleuScoreWriter.cpp:32
test_ir.build_basic_graph
def build_basic_graph()
Definition: test_ir.py:338
nvinfer1::ElementWiseOperation::kFLOOR_DIV
@ kFLOOR_DIV
Floor division of the first element by the second.
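A minimal sketch of elementwise floor division on plain Python lists, assuming the result is rounded toward negative infinity as the name suggests. The function floor_div is hypothetical and only illustrates the arithmetic, not how TensorRT implements the operation.

```python
import math


def floor_div(first, second):
    """Elementwise floor(first[i] / second[i]) for two equal-length lists
    (hypothetical helper)."""
    return [math.floor(a / b) for a, b in zip(first, second)]
```

For negative quotients this differs from C++ integer division, which truncates toward zero: the quotient -3.5 floors to -4, while truncation would give -3.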
nvinfer1::plugin::LReLU::supportsFormat
bool supportsFormat(DataType type, PluginFormat format) const override
Check format support.
Definition: lReluPlugin.cpp:93
nmtSample::Component::Component
Component()=default
samplesCommon::GenericBuffer::mCapacity
size_t mCapacity
Definition: buffers.h:172
stft.STFT.window
window
Definition: stft.py:49
polygraphy.util.misc.lazy_write
def lazy_write(contents, path, mode="wb")
Definition: Polygraphy/polygraphy/util/misc.py:295
nvinfer1::plugin::GridAnchorPluginCreator::GridAnchorPluginCreator
GridAnchorPluginCreator()
Definition: gridAnchorPlugin.cpp:497
polygraphy.tools.args.base.BaseArgs.check_registered
def check_registered(self)
Definition: args/base.py:53
tests.optim_helper_test.TestQuantWeightInPlace
Definition: optim_helper_test.py:103
polygraphy.tools.registry.MissingTool.__call__
def __call__(self, args)
Definition: registry.py:37
SampleConfig::InputDataFormat
InputDataFormat
Definition: sampleConfig.h:29
nvinfer1::plugin::BatchTilePluginCreator::mNamespace
std::string mNamespace
Definition: batchTilePlugin.h:111
nvinfer1::plugin::DetectionLayer::mIOUThreshold
float mIOUThreshold
Definition: detectionLayerPlugin.h:97
polygraphy.tools.args.data_loader.DataLoaderArgs.__init__
def __init__(self, read=True)
Definition: tools/args/data_loader.py:23
nvinfer1::plugin::ResizeNearestPluginCreator::mFC
static PluginFieldCollection mFC
Definition: resizeNearestPlugin.h:114
tests.clip_test.TestClip
Definition: clip_test.py:36
nvinfer1::plugin::RegionPluginCreator::deserializePlugin
IPluginV2Ext * deserializePlugin(const char *name, const void *serialData, size_t serialLength) override
Called during deserialization of plugin layer.
Definition: regionPlugin.cpp:528
tests.fixtures.models.quant_lenet
def quant_lenet()
Definition: tools/pytorch-quantization/tests/fixtures/models.py:67
model.Postnet.convolutions
convolutions
Definition: demo/Tacotron2/tacotron2/model.py:129
tests.tensor_quantizer_test.TestTensorQuantizer.test_simple_run
def test_simple_run(self)
Definition: tensor_quantizer_test.py:38
nvinfer1::plugin::CropAndResizePlugin::mDepth
size_t mDepth
Definition: cropAndResizePlugin.h:101
nvcaffeparser1::CaffeParser::parseDetectionOutputParam
std::vector< nvinfer1::PluginField > parseDetectionOutputParam(const trtcaffe::LayerParameter &msg, CaffeWeightFactory &weightFactory, BlobNameToTensor &tensors)
Definition: caffeParser.cpp:158
sample::anonymous_namespace{sampleEngines.cpp}::RndInt8Calibrator
Definition: sampleEngines.cpp:157
SampleConfig::mFailurePercentage
float mFailurePercentage
Definition: sampleConfig.h:58
nvcaffeparser1::parsePower
nvinfer1::ILayer * parsePower(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &weightFactory, BlobNameToTensor &tensors)
Definition: parsePower.cpp:23
backend.trt.test_runner.TestLoggerCallbacks
Definition: trt/test_runner.py:29
onnx_graphsurgeon.ir.node.Node.op
op
Definition: node.py:37
samplesCommon::Args::saveEngine
std::string saveEngine
Definition: argsParser.h:87
sample::HostDeallocator::operator()
void operator()(void *ptr)
Definition: sampleDevice.h:299
builder.BertConfig.use_strict
use_strict
Definition: builder.py:100
half_float::detail::modf
half modf(half arg, half *iptr)
Extract integer and fractional parts.
Definition: ieee_half.h:2643
nvinfer1::LayerType
LayerType
The type values of layer classes.
Definition: NvInfer.h:444
SampleCharRNNBindingNames::OUTPUT_BLOB_NAME
const char * OUTPUT_BLOB_NAME
Definition: sampleCharRNN.cpp:92
polygraphy.util
Definition: tools/Polygraphy/polygraphy/util/__init__.py:1
bert::SkipLayerNormInterleavedPluginCreator::setPluginNamespace
void setPluginNamespace(const char *pluginNamespace) override
Set the namespace of the plugin creator based on the plugin library it belongs to.
Definition: skipLayerNormInt8InterleavedPlugin.cpp:339
MaskRCNNUtils::addBBoxPPM
void addBBoxPPM(PPM< uint8_t > &ppm, const BBoxInfo &box, const PPM< uint8_t > &resized_mask)
Definition: sampleUffMaskRCNN.cpp:265
nmtSample::LikelihoodCombinationOperator::~LikelihoodCombinationOperator
virtual ~LikelihoodCombinationOperator()=default
sample::anonymous_namespace{sampleOptions.cpp}::splitInsertKeyValue
void splitInsertKeyValue(const std::vector< std::string > &kvList, T &map)
Definition: sampleOptions.cpp:163
test_api.TestApi
Definition: test_api.py:25
SampleCharRNNv2::SampleCharRNNv2
SampleCharRNNv2(SampleCharRNNParams params)
Definition: sampleCharRNN.cpp:226
sample::BaseModelOptions::format
ModelFormat format
Definition: sampleOptions.h:76
nvinfer1::plugin::DetectionOutput::isOutputBroadcastAcrossBatch
bool isOutputBroadcastAcrossBatch(int outputIndex, const bool *inputIsBroadcasted, int nbInputs) const override
Definition: nmsPlugin.cpp:196
half_float::detail::acos
expr acos(half arg)
Arc cosine function.
Definition: ieee_half.h:2445
polygraphy.backend.onnx.loader.OnnxFromTfGraph
Definition: backend/onnx/loader.py:49
polygraphy.backend.onnx.loader.BytesFromOnnx
Definition: backend/onnx/loader.py:178
polygraphy.backend.tf.loader.UseTfTrt.__init__
def __init__(self, graph, max_workspace_size=None, fp16=None, int8=None, max_batch_size=None, is_dynamic_op=False, minimum_segment_size=None)
Definition: backend/tf/loader.py:213
bert::EmbLayerNormVarSeqlenPlugin::clone
nvinfer1::IPluginV2DynamicExt * clone() const override
Clone the plugin object.
Definition: embLayerNormVarSeqlenPlugin.cpp:121
nvinfer1::plugin::SpecialSlicePluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: specialSlicePlugin.h:118
samplesCommon::CaffeSampleParams
The CaffeSampleParams structure groups the additional parameters required by networks that use caffe.
Definition: argsParser.h:51
polygraphy.backend.base.runner.BaseRunner.last_inference_time
def last_inference_time(self)
Definition: backend/base/runner.py:51
sample::TrtCudaStream
Managed CUDA stream.
Definition: sampleDevice.h:57
sample::anonymous_namespace{sampleInference.cpp}::Iteration::mStream
MultiStream mStream
Definition: sampleInference.cpp:467
nvcaffeparser1::ICaffeParser::setProtobufBufferSize
virtual void setProtobufBufferSize(size_t size)=0
Set buffer size for the parsing and storage of the learned model.
nmtSample::Vocabulary::operator>>
friend std::istream & operator>>(std::istream &input, Vocabulary &value)
Definition: vocabulary.cpp:61
SampleMovieLensArgs::verbose
bool verbose
Definition: sampleMovieLens.cpp:529
polygraphy.tools.args.tf.runner.TfRunnerArgs.add_to_parser
def add_to_parser(self, parser)
Definition: tools/args/tf/runner.py:22
Semaphore::mSemEngine
sem_t * mSemEngine
Definition: sampleMovieLensMPS.cpp:115
SampleUffFasterRcnn::mEngine
std::shared_ptr< nvinfer1::ICudaEngine > mEngine
The TensorRT engine used to run the network.
Definition: sampleUffFasterRCNN.cpp:125
nvinfer1::DimsExprs::nbDims
int32_t nbDims
The number of dimensions.
Definition: NvInferRuntime.h:378
SampleFasterRCNN::bboxTransformInvAndClip
void bboxTransformInvAndClip(const float *rois, const float *deltas, float *predBBoxes, const float *imInfo, const int N, const int nmsMaxOut, const int numCls)
Performs inverse bounding box transform and clipping.
Definition: sampleFasterRCNN.cpp:384
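For orientation, the sketch below applies the inverse box transform commonly used with Faster R-CNN to a single box and then clips it to the image. It assumes (x1, y1, x2, y2) boxes, (dx, dy, dw, dh) deltas, and the "+1" width convention. The function name and argument layout are hypothetical and do not reproduce how the sample lays out rois, deltas, or imInfo.

```python
import math


def bbox_transform_inv_and_clip(roi, delta, im_height, im_width):
    """Decode one box from regression deltas, then clip it to the image
    (hypothetical single-box sketch)."""
    x1, y1, x2, y2 = roi
    dx, dy, dw, dh = delta
    width = x2 - x1 + 1.0
    height = y2 - y1 + 1.0
    ctr_x = x1 + 0.5 * width
    ctr_y = y1 + 0.5 * height
    # Shift the centre by the deltas and rescale the size exponentially.
    pred_ctr_x = dx * width + ctr_x
    pred_ctr_y = dy * height + ctr_y
    pred_w = math.exp(dw) * width
    pred_h = math.exp(dh) * height

    def clip(value, upper):
        return max(0.0, min(value, upper))

    return (
        clip(pred_ctr_x - 0.5 * pred_w, im_width - 1.0),
        clip(pred_ctr_y - 0.5 * pred_h, im_height - 1.0),
        clip(pred_ctr_x + 0.5 * pred_w, im_width - 1.0),
        clip(pred_ctr_y + 0.5 * pred_h, im_height - 1.0),
    )
```

With all-zero deltas the box is only re-expressed through its centre and size. A box pushed past the image border is clamped to the last valid pixel coordinate.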
nvinfer1::plugin::GenerateDetection::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: generateDetectionPlugin.cpp:170
nvinfer1::plugin::ProposalLayer::clone
IPluginV2Ext * clone() const override
Clone the plugin object.
Definition: proposalLayerPlugin.cpp:175
nvinfer1::plugin::BatchTilePluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: batchTilePlugin.cpp:199
anonymous_namespace{groupNormalizationPlugin.cpp}::GROUP_NORM_NAME
constexpr const char * GROUP_NORM_NAME
Definition: groupNormalizationPlugin.cpp:38
samplesCommon::SampleParams::dataDirs
std::vector< std::string > dataDirs
Directory paths where sample data files are stored.
Definition: argsParser.h:42
nmtSample::Component::ptr
std::shared_ptr< Component > ptr
Definition: component.h:33
nmtSample::Alignment
Represents the core of the attention mechanism.
Definition: alignment.h:32
nvinfer1::plugin::ResizeNearest::setPluginNamespace
void setPluginNamespace(const char *libNamespace) override
Set the namespace that this plugin object belongs to.
Definition: resizeNearestPlugin.cpp:182
nvinfer1::UnaryOperation::kTAN
@ kTAN
Tangent.
model.WN
Definition: demo/Tacotron2/waveglow/model.py:82
SampleConfig::setOutputFileName
void setOutputFileName(const char *outputFilename)
Definition: sampleConfig.h:263
anonymous_namespace{regionPlugin.cpp}::REGION_PLUGIN_VERSION
const char * REGION_PLUGIN_VERSION
Definition: regionPlugin.cpp:27
nvinfer1::IInt8EntropyCalibrator::~IInt8EntropyCalibrator
virtual ~IInt8EntropyCalibrator()
Definition: NvInfer.h:6071
onnx_graphsurgeon.ir.tensor.Tensor.__init__
def __init__(self)
Definition: tensor.py:28
inference.question_features
def question_features(tokens, question)
Definition: BERT/inference.py:114
nvinfer1::IPluginV2::getPluginNamespace
virtual const char * getPluginNamespace() const =0
Return the namespace of the plugin object.
backend.trt.test_loader.TestLoadPlugins
Definition: trt/test_loader.py:62
polygraphy.tools.precision.precision.STLinear.__init__
def __init__(self)
Definition: precision.py:234
convert_weights.float_to_hex
def float_to_hex(f)
Definition: convert_weights.py:58
nvinfer1::plugin::LReluPluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: lReluPlugin.h:96
nvcaffeparser1::WeightType::kGENERIC
@ kGENERIC
nvinfer1::PluginType::kPRELU
@ kPRELU
YOLO PReLU Plugin.
polygraphy.comparator.data_loader.DataLoaderCache.cache
cache
Definition: comparator/data_loader.py:201
polygraphy.util.misc.receive_on_queue
def receive_on_queue(queue, timeout=None)
Definition: Polygraphy/polygraphy/util/misc.py:243
nvinfer1::BuilderFlag::kTF32
@ kTF32
Allow (but not require) computations on tensors of type DataType::kFLOAT to use TF32.
tests.tensor_quant_test.TestFakeTensorQuant.test_clip_gradient
def test_clip_gradient(self)
Definition: tensor_quant_test.py:249
SampleMovieLens::readInputSample
void readInputSample(std::ifstream &file, OutputParams &outParams, std::string line)
Extracts needed dataset values for a single user in the MovieLens dataset .csv file,...
Definition: sampleMovieLens.cpp:374
sample::InferenceTime::enq
float enq
Definition: sampleReporting.h:52
Semaphore::Semaphore
Semaphore(const char *semName)
Definition: sampleMovieLensMPS.cpp:83
anonymous_namespace{generateDetectionPlugin.cpp}::GENERATEDETECTION_PLUGIN_NAME
const char * GENERATEDETECTION_PLUGIN_NAME
Definition: generateDetectionPlugin.cpp:32
polygraphy.backend.trt.loader.EngineFromBytes.__call__
def __call__(self)
Definition: backend/trt/loader.py:472
nvinfer1::plugin::BatchTilePluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: batchTilePlugin.cpp:194
nvinfer1::EnumMax< RNNGateType >
constexpr int32_t EnumMax< RNNGateType >()
Maximum number of elements in RNNGateType enum.
Definition: NvInfer.h:3214
EntropyCalibratorImpl
Implements common functionality for Entropy calibrators.
Definition: EntropyCalibrator.h:28
model.WaveGlow.WN
WN
Definition: demo/Tacotron2/waveglow/model.py:169
polygraphy.logger.logger.LoggerIndent.__enter__
def __enter__(self)
Definition: Polygraphy/polygraphy/logger/logger.py:49
sample::MirroredBuffer
Coupled host and device buffers.
Definition: sampleDevice.h:313
RefineDetectionWorkSpace::sortClassSampleIdxDims
nvinfer1::DimsHW sortClassSampleIdxDims
Definition: maskRCNNKernels.h:102
nmtSample::Attention::~Attention
~Attention() override=default
nmtSample::LikelihoodCombinationOperator::init
virtual float init() const =0
nvinfer1::Weights::type
DataType type
The type of the weights.
Definition: NvInferRuntime.h:80
polygraphy.tools.args.trt.loader.TrtLoaderArgs.tf32
tf32
Definition: tools/args/trt/loader.py:99
sample::InferenceTrace::enqStart
float enqStart
Definition: sampleReporting.h:92
onnx_graphsurgeon.util.misc.SynchronizedList
Definition: onnx-graphsurgeon/onnx_graphsurgeon/util/misc.py:69
nvinfer1::plugin::LReLU
Definition: lReluPlugin.h:32
nvinfer1::plugin::InstanceNormalizationPlugin::_nchan
int _nchan
Definition: instanceNormalizationPlugin.h:91
backend.tf.test_loader.TestModifyGraph
Definition: tf/test_loader.py:50
SampleReformatFreeIO::compareResult
void compareResult()
Used to compare the CPU reference with the TRT result.
nvinfer1::plugin::RPROIParams::poolingW
int32_t poolingW
Definition: NvInferPluginUtils.h:119
anonymous_namespace{instanceNormalizationPlugin.cpp}::INSTANCE_PLUGIN_NAME
constexpr const char * INSTANCE_PLUGIN_NAME
Definition: instanceNormalizationPlugin.cpp:65
nvinfer1::plugin::reduced_divisor::reduced_divisor
__host__ __device__ __forceinline__ reduced_divisor(unsigned _mul_coeff, unsigned _shift_coeff, int _y)
Definition: reducedMathPlugin.h:63
half_float::detail::functions::multiplies
static expr multiplies(float x, float y)
Multiplication implementation.
Definition: samples/common/half.h:1607
BatchStream::mDims
nvinfer1::Dims mDims
Input dimensions.
Definition: BatchStream.h:385
models.meta.check_reshape
def check_reshape(runner)
Definition: meta.py:76
anonymous_namespace{serialize.hpp}::Serializer< T, typename std::enable_if< std::is_arithmetic< T >::value||std::is_enum< T >::value||std::is_pod< T >::value >::type >::serialize
static void serialize(void **buffer, T const &value)
Definition: serialize.hpp:51
SampleINT8API::getInputOutputNames
void getInputOutputNames()
Populates input and output mapping of the network.
Definition: sampleINT8API.cpp:153
SampleCharRNNParams::bindingNames
SampleCharRNNBindingNames bindingNames
Definition: sampleCharRNN.cpp:129
nvinfer1::plugin::TRTException::file
const char * file
Definition: checkMacrosPlugin.h:93
polygraphy.backend.trt.runner.TrtRunner.infer_impl
def infer_impl(self, feed_dict)
Definition: backend/trt/runner.py:152
nvinfer1::IPluginV2Layer
Layer type for pluginV2.
Definition: NvInfer.h:3457
nvinfer1::plugin::NMSPluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: nmsPlugin.cpp:293
sample::DeviceAllocator::operator()
void operator()(void **ptr, size_t size)
Definition: sampleDevice.h:275
pytorch_quantization.utils.reduce_amax.reduce_amax
def reduce_amax(input, axis=None, keepdims=True)
Definition: reduce_amax.py:24
onnx_graphsurgeon.ir.node.Node.name
name
Definition: node.py:38
loss_function.WaveGlowLoss
Definition: waveglow/loss_function.py:19
nvinfer1::ScaleMode::kUNIFORM
@ kUNIFORM
Identical coefficients across all elements of the tensor.
nvinfer1::plugin::NormalizePluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: normalizePlugin.h:130
nvinfer1::plugin::MultilevelCropAndResize::mNameSpace
std::string mNameSpace
Definition: multilevelCropAndResizePlugin.h:102
polygraphy.logger.logger.LogMode
Definition: Polygraphy/polygraphy/logger/logger.py:72
polygraphy.util.cuda.DeviceBuffer.shape
shape
Definition: cuda.py:122
nvinfer1::plugin::DetectionLayerPluginCreator::mScoreThreshold
float mScoreThreshold
Definition: detectionLayerPlugin.h:129
polygraphy.tools.util.script.Script.loaders
loaders
Definition: script.py:137
nvinfer1::RNNDirection
RNNDirection
Enumerates the RNN directions that may be performed by an RNN layer.
Definition: NvInfer.h:2813
bert::fused_multihead_attention_v2_int8_256_64_kernel_sm75_cubin
unsigned char fused_multihead_attention_v2_int8_256_64_kernel_sm75_cubin[]
Definition: fused_multihead_attention_v2_int8_256_64_kernel.sm75.cpp:19
sample::getEngine
TrtUniquePtr< nvinfer1::ICudaEngine > getEngine(const ModelOptions &model, const BuildOptions &build, const SystemOptions &sys, std::ostream &err)
Create an engine from model or serialized file, and optionally save engine.
Definition: sampleEngines.cpp:682
onnx_graphsurgeon.ir.graph.Graph
Definition: graph.py:59
sample::ModelOptions::baseModel
BaseModelOptions baseModel
Definition: sampleOptions.h:96
polygraphy.backend.trt.loader.CreateConfig.fp16
fp16
Definition: backend/trt/loader.py:323
tools.test_args.TestParseMeta.test_parse_shape_with_dim_param_single_quote
def test_parse_shape_with_dim_param_single_quote(self)
Definition: test_args.py:41
nvinfer1::IGpuAllocator::~IGpuAllocator
virtual ~IGpuAllocator()
Destructor declared virtual as general good practice for a class with virtual methods.
Definition: NvInferRuntimeCommon.h:1065
nmtSample::Decoder::~Decoder
~Decoder() override=default
nvcaffeparser1::CaffeParser::mPluginFactoryIsExt
bool mPluginFactoryIsExt
Definition: caffeParser.h:89
SampleCharRNNWeightNames::RNNB_L0_NAME
const std::string RNNB_L0_NAME
Definition: sampleCharRNN.cpp:74
SampleMLP::loadWeights
std::map< std::string, std::pair< nvinfer1::Dims, nvinfer1::Weights > > loadWeights(const std::string &file)
Loads weights from weights file.
Definition: sampleMLP.cpp:362
bert::EmbLayerNormVarSeqlenPlugin::mWordEmb
bert::WeightsWithOwnership mWordEmb
Definition: embLayerNormVarSeqlenPlugin.h:99
nvinfer1::plugin::LReluPluginCreator::~LReluPluginCreator
~LReluPluginCreator() override=default
nmtSample::TextWriter::mVocabulary
Vocabulary::ptr mVocabulary
Definition: textWriter.h:50
nvinfer1::plugin::CoordConvACPlugin::mNamespace
std::string mNamespace
Definition: coordConvACPlugin.h:89
nvinfer1::RNNOperation::kLSTM
@ kLSTM
Four-gate LSTM network without peephole connections.
FCPlugin::mDstDescriptor
cudnnTensorDescriptor_t mDstDescriptor
Definition: samples/opensource/samplePlugin/fcPlugin.h:317
nvinfer1::plugin::FlattenConcat::mCublas
cublasHandle_t mCublas
Definition: flattenConcat.h:105
polygraphy.comparator.struct.AccuracyResult.__bool__
def __bool__(self)
Definition: comparator/struct.py:154
sample::BuildOptions::save
bool save
Definition: sampleOptions.h:117
SampleCharRNNLoop::LstmIO::hidden
nvinfer1::ITensor * hidden
Definition: sampleCharRNN.cpp:244
SampleCharRNNBase::stepOnce
bool stepOnce(samplesCommon::BufferManager &buffers, SampleUniquePtr< nvinfer1::IExecutionContext > &context, cudaStream_t &stream)
Perform one time step of inference with the TensorRT execution context.
Definition: sampleCharRNN.cpp:908
nvinfer1::DimensionOperation::kMAX
@ kMAX
Maximum of the two operands.
nvinfer1::ElementWiseOperation::kGREATER
@ kGREATER
Check if element in first tensor is greater than corresponding element in second tensor.
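The comparison described above can be sketched on the host as follows. This is only an illustration of the element-wise semantics, not TensorRT code: `elementwiseGreater` is a hypothetical helper name, and the sketch assumes both inputs have the same length (broadcasting is not modelled).

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical host-side reference for the kGREATER semantics:
// out[i] is true when a[i] > b[i].
// Assumption: a and b have the same number of elements.
inline std::vector<bool> elementwiseGreater(const std::vector<float>& a, const std::vector<float>& b)
{
    assert(a.size() == b.size());
    std::vector<bool> out(a.size());
    for (std::size_t i = 0; i < a.size(); ++i)
    {
        out[i] = a[i] > b[i];
    }
    return out;
}
```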
nvinfer1::plugin::SpecialSlice::getWorkspaceSize
size_t getWorkspaceSize(int) const override
Definition: specialSlicePlugin.cpp:67
anonymous_namespace{multilevelCropAndResizePlugin.cpp}::MULTILEVELCROPANDRESIZE_PLUGIN_VERSION
const char * MULTILEVELCROPANDRESIZE_PLUGIN_VERSION
Definition: multilevelCropAndResizePlugin.cpp:31
backend.trt.test_calibrator.TestCalibrator.test_calibrator_with_file_object_cache
def test_calibrator_with_file_object_cache(self, identity_builder_network, mode)
Definition: test_calibrator.py:101
BertInference::mHostOutput
std::vector< float > mHostOutput
Definition: bert_infer.h:348
bert::FusedMultiHeadAttentionKernelMetaInfoV1::mFuncName
const char * mFuncName
Definition: fused_multihead_attention.h:123
nvinfer1::plugin::ProposalPlugin::clone
nvinfer1::IPluginV2Ext * clone() const override
Clone the plugin object.
Definition: proposalPlugin.cpp:256
polygraphy.tools.util.misc.parse_profile_shapes
def parse_profile_shapes(default_shapes, min_args, opt_args, max_args)
Definition: Polygraphy/polygraphy/tools/util/misc.py:133
nvinfer1::plugin::ProposalLayer::enqueue
int enqueue(int batch_size, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: proposalLayerPlugin.cpp:308
str
nvinfer1::TripLimit
TripLimit
Enum that describes kinds of trip limits.
Definition: NvInfer.h:4507
IBatchStream::getBatch
virtual float * getBatch()=0
nvinfer1::plugin::FlattenConcat::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: flattenConcat.cpp:87
polygraphy.comparator.data_loader.DataLoaderCache.__init__
def __init__(self, data_loader, save_inputs_path=None)
Definition: comparator/data_loader.py:199
onnx_models.Model.nodes
nodes
Definition: onnx_models.py:39
text.numbers._expand_decimal_point
def _expand_decimal_point(m)
Definition: numbers.py:35
pytorch_quantization.nn.modules.quant_rnn.QuantLSTMCell.bias_ih
bias_ih
Definition: modules/quant_rnn.py:393
nvinfer1::plugin::SplitPlugin::_ny
int _ny
Definition: split.h:42
half_float::half::operator/=
half & operator/=(float rhs)
Arithmetic assignment.
Definition: samples/common/half.h:1515
polygraphy.backend.trt.loader.Profile.__getitem__
def __getitem__(self, key)
Definition: backend/trt/loader.py:285
polygraphy.backend.trt.loader.EngineFromBytes
Definition: backend/trt/loader.py:458
nvcaffeparser1::BinaryProtoBlob::mDataType
nvinfer1::DataType mDataType
Definition: binaryProtoBlob.h:62
nvinfer1::plugin::Reorg::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: reorgPlugin.cpp:56
nmtSample::TextWriter::~TextWriter
~TextWriter() override=default
SampleConfig::getOutputFileName
const char * getOutputFileName() const
Returns the name of the file in which the results are saved.
Definition: sampleConfig.h:259
tests.fixtures.models.QuantLeNet.__init__
def __init__(self, **kwargs)
Definition: tools/pytorch-quantization/tests/fixtures/models.py:45
pytorch_quantization.nn.modules.quant_rnn.QuantRNNCellBase
Definition: modules/quant_rnn.py:306
dumpTFWts.float_to_hex
def float_to_hex(f)
Definition: dumpTFWts.py:51
pytorch_quantization.nn.modules.quant_conv._QuantConvTransposeNd._quant
def _quant(self, input)
Definition: quant_conv.py:252
nvinfer1::IRecurrenceLayer
Definition: NvInfer.h:4530
nvinfer1::ElementWiseOperation::kXOR
@ kXOR
Logical XOR of two elements.
nvinfer1::plugin::RPROIPlugin::W
int W
Definition: nvFasterRCNNPlugin.h:99
sample::anonymous_namespace{sampleEngines.cpp}::RndInt8Calibrator::~RndInt8Calibrator
~RndInt8Calibrator()
Definition: sampleEngines.cpp:163
nvinfer1::ElementWiseOperation::kSUB
@ kSUB
Subtract the second element from the first.
sample::InferenceEnvironment::engine
TrtUniquePtr< nvinfer1::ICudaEngine > engine
Definition: sampleInference.h:35
polygraphy.logger.logger.Logger
Definition: Polygraphy/polygraphy/logger/logger.py:77
bert::SkipLayerNormInterleavedPlugin::mNamespace
std::string mNamespace
Definition: skipLayerNormInt8InterleavedPlugin.h:72
polygraphy.backend.trt.loader.ModifyNetwork.outputs
outputs
Definition: backend/trt/loader.py:204
bert::Data_type
Data_type
Definition: fused_multihead_attention_common.h:22
nvinfer1::plugin::ProposalLayer::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: proposalLayerPlugin.cpp:165
onnx_graphsurgeon.logger.logger.Logger.colors
colors
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:104
tests.tensor_quant_test.TestFakeAffineTensorQuant
Definition: tensor_quant_test.py:352
nmtSample::LimitedSamplesDataReader
wraps another data reader and limits the number of samples to read
Definition: limitedSamplesDataReader.h:29
nvinfer1::IScaleLayer::~IScaleLayer
virtual ~IScaleLayer()
Definition: NvInfer.h:2164
nvinfer1::INetworkDefinition::setName
virtual void setName(const char *name)=0
Sets the name of the network.
nvinfer1::plugin::ProposalLayerPluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: proposalLayerPlugin.cpp:61
bert::SkipLayerNormInterleavedPluginCreator::mPluginAttributes
static std::vector< nvinfer1::PluginField > mPluginAttributes
Definition: skipLayerNormInt8InterleavedPlugin.h:115
nvinfer1::EnumMax< BuilderFlag >
constexpr int32_t EnumMax< BuilderFlag >()
Maximum number of builder flags in BuilderFlag enum.
Definition: NvInfer.h:6402
nvinfer1::plugin::MultilevelCropAndResize::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: multilevelCropAndResizePlugin.cpp:130
inference_trt.swap_inputs_outputs
def swap_inputs_outputs(decoder_inputs, decoder_outputs)
Definition: inference_trt.py:148
tests.calibrator_test.TestEntropyCalibrator.test_one_tensor
def test_one_tensor(self, verbose)
Definition: calibrator_test.py:142
pytorch_quantization.tensor_quant.ScaledQuantDescriptor._axis
_axis
Definition: tensor_quant.py:85
test_ir.TestVariable.setup_method
def setup_method(self)
Definition: test_ir.py:111
polygraphy.backend.onnxrt.runner.OnnxrtRunner.sess
sess
Definition: backend/onnxrt/runner.py:44
bert::SkipLayerNormInterleavedPlugin::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: skipLayerNormInt8InterleavedPlugin.cpp:214
polygraphy.tools.surgeon.subtool.sanitize.Sanitize.run
def run(self, args)
Definition: sanitize.py:33
nvinfer1::ActivationType::kLEAKY_RELU
@ kLEAKY_RELU
LeakyRelu activation: x>=0 ? x : alpha * x.
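A minimal scalar sketch of the formula above. `leakyRelu` is a hypothetical helper used only for illustration; it is not part of the TensorRT API, where the activation is applied per element by an activation layer.

```cpp
#include <cassert>

// Hypothetical scalar helper illustrating the kLEAKY_RELU formula:
// x >= 0 ? x : alpha * x.
inline float leakyRelu(float x, float alpha)
{
    return x >= 0.0f ? x : alpha * x;
}
```

Non-negative inputs pass through unchanged, while negative inputs are scaled by `alpha`.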
nvinfer1::utility::TRTOption::valueRequired
bool valueRequired
True if a value is needed for an option (e.g. -N 4, --foo bar).
Definition: getOptions.h:38
half_float::detail::functions::sqrt
static expr sqrt(float arg)
Square root implementation.
Definition: samples/common/half.h:1839
half_float::detail::isunordered
bool isunordered(half x, half y)
Check if unordered.
Definition: ieee_half.h:2828
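Two floating-point values are unordered when at least one of them is NaN. The sketch below shows that check on `float` rather than `half`; `isUnorderedSketch` is a hypothetical name, and the body is an assumption about the semantics, not the code in ieee_half.h.

```cpp
#include <cassert>
#include <cmath>
#include <limits>

// Hypothetical float-based illustration of an "unordered" check:
// the pair is unordered if either operand is NaN.
inline bool isUnorderedSketch(float x, float y)
{
    return std::isnan(x) || std::isnan(y);
}
```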
tests.fixtures.models.LeNet.forward
def forward(self, x)
Definition: tools/pytorch-quantization/tests/fixtures/models.py:34
nvinfer1::plugin::CoordConvACPlugin::getOutputDataType
nvinfer1::DataType getOutputDataType(int index, const nvinfer1::DataType *inputType, int nbInputs) const override
Definition: coordConvACPlugin.cpp:160
bert::QKVToContextInterleavedPlugin::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: qkvToContextInt8InterleavedPlugin.cpp:189
helpers.tokenization.BertTokenizer.basic_tokenizer
basic_tokenizer
Definition: tokenization.py:191
tools.test_run.TestTrt.test_trt_multiple_profiles
def test_trt_multiple_profiles(self)
Definition: test_run.py:127
nvcaffeparser1::CaffeParser::mTmpAllocs
std::vector< void * > mTmpAllocs
Definition: caffeParser.h:84
polygraphy.util.misc.volume
def volume(obj)
Definition: Polygraphy/polygraphy/util/misc.py:161
tools.test_inspect.TestInspectModel.test_model_trt_sanity
def test_model_trt_sanity(self, run_inspect_model, model)
Definition: test_inspect.py:269
sample::BaseModelOptions::help
static void help(std::ostream &out)
Definition: sampleOptions.cpp:1136
util.test_cuda.TestDeviceBuffer.test_device_buffer_memcpy_sync
def test_device_buffer_memcpy_sync(self)
Definition: test_cuda.py:66
SampleGoogleNet::constructNetwork
void constructNetwork(SampleUniquePtr< nvcaffeparser1::ICaffeParser > &parser, SampleUniquePtr< nvinfer1::INetworkDefinition > &network)
Parses a Caffe model for GoogleNet and creates a TensorRT network.
Definition: sampleGoogleNet.cpp:139
nvinfer1::plugin::ProposalLayerPluginCreator::~ProposalLayerPluginCreator
~ProposalLayerPluginCreator()
Definition: proposalLayerPlugin.h:121
onnx_graphsurgeon.ir.node.Node.outputs
outputs
Definition: node.py:41
pytorch_quantization.calib.calibrator._Calibrator.reset
def reset(self)
Definition: tools/pytorch-quantization/pytorch_quantization/calib/calibrator.py:44
sample::BuildOptions::parse
void parse(Arguments &arguments) override
Definition: sampleOptions.cpp:494
bert::EmbLayerNormVarSeqlenPlugin::mGamma
bert::WeightsWithOwnership mGamma
Definition: embLayerNormVarSeqlenPlugin.h:98
test_struct.TestTensorMetadata.test_str
def test_str(self)
Definition: common/test_struct.py:21
nvinfer1::plugin::CropAndResizePluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: cropAndResizePlugin.cpp:253
nvinfer1::FillOperation
FillOperation
Enumerates the tensor fill operations that may be performed by a fill layer.
Definition: NvInfer.h:4731
nmtSample::Context
calculates context vector from raw alignment scores and memory states
Definition: contextNMT.h:32
samplesCommon::FloatMemory
TypedHostMemory< float, DataType::kFLOAT > FloatMemory
Definition: samples/common/common.h:354
nvinfer1::plugin::GridAnchorParameters::H
int32_t H
Definition: NvInferPluginUtils.h:147
backend.tf.test_loader.TestModifyGraph.test_layerwise
def test_layerwise(self)
Definition: tf/test_loader.py:51
SampleOnnxMnistCoordConvAC::mNumber
int mNumber
The number to classify.
Definition: sampleOnnxMnistCoordConvAC.cpp:79
nvinfer1::plugin::GridAnchorGenerator::detachFromContext
void detachFromContext() override
Detach the plugin object from its execution context.
Definition: gridAnchorPlugin.cpp:337
nvinfer1::plugin::PriorBox::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: priorBoxPlugin.cpp:192
polygraphy.backend.trt_legacy.TrtLegacyRunner.HostDeviceMem.device
device
Definition: backend/trt_legacy.py:182
nvinfer1::plugin::MultilevelCropAndResize::supportsFormat
bool supportsFormat(DataType type, PluginFormat format) const override
Check format support.
Definition: multilevelCropAndResizePlugin.cpp:125
onnx_graphsurgeon.ir.graph.Graph.__getattr__
def __getattr__(self, name)
Definition: graph.py:129
nvinfer1::plugin::GenerateDetectionPluginCreator::deserializePlugin
IPluginV2Ext * deserializePlugin(const char *name, const void *data, size_t length) override
Called during deserialization of plugin layer.
Definition: generateDetectionPlugin.cpp:103
UffPoolPluginV2Creator::setPluginNamespace
void setPluginNamespace(const char *libNamespace) override
Set the namespace of the plugin creator based on the plugin library it belongs to.
Definition: sampleUffPluginV2Ext.cpp:678
pytorch_quantization.nn.modules.quant_rnn
Definition: modules/quant_rnn.py:1
bert::SkipLayerNormInterleavedPlugin::mBeta
bert::WeightsWithOwnership mBeta
Definition: skipLayerNormInt8InterleavedPlugin.h:78
sample::anonymous_namespace{sampleInference.cpp}::EventType
EventType
Definition: sampleInference.cpp:260
test_infer_trt.main
def main()
Definition: test_infer_trt.py:140
nvinfer1::plugin::SplitPluginCreator::getPluginNamespace
const char * getPluginNamespace() const
Return the namespace of the plugin creator object.
Definition: split.h:174
polygraphy.backend.trt.loader.SaveEngine.path
path
Definition: backend/trt/loader.py:502
pytorch_quantization.calib.max.MaxCalibrator.__init__
def __init__(self, num_bits, axis, unsigned, track_amax=False)
Definition: max.py:37
nvinfer1::plugin::write
void write(char *&buffer, const T &val)
Definition: plugin.h:79
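The `char *&buffer` parameter suggests a cursor-style serialization helper. The sketch below assumes the helper copies the bytes of `val` into the buffer and then advances the pointer past them; `writeSketch` is a hypothetical stand-in, and the body is an assumption rather than the code in plugin.h.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Hypothetical stand-in with the same signature as the listed helper.
// Assumption: copy the bytes of val into buffer, then advance buffer by sizeof(T).
template <typename T>
void writeSketch(char*& buffer, const T& val)
{
    std::memcpy(buffer, &val, sizeof(T));
    buffer += sizeof(T);
}
```

Because the pointer is passed by reference, consecutive calls pack fields back to back, which is the usual pattern inside a plugin's `serialize` method.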
half_float::detail::true_type
bool_type< true > true_type
Definition: ieee_half.h:301
onnx_graphsurgeon.util.misc.SynchronizedList.__init__
def __init__(self, parent_obj, field_name, initial)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/util/misc.py:70
nvinfer1::plugin::NMSParameters::shareLocation
bool shareLocation
Definition: NvInferPluginUtils.h:249
DimsCHW
Descriptor for data with one channel dimension and two spatial dimensions.
polygraphy.util.misc.pickle_load
def pickle_load(path)
Definition: Polygraphy/polygraphy/util/misc.py:197
profiler.refName
def refName(name)
Definition: profiler.py:83
onnx_graphsurgeon.util.misc.SynchronizedList.append
def append(self, x)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/util/misc.py:97
polygraphy.backend.trt.loader.ModifyNetwork._network
_network
Definition: backend/trt/loader.py:203
tests.quant_pooling_test.TestQuantAvgPool3d.test_input_fake_quant
def test_input_fake_quant(self)
Definition: quant_pooling_test.py:207
SampleBuffer::format
TensorFormat format
Definition: sampleReformatFreeIO.cpp:187
perf_varseqlen.DeviceBuffer.free
def free(self)
Definition: perf_varseqlen.py:35
nmtSample::Embedder::getInputDimensionSize
virtual int getInputDimensionSize()=0
get the upper bound for the possible values of indices
nvcaffeparser1::CaffeParser::mBlobNameToTensor
BlobNameToTensor * mBlobNameToTensor
Definition: caffeParser.h:85
samplesCommon::OnnxSampleParams
The OnnxSampleParams structure groups the additional parameters required by networks that use ONNX.
Definition: argsParser.h:62
nvcaffeparser1::CaffeParser::mProtobufBufferSize
size_t mProtobufBufferSize
Definition: caffeParser.h:86
nvinfer1::INetworkDefinition::getLayer
virtual ILayer * getLayer(int32_t index) const =0
Get the layer specified by the given index.
tools.test_run.TestLogging.test_logger_verbosity
def test_logger_verbosity(self)
Definition: test_run.py:46
polygraphy.backend.trt.util.get_input_metadata_from_engine
def get_input_metadata_from_engine(engine, start_binding, end_binding)
Definition: trt/util.py:373
text.cmudict.CMUDict
Definition: cmudict.py:34
sample::TrtCudaBuffer::TrtCudaBuffer
TrtCudaBuffer(TrtCudaBuffer &&rhs)
Definition: sampleDevice.h:223
pytorch_quantization.nn.functional.ClipFunction
Definition: functional.py:27
tests.fixtures.models.LeNet.fc1
fc1
Definition: tools/pytorch-quantization/tests/fixtures/models.py:31
FCPlugin::mKernelWeights
nvinfer1::Weights mKernelWeights
Definition: samples/opensource/samplePlugin/fcPlugin.h:309
bert::fused_multihead_attention_v2_int8_384_64_kernel_cubin
unsigned char fused_multihead_attention_v2_int8_384_64_kernel_cubin[]
Definition: fused_multihead_attention_v2_int8_384_64_kernel.sm72.cpp:19
SampleConfig::getImageFileName
const char * getImageFileName() const
Returns the image file name (PPM or ASCII).
Definition: sampleConfig.h:233
nvinfer1::TensorLocation::kDEVICE
@ kDEVICE
Data stored on device.
nvinfer1::plugin::Normalize::initialize
int initialize() override
Initialize the layer for execution.
Definition: normalizePlugin.cpp:95
nvinfer1::plugin::GroupNormalizationPlugin
Definition: groupNormalizationPlugin.h:39
bert::SkipLayerNormInterleavedPluginCreator::SkipLayerNormInterleavedPluginCreator
SkipLayerNormInterleavedPluginCreator()
Definition: skipLayerNormInt8InterleavedPlugin.cpp:269
tests.quant_modules_test.TestQuantModuleReplace
Definition: quant_modules_test.py:32
nvinfer1::plugin::MultilevelProposeROI::enqueue
int enqueue(int batch_size, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: multilevelProposeROIPlugin.cpp:391
nmtSample::BLEUScoreWriter::~BLEUScoreWriter
~BLEUScoreWriter() override=default
pytorch_quantization.nn.modules.quant_conv.QuantConvTranspose2d
Definition: quant_conv.py:309
nvinfer1::LayerType::kITERATOR
@ kITERATOR
Loop Iterator layer.
nvinfer1::plugin::PriorBoxParameters::imgH
int32_t imgH
Definition: NvInferPluginUtils.h:98
nvinfer1::plugin::MultilevelCropAndResize::mInputHeight
int mInputHeight
Definition: multilevelCropAndResizePlugin.h:99
test_ir.TestGraph.test_fold_constants
def test_fold_constants(self)
Definition: test_ir.py:698
nvcaffeparser1::CaffeWeightFactory
Definition: caffeWeightFactory.h:30
half_float
Main namespace for half precision functionality.
Definition: ieee_half.h:262
polygraphy.comparator.data_loader.DataLoader.seed
seed
Definition: comparator/data_loader.py:86
SampleUffMNIST::processInput
bool processInput(const samplesCommon::BufferManager &buffers, const std::string &inputTensorName, int inputFileIdx) const
Reads the input and mean data, preprocesses, and stores the result in a managed buffer.
Definition: sampleUffMNIST.cpp:196
nvcaffeparser1::CaffeWeightFactory::getWeights
nvinfer1::Weights getWeights(const trtcaffe::BlobProto &blobMsg, const std::string &layerName)
Definition: caffeWeightFactory.cpp:395
pytorch_quantization.nn.modules.quant_instancenorm
Definition: quant_instancenorm.py:1
sample::InferenceOptions::inputs
std::unordered_map< std::string, std::string > inputs
Definition: sampleOptions.h:161
option::flag
int * flag
Definition: getopt.h:78
sample::ReportingOptions::exportProfile
std::string exportProfile
Definition: sampleOptions.h:179
nvinfer1::CUDADriverWrapper::~CUDADriverWrapper
~CUDADriverWrapper()
Definition: cudaDriverWrapper.cpp:66
nvinfer1::plugin::GenerateDetectionPluginCreator
Definition: generateDetectionPlugin.h:112
nvinfer1::plugin::CoordConvACPlugin::configurePlugin
void configurePlugin(const Dims *inputDims, int nbInputs, const Dims *outputDims, int nbOutputs, const DataType *inputTypes, const DataType *outputTypes, const bool *inputIsBroadcast, const bool *outputIsBroadcast, PluginFormat floatFormat, int maxBatchSize) override
Definition: coordConvACPlugin.cpp:106
polygraphy.backend.trt.register_callback
def register_callback()
Definition: tools/Polygraphy/polygraphy/backend/trt/__init__.py:7
test_ir.TestNodeIO.test_extend
def test_extend(self, field_names)
Definition: test_ir.py:237
nvinfer1::plugin::PriorBox::aspectRatios
Weights aspectRatios
Definition: priorBoxPlugin.h:94
nvinfer1::IPluginV2Ext::isOutputBroadcastAcrossBatch
virtual bool isOutputBroadcastAcrossBatch(int32_t outputIndex, const bool *inputIsBroadcasted, int32_t nbInputs) const =0
Return true if output tensor is broadcast across a batch.
tools.test_args.TestParseMeta.test_parse_shape_with_dim_param_including_x
def test_parse_shape_with_dim_param_including_x(self)
Definition: test_args.py:53
nvinfer1::ErrorCode::kFAILED_INITIALIZATION
@ kFAILED_INITIALIZATION
One, or more, of the components that TensorRT relies on did not initialize correctly.
nvinfer1::plugin::ReorgPluginCreator
Definition: reorgPlugin.h:92
nvinfer1::plugin::DetectionLayer::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: detectionLayerPlugin.cpp:153
polygraphy.backend.trt.util.get_bindings_per_profile
def get_bindings_per_profile(engine)
Definition: trt/util.py:428
nvinfer1::EnumMax< ResizeMode >
constexpr int32_t EnumMax< ResizeMode >()
Maximum number of elements in ResizeMode enum.
Definition: NvInfer.h:4339
backend.onnxrt.test_runner.TestOnnxrtRunner.test_dim_param_converted_to_int_shape
def test_dim_param_converted_to_int_shape(self)
Definition: onnxrt/test_runner.py:51
test_ir.TestGraph.test_generate_name
def test_generate_name(self)
Definition: test_ir.py:451
SampleCharRNNLoop::LstmParams::maxSequenceSize
nvinfer1::ITensor * maxSequenceSize
Definition: sampleCharRNN.cpp:254
sample::anonymous_namespace{sampleOptions.cpp}::printShapes
void printShapes(std::ostream &os, const char *phase, const T &shapes)
Definition: sampleOptions.cpp:312
tests.quant_conv_transposed_test.TestQuantConvTranspose3D.test_fake_quant_per_channel_other_prec
def test_fake_quant_per_channel_other_prec(self)
Definition: quant_conv_transposed_test.py:259
polygraphy.logger.logger.LoggerIndent.logger
logger
Definition: Polygraphy/polygraphy/logger/logger.py:45
nvinfer1::plugin::ReorgPluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: reorgPlugin.cpp:200
backend.tf.test_loader.TestFrozenGraphLoader.test_load_graph
def test_load_graph(self)
Definition: tf/test_loader.py:35
sample::ReportingOptions::profile
bool profile
Definition: sampleOptions.h:176
ParserOnnxConfig::~ParserOnnxConfig
~ParserOnnxConfig()
Definition: parserOnnxConfig.h:66
nvinfer1::plugin::GenerateDetectionPluginCreator::mKeepTopK
int mKeepTopK
Definition: generateDetectionPlugin.h:132
nvinfer1::EnumMax< ActivationType >
constexpr int32_t EnumMax< ActivationType >()
Maximum number of elements in ActivationType enum.
Definition: NvInferRuntimeCommon.h:126
polygraphy.tools.args.tf.config.TfConfigArgs.add_to_script
def add_to_script(self, script)
Definition: tools/Polygraphy/polygraphy/tools/args/tf/config.py:35
polygraphy.tools.util.script.Script.invoke_if_nondefault
def invoke_if_nondefault(type_str, *args, **kwargs)
Definition: script.py:110
SampleGoogleNet::teardown
bool teardown()
Used to clean up any state created in the sample class.
Definition: sampleGoogleNet.cpp:198
nvinfer1::plugin::LReLU::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: lReluPlugin.cpp:110
Logger::log
void log(Severity severity, const char *msg) override
Definition: demo/BERT/infer_c/logging.h:69
onnx_graphsurgeon.ir.tensor.Tensor.is_empty
def is_empty(self)
Definition: tensor.py:46
SampleUffFasterRcnnParams::repeat
int repeat
Definition: sampleUffFasterRCNN.cpp:82
nvcaffeparser1::IBlobNameToTensor
Object used to store and query Tensors after they have been extracted from a Caffe model using the IC...
Definition: NvCaffeParser.h:47
nvinfer1::plugin::ResizeNearestPluginCreator::~ResizeNearestPluginCreator
~ResizeNearestPluginCreator()
Definition: resizeNearestPlugin.h:101
nvinfer1::plugin::CoordConvACPlugin::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: coordConvACPlugin.cpp:59
UffPoolPluginV2Creator::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin creator object.
Definition: sampleUffPluginV2Ext.cpp:683
FCPlugin::copyToDevice
void * copyToDevice(const void *data, size_t count)
Definition: samples/opensource/samplePlugin/fcPlugin.h:243
EntropyCalibratorImpl::getBatchSize
int getBatchSize() const
Definition: EntropyCalibrator.h:49
half_float::detail::bool_type
Helper for tag dispatching.
Definition: ieee_half.h:300
SampleOnnxMnistCoordConvAC::SampleOnnxMnistCoordConvAC
SampleOnnxMnistCoordConvAC(const samplesCommon::OnnxSampleParams &params)
Definition: sampleOnnxMnistCoordConvAC.cpp:58
SampleMovieLensParams::numMoviesPerUser
int32_t numMoviesPerUser
Definition: sampleMovieLens.cpp:67
SampleOnnxMnistCoordConvAC::verifyOutput
bool verifyOutput(const samplesCommon::BufferManager &buffers)
Classifies digits and verifies the result.
Definition: sampleOnnxMnistCoordConvAC.cpp:279
pytorch_quantization.nn.modules.quant_bert.QuantBertSelfAttention.matmul_k_input_quantizer
matmul_k_input_quantizer
Definition: quant_bert.py:74
comparator.test_struct.TestRunResults.test_getitem_out_of_bounds
def test_getitem_out_of_bounds(self, run_results)
Definition: comparator/test_struct.py:53
bert::EmbLayerNormVarSeqlenPlugin::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: embLayerNormVarSeqlenPlugin.cpp:377
nmtSample::SLPProjection::mWeights
ComponentWeights::ptr mWeights
Definition: slpProjection.h:51
nvinfer1::LayerType::kCONSTANT
@ kCONSTANT
Constant layer.
nvinfer1::plugin::RPROIPlugin::canBroadcastInputAcrossBatch
bool canBroadcastInputAcrossBatch(int inputIndex) const override
Definition: nvFasterRCNNPlugin.cpp:268
nvinfer1::utility::matches
bool matches(const TRTOption &a, const TRTOption &b)
Matching for TRTOptions is defined as follows:
Definition: getOptions.cpp:47
sample::setReportableSeverity
void setReportableSeverity(Logger::Severity severity)
Definition: logger.cpp:29
test_struct.TestTensorMetadata
Definition: common/test_struct.py:20
BatchStream::getBatch
float * getBatch()
Definition: frcnnUtils.h:299
bert::Fused_multihead_attention_params_v2::use_int8_scale_max
bool use_int8_scale_max
Definition: fused_multihead_attention_v2.h:71
nvinfer1::plugin::PyramidROIAlignPluginCreator::PyramidROIAlignPluginCreator
PyramidROIAlignPluginCreator()
Definition: pyramidROIAlignPlugin.cpp:35
polygraphy.backend.tf.loader.GraphFromFrozen.__init__
def __init__(self, path)
Definition: backend/tf/loader.py:139
helpers.tokenization.BasicTokenizer.tokenize
def tokenize(self, text)
Definition: tokenization.py:226
SampleMNISTAPI::weightsMemory
std::vector< SampleUniquePtr< nvinfer1::IHostMemory > > weightsMemory
Host weights memory holder.
Definition: sampleMNISTAPI.cpp:93
nvinfer1::QuantizationFlag
QuantizationFlag
List of valid flags for quantizing the network to int8.
Definition: NvInfer.h:6354
text._symbols_to_sequence
def _symbols_to_sequence(symbols)
Definition: demo/Tacotron2/tacotron2/text/__init__.py:65
model.Decoder.p_decoder_dropout
p_decoder_dropout
Definition: demo/Tacotron2/tacotron2/model.py:251
ProposalWorkSpace::argMaxScoreDims
nvinfer1::DimsHW argMaxScoreDims
Definition: maskRCNNKernels.h:129
polygraphy.backend.trt_legacy.TrtLegacyRunner.load_engine
load_engine
Definition: backend/trt_legacy.py:219
onnx_graphsurgeon.util.misc.SynchronizedList.remove
def remove(self, x)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/util/misc.py:112
pytorch_quantization.nn.modules.quant_pooling.QuantMaxPool2d.forward
def forward(self, input)
Definition: quant_pooling.py:58
RefineNMSParameters::iouThreshold
float iouThreshold
Definition: maskRCNNKernels.h:87
BatchStream::BatchStream
BatchStream(int batchSize, int maxBatches, std::string prefix, std::vector< std::string > directories)
Definition: BatchStream.h:186
backend.trt.test_calibrator.TestCalibrator.test_calibrator_iterable_data
def test_calibrator_iterable_data(self, identity_builder_network)
Definition: test_calibrator.py:39
model.Tacotron2.parse_batch
def parse_batch(self, batch)
Definition: demo/Tacotron2/tacotron2/model.py:621
data_function.MelAudioLoader.stft
stft
Definition: waveglow/data_function.py:33
stft.STFT.phase
phase
Definition: stft.py:140
sample::defaultMaxBatch
constexpr int defaultMaxBatch
Definition: sampleOptions.h:35
MaskRCNNConfig::IMAGE_SHAPE
static const nvinfer1::Dims3 IMAGE_SHAPE
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:26
half_float::detail::enable< T, half, expr, half >::type
T type
Definition: ieee_half.h:381
half_float::detail::fpclassify
int fpclassify(half arg)
Classify floating point value.
Definition: ieee_half.h:2720
BatchStream::skip
void skip(int skipCount)
Definition: frcnnUtils.h:281
nvinfer1::plugin::Reorg::getWorkspaceSize
size_t getWorkspaceSize(int maxBatchSize) const override
Definition: reorgPlugin.cpp:100
pytorch_quantization.nn._functions.quant_rnn.variable_recurrent_factory
def variable_recurrent_factory(inner, reverse=False)
Definition: _functions/quant_rnn.py:149
SampleINT8API
The SampleINT8API class implements INT8 inference on classification networks.
Definition: sampleINT8API.cpp:76
BertInferenceRunner::run
py::array_t< float > run(py::array_t< int > inputIds, py::array_t< int > segmentIds, py::array_t< int > inputMask)
Definition: infer_c.cpp:36
sample::BuildOptions::safe
bool safe
Definition: sampleOptions.h:116
polygraphy.logger.logger.Logger.exit_on_errors
exit_on_errors
Definition: Polygraphy/polygraphy/logger/logger.py:134
SampleErrorRecorder
A simple implementation of the IErrorRecorder interface for use by samples.
Definition: ErrorRecorder.h:37
builder.load_onnx_weights_and_quant
def load_onnx_weights_and_quant(path, config)
Definition: builder.py:478
nvinfer1::ProfilingVerbosity::kNONE
@ kNONE
Turn off NVTX traces.
nvinfer1::plugin::ProposalLayer::canBroadcastInputAcrossBatch
bool canBroadcastInputAcrossBatch(int inputIndex) const override
Definition: proposalLayerPlugin.cpp:343
SampleConfig::isDebug
bool isDebug() const
Definition: sampleConfig.h:306
nvinfer1::IPluginRegistry
Single registration point for all plugins in an application. It is used to find plugin implementation...
Definition: NvInferRuntimeCommon.h:958
nvinfer1::plugin::GroupNormalizationPluginCreator::mNamespace
std::string mNamespace
Definition: groupNormalizationPlugin.h:143
nmtSample::BeamSearchPolicy::mLikelihoodCombinationOperator
LikelihoodCombinationOperator::ptr mLikelihoodCombinationOperator
Definition: beamSearchPolicy.h:67
half_float::detail::logb
half logb(half arg)
Extract exponent.
Definition: ieee_half.h:2676
samplesCommon::GenericBuffer::data
const void * data() const
Returns pointer to underlying array.
Definition: buffers.h:120
pytorch_quantization.nn.modules.quant_rnn.QuantLSTMCell.weight_hh
weight_hh
Definition: modules/quant_rnn.py:391
nvinfer1::plugin::ProposalLayer::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: proposalLayerPlugin.cpp:192
samplesCommon::infer_object
std::shared_ptr< T > infer_object(T *obj)
Definition: samples/common/common.h:388
nvcaffeparser1::ICaffeParser::getErrorRecorder
virtual nvinfer1::IErrorRecorder * getErrorRecorder() const =0
Get the ErrorRecorder assigned to this interface.
nvinfer1::plugin::MultilevelProposeROI::supportsFormat
bool supportsFormat(DataType type, PluginFormat format) const override
Check format support.
Definition: multilevelProposeROIPlugin.cpp:206
polygraphy.tools.surgeon.subtool.extract.Extract.add_parser_args
def add_parser_args(self, parser)
Definition: extract.py:32
nvinfer1::plugin::ProposalPlugin::mRpnStride
size_t mRpnStride
Definition: proposalPlugin.h:114
anonymous_namespace{coordConvACPlugin.cpp}::COORDCONV_AC_PLUGIN_NAME
const char * COORDCONV_AC_PLUGIN_NAME
Definition: coordConvACPlugin.cpp:28
pytorch_quantization.nn.modules.quant_pooling.QuantMaxPool3d
Definition: quant_pooling.py:62
pytorch_quantization.nn.modules.quant_rnn.QuantRNNBase.batch_first
batch_first
Definition: modules/quant_rnn.py:50
nvinfer1::plugin::ReorgPluginCreator::ReorgPluginCreator
ReorgPluginCreator()
Definition: reorgPlugin.cpp:182
nvinfer1::LayerType::kRECURRENCE
@ kRECURRENCE
Loop Recurrence layer.
polygraphy.tools.base.tool.Tool.makers
makers
Definition: tool.py:31
polygraphy.comparator.compare
Definition: compare.py:1
SampleConfig::setTextFileName
void setTextFileName(const char *textFilename)
Set the File Name of the Network Description as a Text File.
Definition: sampleConfig.h:141
bert::threadsPerCta128
constexpr size_t threadsPerCta128
Definition: embLayerNormVarSeqlenPlugin.cpp:32
nvinfer1::plugin::ProposalPluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: proposalPlugin.cpp:344
sample::anonymous_namespace{sampleInference.cpp}::Enqueue
Definition: sampleInference.cpp:175
nvinfer1::ErrorCode::kFAILED_COMPUTATION
@ kFAILED_COMPUTATION
An error occurred during execution that caused the data to become corrupted, but execution finished.
nvinfer1::plugin::InstanceNormalizationPluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: instanceNormalizationPlugin.h:121
nmtSample::Projection::ptr
std::shared_ptr< Projection > ptr
Definition: projection.h:35
IRNNv2Layer
An RNN layer in a network definition, version 2.
half_float::detail::sinh
expr sinh(half arg)
Hyperbolic sine.
Definition: ieee_half.h:2473
nvinfer1::plugin::ResizeNearest::ResizeNearest
ResizeNearest(float scale)
Definition: resizeNearestPlugin.cpp:82
nvinfer1::plugin::Normalize::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: normalizePlugin.cpp:234
samplesCommon::InferDeleter::operator()
void operator()(T *obj) const
Definition: samples/common/common.h:378
nvinfer1::plugin::BatchedNMSPlugin::clone
IPluginV2Ext * clone() const override
Clone the plugin object.
Definition: batchedNMSPlugin.cpp:385
MultilevelProposeROIWorkSpace::totalSize
size_t totalSize
Definition: maskRCNNKernels.h:186
SampleMovieLensParams::topKMovies
int32_t topKMovies
Definition: sampleMovieLens.cpp:66
nvinfer1::IPaddingLayer::~IPaddingLayer
virtual ~IPaddingLayer()
Definition: NvInfer.h:3682
sample::TrtCudaGraph
Managed CUDA graph.
Definition: sampleDevice.h:165
nvinfer1::plugin::Normalize::channelShared
bool channelShared
Definition: normalizePlugin.h:99
nvinfer1::IPluginV2::getWorkspaceSize
virtual size_t getWorkspaceSize(int32_t maxBatchSize) const =0
Find the workspace size required by the layer.
nmtSample::BLEUScoreWriter::mMatchesByOrder
std::vector< size_t > mMatchesByOrder
Definition: bleuScoreWriter.h:58
comparator.test_comparator.TestComparator.test_errors_do_not_hang
def test_errors_do_not_hang(self)
Definition: test_comparator.py:98
pytorch_quantization.tensor_quant.ScaledQuantDescriptor._calib_method
_calib_method
Definition: tensor_quant.py:103
sample::saveEngine
bool saveEngine(const ICudaEngine &engine, const std::string &fileName, std::ostream &err)
Save an engine into a file.
Definition: sampleEngines.cpp:662
pytorch_quantization.nn._functions.quant_rnn.RNNTanhCell
def RNNTanhCell(input, hidden, w_ih, w_hh, b_ih=None, b_hh=None)
Definition: _functions/quant_rnn.py:41
nvinfer1::IPluginV2::supportsFormat
virtual bool supportsFormat(DataType type, PluginFormat format) const =0
Check format support.
pytorch_quantization.nn.modules.quant_conv.QuantConv3d
Definition: quant_conv.py:134
polygraphy.comparator.data_loader.DataLoader.__getitem__
def __getitem__(self, index)
Definition: comparator/data_loader.py:98
SampleErrorRecorder::getNbErrors
int32_t getNbErrors() const noexcept final
Return the number of errors.
Definition: ErrorRecorder.h:46
pytorch_quantization.calib.max.MaxCalibrator._amaxs
_amaxs
Definition: max.py:41
pytorch_quantization.nn.modules.quant_pooling.QuantAvgPool1d.__init__
def __init__(self, kernel_size, stride=None, padding=0, ceil_mode=False, count_include_pad=True, **kwargs)
Definition: quant_pooling.py:78
backend.onnx.test_loader.TestExportOnnxFromTf
Definition: onnx/test_loader.py:41
helpers.data_processing._get_best_indexes
def _get_best_indexes(logits, n_best_size)
Definition: data_processing.py:211
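The signature above suggests a helper that picks the highest-scoring positions from a list of logits. A minimal sketch, assuming it returns the indexes of the n_best_size largest logits with the best first. The name get_best_indexes and the body are illustrative, not the code at data_processing.py:211.

```python
def get_best_indexes(logits, n_best_size):
    """Return the indexes of the n_best_size largest logits, best first.

    Hypothetical stand-in for _get_best_indexes; the behavior is assumed
    from the signature only.
    """
    # Pair each logit with its position, then sort by score, highest first.
    ranked = sorted(enumerate(logits), key=lambda pair: pair[1], reverse=True)
    # Keep only the positions of the top n_best_size entries.
    return [index for index, _ in ranked[:n_best_size]]


best = get_best_indexes([0.1, 2.5, -1.0, 1.7], n_best_size=2)
```

Asking for more entries than there are logits returns every index, because the slice is bounded by the list length.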
SampleCharRNNLoop::SampleCharRNNLoop
SampleCharRNNLoop(SampleCharRNNParams params)
Definition: sampleCharRNN.cpp:257
model.Decoder
Definition: demo/Tacotron2/tacotron2/model.py:239
nvinfer1::plugin::FlattenConcat
Definition: flattenConcat.h:33
data_function.TextMelLoader.get_mel
def get_mel(self, filename)
Definition: tacotron2/data_function.py:53
nvinfer1::plugin::GroupNormalizationPluginCreator::GroupNormalizationPluginCreator
GroupNormalizationPluginCreator()
Definition: groupNormalizationPlugin.cpp:251
polygraphy.util.cuda.DeviceBuffer
Definition: cuda.py:113
bert::Fused_multihead_attention_params::qkv_ptr
void * qkv_ptr
Definition: fused_multihead_attention.h:51
sample::TrtCudaGraph::endCapture
void endCapture(TrtCudaStream &stream)
Definition: sampleDevice.h:197
nvinfer1::plugin::NormalizePluginCreator::deserializePlugin
IPluginV2Ext * deserializePlugin(const char *name, const void *serialData, size_t serialLength) override
Called during deserialization of plugin layer.
Definition: normalizePlugin.cpp:329
tests.fixtures.models.QuantLeNet.conv1
conv1
Definition: tools/pytorch-quantization/tests/fixtures/models.py:47
tests.calibrator_test.TestMSECalibrator.test_one_tensor
def test_one_tensor(self, verbose)
Definition: calibrator_test.py:198
nvinfer1::plugin::BatchedNMSDynamicPlugin::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: batchedNMSPlugin.cpp:424
nvinfer1::plugin::SplitPluginCreator::mNamespace
std::string mNamespace
Definition: split.h:180
common.version
def version(version_str)
Definition: common.py:19
nvinfer1::plugin::BatchedNMSPlugin::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: batchedNMSPlugin.cpp:254
pytorch_quantization.tensor_quant.ScaledQuantDescriptor._unsigned
_unsigned
Definition: tensor_quant.py:104
nvinfer1::plugin::GridAnchorGenerator::setPluginNamespace
void setPluginNamespace(const char *pluginNamespace) override
Set the namespace that this plugin object belongs to.
Definition: gridAnchorPlugin.cpp:290
samplesCommon::TypedHostMemory
Definition: samples/common/common.h:335
nmtSample::BenchmarkWriter::mInputTokenCount
int mInputTokenCount
Definition: benchmarkWriter.h:49
nmtSample::BeamSearchPolicy::Ray::vocabularyId
int vocabularyId
Definition: beamSearchPolicy.h:58
nvinfer1::plugin::GridAnchorGenerator::supportsFormat
bool supportsFormat(DataType type, PluginFormat format) const override
Check format support.
Definition: gridAnchorPlugin.cpp:274
samplesCommon::ManagedBuffer::deviceBuffer
DeviceBuffer deviceBuffer
Definition: buffers.h:225
tests.quant_pooling_test.TestQuantAvgPool1d.test_input_fake_quant
def test_input_fake_quant(self)
Definition: quant_pooling_test.py:138
nmtSample::BeamSearchPolicy::mValidSamples
std::vector< bool > mValidSamples
Definition: beamSearchPolicy.h:69
BatchStream::next
bool next() override
Definition: BatchStream.h:217
backend.tf.test_runner.TestTfRunner.test_can_name_runner
def test_can_name_runner(self)
Definition: tf/test_runner.py:27
nvinfer1::plugin::InstanceNormalizationPlugin::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: instanceNormalizationPlugin.cpp:239
anonymous_namespace{pluginLogging.h}::LOG_INFO
LogStreamConsumer LOG_INFO(const Logger &logger)
Produces a LogStreamConsumer object that can be used to log messages of severity kINFO.
Definition: pluginLogging.h:485
bert::QKVToContextInterleavedPluginCreator::setPluginNamespace
void setPluginNamespace(const char *pluginNamespace) override
Set the namespace of the plugin creator based on the plugin library it belongs to.
Definition: qkvToContextInt8InterleavedPlugin.cpp:353
FCPlugin::enqueue
virtual int enqueue(int batchSize, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: samples/opensource/samplePlugin/fcPlugin.h:169
sample::Profiler::reportLayerTime
void reportLayerTime(const char *layerName, float timeMs) override
Layer time reporting callback.
Definition: sampleReporting.cpp:243
nvinfer1::plugin::ProposalLayer::generate_pyramid_anchors
void generate_pyramid_anchors(const nvinfer1::Dims &image_size)
Definition: proposalLayerPlugin.cpp:272
nmtSample::SequenceProperties
Provides encoder/decoder-relevant properties of sequences.
Definition: sequenceProperties.h:29
EntropyCalibratorImpl::mCalibrationTableName
std::string mCalibrationTableName
Definition: EntropyCalibrator.h:89
parserutils::combineIndexDimensions
int combineIndexDimensions(int batchSize, const nvinfer1::Dims &d)
Definition: parserUtils.h:126
nvinfer1::plugin::InstanceNormalizationPluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: instanceNormalizationPlugin.cpp:340
nvinfer1::Dims3
Descriptor for three-dimensional data.
Definition: NvInfer.h:164
test_examples.test_api_examples
def test_api_examples(example)
Definition: Polygraphy/tests/test_examples.py:101
BboxInfo::label
int label
Definition: bboxUtils.h:43
UffPoolPluginV2::destroy
void destroy() override
Destroy the plugin object.
Definition: sampleUffPluginV2Ext.cpp:547
half_float::detail::half_cast
T half_cast(U arg)
Cast to or from half-precision floating point number.
Definition: ieee_half.h:2848
polygraphy.tools.surgeon.subtool.base.BaseSurgeonSubtool.__init__
def __init__(self, name, inputs=None, data=False, shape_inference_default=None)
Definition: surgeon/subtool/base.py:28
nvinfer1::IPluginV2DynamicExt::kFORMAT_COMBINATION_LIMIT
static constexpr int32_t kFORMAT_COMBINATION_LIMIT
Limit on number of format combinations accepted.
Definition: NvInferRuntime.h:453
half_float::detail::conditional::type
T type
Definition: samples/common/half.h:324
nvinfer1::ILogger::Severity::kINTERNAL_ERROR
@ kINTERNAL_ERROR
Internal error has occurred. Execution is unrecoverable.
nvinfer1::plugin::MultilevelProposeROIPluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: multilevelProposeROIPlugin.cpp:54
nvinfer1::OptProfileSelector
OptProfileSelector
When setting or querying optimization profile parameters (such as shape tensor inputs or dynamic dime...
Definition: NvInferRuntime.h:998
pytorch_quantization.nn.modules.quant_rnn.QuantRNNBase.input_size
input_size
Definition: modules/quant_rnn.py:46
export_tacotron2_onnx.DecoderIter.gate_layer
gate_layer
Definition: export_tacotron2_onnx.py:129
onnx_graphsurgeon.ir.node.Node.__str__
def __str__(self)
Definition: node.py:105
RefineDetectionWorkSpace::sortClassScoreDims
nvinfer1::DimsHW sortClassScoreDims
Definition: maskRCNNKernels.h:100
nvinfer1::plugin::InstanceNormalizationPlugin::supportsFormatCombination
bool supportsFormatCombination(int pos, const nvinfer1::PluginTensorDesc *inOut, int nbInputs, int nbOutputs) override
Definition: instanceNormalizationPlugin.cpp:231
pytorch_quantization.nn.modules.quant_rnn.QuantRNNCell.__init__
def __init__(self, input_size, hidden_size, bias=True, nonlinearity="tanh")
Definition: modules/quant_rnn.py:340
polygraphy.tools.args.tf.config.TfConfigArgs.gpu_memory_fraction
gpu_memory_fraction
Definition: tools/Polygraphy/polygraphy/tools/args/tf/config.py:30
sample::TrtCudaEvent::mEvent
cudaEvent_t mEvent
Definition: sampleDevice.h:153
nvinfer1::ITensor
A tensor in a network definition.
Definition: NvInfer.h:502
nvinfer1::plugin::SpecialSlicePluginCreator::deserializePlugin
IPluginV2Ext * deserializePlugin(const char *name, const void *data, size_t length) override
Called during deserialization of plugin layer.
Definition: specialSlicePlugin.cpp:62
SampleConfig::getFailurePercentage
float getFailurePercentage() const
Definition: sampleConfig.h:286
SampleCharRNNLoop::LstmParams::recurrentWeights
nvinfer1::ITensor * recurrentWeights
Definition: sampleCharRNN.cpp:251
pytorch_quantization.quant_modules.QuantModuleReplacementHelper.default_quant_map
default_quant_map
Definition: quant_modules.py:59
test_ir.TestNodeIO.test_del_index
def test_del_index(self, field_names)
Definition: test_ir.py:277
Batch::mContext
SampleUniquePtr< nvinfer1::IExecutionContext > mContext
Definition: sampleMovieLensMPS.cpp:246
nvinfer1::WeightsRole
WeightsRole
How a layer uses particular Weights.
Definition: NvInferRuntime.h:685
SampleCharRNNMaps
Definition: sampleCharRNN.cpp:96
nvinfer1::plugin::ProposalLayer::mPreNMSTopK
int mPreNMSTopK
Definition: proposalLayerPlugin.h:98
nvinfer1::plugin::CropAndResizePlugin::mLayerName
const std::string mLayerName
Definition: cropAndResizePlugin.h:100
SampleUffFasterRcnnParams::saveEngine
std::string saveEngine
Definition: sampleUffFasterRCNN.cpp:85
sample::anonymous_namespace{sampleInference.cpp}::StreamType
StreamType
Definition: sampleInference.cpp:252
polygraphy.tools.args.onnx.loader.OnnxLoaderArgs
Definition: tools/args/onnx/loader.py:22
nvinfer1::plugin::Region::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: regionPlugin.cpp:234
nvinfer1::plugin::BatchedNMSPlugin::BatchedNMSPlugin
BatchedNMSPlugin(NMSParameters param)
Definition: batchedNMSPlugin.cpp:41
polygraphy.tools.args.base.BaseArgs.parse
def parse(self, args)
Definition: args/base.py:32
nvinfer1::plugin::Region::configurePlugin
void configurePlugin(const Dims *inputDims, int nbInputs, const Dims *outputDims, int nbOutputs, const DataType *inputTypes, const DataType *outputTypes, const bool *inputIsBroadcast, const bool *outputIsBroadcast, PluginFormat floatFormat, int maxBatchSize) override
Definition: regionPlugin.cpp:444
nvinfer1::IRuntime::getDLACore
virtual int32_t getDLACore() const noexcept=0
Get the DLA core that the engine executes on.
text.text_to_sequence
def text_to_sequence(text, cleaner_names)
Definition: demo/Tacotron2/tacotron2/text/__init__.py:15
polygraphy.tools.precision.precision.STCheckerBase.precision
precision
Definition: precision.py:60
nvinfer1::plugin::FlattenConcat::attachToContext
void attachToContext(cudnnContext *cudnnContext, cublasContext *cublasContext, IGpuAllocator *gpuAllocator) override
Attach the plugin object to an execution context and grant the plugin the access to some context reso...
Definition: flattenConcat.cpp:188
tests.fixtures.models.LeNet.conv2
conv2
Definition: tools/pytorch-quantization/tests/fixtures/models.py:30
nvinfer1::plugin::GroupNormalizationPlugin::setPluginNamespace
void setPluginNamespace(const char *pluginNamespace) override
Set the namespace that this plugin object belongs to.
Definition: groupNormalizationPlugin.cpp:241
nvinfer1::EnumMax< OptProfileSelector >
constexpr int32_t EnumMax< OptProfileSelector >()
Definition: NvInferRuntime.h:1007
nvinfer1::ILayer
Base class for all layer classes in a network definition.
Definition: NvInfer.h:773
inference_trt.infer_waveglow_trt
def infer_waveglow_trt(waveglow, waveglow_context, mel, measurements, fp16)
Definition: inference_trt.py:258
sample::Logger::reportFail
static int reportFail(const TestAtom &testAtom)
Definition: samples/common/logging.h:353
nvinfer1::plugin::BatchTilePlugin::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: batchTilePlugin.cpp:73
SampleUffSSD::SampleUniquePtr
std::unique_ptr< T, samplesCommon::InferDeleter > SampleUniquePtr
Definition: sampleUffSSD.cpp:66
tests.quant_pooling_test.TestQuantMaxPool1d.test_input_fake_quant
def test_input_fake_quant(self)
Definition: quant_pooling_test.py:48
Int8EntropyCalibrator2::getBatch
bool getBatch(void *bindings[], const char *names[], int nbBindings) override
Definition: EntropyCalibrator.h:116
SampleMovieLensArgs::help
bool help
Definition: sampleMovieLens.cpp:524
nvinfer1::IDimensionExpr
Definition: NvInferRuntime.h:325
nvinfer1::ActivationType::kSIGMOID
@ kSIGMOID
Sigmoid activation.
onnx_graphsurgeon.logger.logger.LoggerIndent.old_indent
old_indent
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:29
polygraphy.backend.trt.loader.CreateConfig.__init__
def __init__(self, max_workspace_size=None, tf32=None, fp16=None, int8=None, profiles=None, calibrator=None, strict_types=None)
Definition: backend/trt/loader.py:300
nvinfer1::plugin::CropAndResizePlugin::destroy
void destroy() override
Destroy the plugin object.
Definition: cropAndResizePlugin.cpp:175
SampleBuffer::getBufferSize
int getBufferSize()
Definition: sampleReformatFreeIO.cpp:193
nvuffparser::UffInputOrder
UffInputOrder
The supported input orders.
Definition: NvUffParser.h:45
SampleFasterRCNN::kIMG_CHANNELS
static const int kIMG_CHANNELS
Definition: sampleFasterRCNN.cpp:87
model.Decoder.infer
def infer(self, memory, memory_lengths)
Definition: demo/Tacotron2/tacotron2/model.py:504
MaskRCNNConfig::BBOX_STD_DEV
static const float BBOX_STD_DEV[]
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:37
half_float::detail::functions::log
static expr log(float arg)
Logarithm implementation.
Definition: samples/common/half.h:1799
nvinfer1::IRuntime::setErrorRecorder
virtual void setErrorRecorder(IErrorRecorder *recorder) noexcept=0
Set the ErrorRecorder for this interface.
nvinfer1::IPluginV2Ext
Plugin class for user-implemented layers.
Definition: NvInferRuntimeCommon.h:581
pytorch_quantization.nn.modules._utils.QuantMixin.default_quant_desc_input
default_quant_desc_input
Definition: _utils.py:33
comparator.test_data_loader.TestDataLoader
Definition: test_data_loader.py:28
nvinfer1::plugin::CoordConvACPlugin::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: coordConvACPlugin.cpp:88
sample::ReportingOptions::verbose
bool verbose
Definition: sampleOptions.h:171
nvinfer1::plugin::MultilevelCropAndResizePluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: multilevelCropAndResizePlugin.cpp:52
MultilevelProposeROIWorkSpace::sortClassSampleIdxDims
nvinfer1::DimsHW sortClassSampleIdxDims
Definition: maskRCNNKernels.h:169
nvinfer1::plugin::CropAndResizePlugin::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: cropAndResizePlugin.cpp:173
polygraphy.tools.args.model.ModelArgs._model_type
_model_type
Definition: tools/Polygraphy/polygraphy/tools/args/model.py:30
SampleMovieLens::verifyOutput
bool verifyOutput(uint32_t *userInputPtr, uint32_t *, uint32_t *topKItemNumberPtr, float *topKItemProbPtr)
Verifies the inference output with ground truth and logs the results.
Definition: sampleMovieLens.cpp:476
nvinfer1::IBuilder::createBuilderConfig
virtual nvinfer1::IBuilderConfig * createBuilderConfig()=0
Create a builder configuration object.
evaluate-v1.metric_max_over_ground_truths
def metric_max_over_ground_truths(metric_fn, prediction, ground_truths)
Definition: evaluate-v1.1.py:64
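The parameter names above suggest that the function scores one prediction against several acceptable answers and keeps the best score. A minimal sketch under that assumption. The exact_match metric is a hypothetical helper added for illustration and is not taken from evaluate-v1.1.py.

```python
def metric_max_over_ground_truths(metric_fn, prediction, ground_truths):
    """Return the best score of prediction against any ground truth (assumed behavior)."""
    # Score the prediction against every acceptable answer.
    scores = [metric_fn(prediction, truth) for truth in ground_truths]
    # The prediction gets credit for its closest match.
    return max(scores)


def exact_match(prediction, truth):
    """Hypothetical metric: 1.0 if the strings match ignoring case and outer spaces."""
    return float(prediction.strip().lower() == truth.strip().lower())


score = metric_max_over_ground_truths(exact_match, "Paris", ["paris", "City of Paris"])
```

In this sketch an empty ground_truths list raises ValueError from max(); a caller would need to decide how to handle that case.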
SampleSSD::constructNetwork
bool constructNetwork(SampleUniquePtr< nvinfer1::IBuilder > &builder, SampleUniquePtr< nvinfer1::INetworkDefinition > &network, SampleUniquePtr< nvinfer1::IBuilderConfig > &config, SampleUniquePtr< nvcaffeparser1::ICaffeParser > &parser)
Parses a Caffe model for SSD and creates a TensorRT network.
Definition: sampleSSD.cpp:171
bert::fused_multihead_attention_v2_int8_192_64_kernel_sm75_cubin
unsigned char fused_multihead_attention_v2_int8_192_64_kernel_sm75_cubin[]
Definition: fused_multihead_attention_v2_int8_192_64_kernel.sm75.cpp:19
BufferDesc::BufferDesc
BufferDesc()=default
nvinfer1::PaddingMode::kCAFFE_ROUND_UP
@ kCAFFE_ROUND_UP
Use CAFFE padding, rounding the output size up; uses the prePadding value.
half_float::detail::tan
expr tan(half arg)
Tangent function.
Definition: ieee_half.h:2431
onnx_graphsurgeon.ir.node.Node.inputs
inputs
Definition: node.py:40
nvinfer1::plugin::BatchedNMSPlugin::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: batchedNMSPlugin.cpp:95
layers.TacotronSTFT.sampling_rate
sampling_rate
Definition: layers.py:61
bert::SkipLayerNormInterleavedPlugin::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: skipLayerNormInt8InterleavedPlugin.cpp:229
sample::anonymous_namespace{sampleOptions.cpp}::splitNameAndValue
std::pair< std::string, T > splitNameAndValue(const std::string &s)
Definition: sampleOptions.cpp:145
polygraphy.backend.trt_legacy.ParseNetworkFromOnnxLegacy.__call__
def __call__(self)
Definition: backend/trt_legacy.py:130
bert::fused_multihead_attention_v2_int8_128_64_kernel_cubin_len
unsigned int fused_multihead_attention_v2_int8_128_64_kernel_cubin_len
Definition: fused_multihead_attention_v2_int8_128_64_kernel.sm72.cpp:20170
nvinfer1::plugin::PriorBoxParameters::stepH
float stepH
Definition: NvInferPluginUtils.h:99
inference.prepare_input_sequence
def prepare_input_sequence(texts, cpu_run=False)
Definition: Tacotron2/inference.py:146
util.test_misc.test_is_valid_shape_override
def test_is_valid_shape_override(case)
Definition: test_misc.py:57
nvinfer1::IShapeLayer::~IShapeLayer
virtual ~IShapeLayer()
Definition: NvInfer.h:4047
sample::dumpRefittable
void dumpRefittable(nvinfer1::ICudaEngine &engine)
Log refittable layers and weights of a refittable engine.
Definition: sampleEngines.cpp:617
polygraphy.tools.args.trt.loader.TrtLoaderArgs.add_trt_serialized_engine_loader
def add_trt_serialized_engine_loader(self, script)
Definition: tools/args/trt/loader.py:194
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer.clip
clip
Definition: tensor_quantizer.py:94
nvinfer1::EnumMax< RNNDirection >
constexpr int32_t EnumMax< RNNDirection >()
Maximum number of elements in RNNDirection enum.
Definition: NvInfer.h:2821
polygraphy.logger.logger.Logger.verbosity
def verbosity(self, severity=CRITICAL)
Definition: Polygraphy/polygraphy/logger/logger.py:169
nvinfer1::plugin::NMSPluginCreator::mFC
static PluginFieldCollection mFC
Definition: nmsPlugin.h:112
PoolParameters::mW
int mW
Definition: sampleUffPluginV2Ext.cpp:175
half_float::detail::functions::lgamma
static expr lgamma(float arg)
Gamma logarithm implementation.
Definition: samples/common/half.h:2030
builder.generate_calibration_cache
def generate_calibration_cache(sequence_lengths, workspace_size, config, weights_dict, squad_json, vocab_file, calibrationCacheFile, calib_num)
Definition: builder.py:623
SampleErrorRecorder::mStackLock
std::mutex mStackLock
Definition: ErrorRecorder.h:125
samplesCommon::OnnxSampleParams::onnxFileName
std::string onnxFileName
Filename of ONNX file of a network.
Definition: argsParser.h:64
tests.quant_rnn_test.TestQuantLSTMCell.test_basic_forward
def test_basic_forward(self, verbose)
Definition: quant_rnn_test.py:66
tests.quant_conv_transposed_test.TestQuantConvTranspose2D.test_weight_fake_quant_per_tensor
def test_weight_fake_quant_per_tensor(self)
Definition: quant_conv_transposed_test.py:69
nvinfer1::plugin::PyramidROIAlignPluginCreator::deserializePlugin
IPluginV2Ext * deserializePlugin(const char *name, const void *data, size_t length) override
Called during deserialization of plugin layer.
Definition: pyramidROIAlignPlugin.cpp:73
MaskRCNNConfig::BACKBONE_STRIDES
static const std::vector< float > BACKBONE_STRIDES
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:51
Semaphore::open
void open()
Definition: sampleMovieLensMPS.cpp:104
nvinfer1::plugin::PluginCreatorRegistry::PluginCreatorRegistry
PluginCreatorRegistry()
Definition: InferPlugin.cpp:140
half_float::detail::functions::fma
static expr fma(float x, float y, float z)
Fused multiply-add implementation.
Definition: samples/common/half.h:1748
polygraphy.backend.tf.util.find_nodes_by_ops
def find_nodes_by_ops(graphdef, ops)
Definition: tf/util.py:51
pytorch_quantization.nn.modules.quant_pooling.QuantAdaptiveAvgPool2d.__init__
def __init__(self, output_size, **kwargs)
Definition: quant_pooling.py:131
nvinfer1::IRefitter::setErrorRecorder
virtual void setErrorRecorder(IErrorRecorder *recorder)=0
Set the ErrorRecorder for this interface.
samplesCommon::PPM::buffer
uint8_t buffer[C *H *W]
Definition: samples/common/common.h:656
Args::enableGraph
bool enableGraph
Definition: demo/BERT/infer_c/common.h:35
samplesCommon::GenericBuffer::operator=
GenericBuffer & operator=(GenericBuffer &&buf)
Definition: buffers.h:92
SampleUffSSDParams::labelsFileName
std::string labelsFileName
The file name of the class labels.
Definition: sampleUffSSD.cpp:51
nvinfer1::plugin::LReLU::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: lReluPlugin.cpp:103
main
Definition: main.py:1
polygraphy.tools.util.script.Script.from_imports
from_imports
Definition: script.py:136
inference_trt.init_decoder_inputs
def init_decoder_inputs(memory, processed_memory, memory_lengths)
Definition: inference_trt.py:68
nvinfer1::plugin::ProposalPlugin::getWorkspaceSize
size_t getWorkspaceSize(int) const override
Definition: proposalPlugin.cpp:183
PluginFactory::createPlugin
virtual IPlugin * createPlugin(const char *layerName, const nvinfer1::Weights *weights, int nbWeights) override
Definition: samples/opensource/samplePlugin/fcPlugin.h:335
model.Encoder.convolutions
convolutions
Definition: demo/Tacotron2/tacotron2/model.py:192
nvinfer1::IProfiler
Application-implemented interface for profiling.
Definition: NvInferRuntime.h:665
option
Definition: getopt.h:74
SampleConfig::mMaxBatchSize
int64_t mMaxBatchSize
Definition: sampleConfig.h:45
nvinfer1::plugin::PriorBoxParameters::numAspectRatios
int32_t numAspectRatios
Definition: NvInferPluginUtils.h:94
MNISTBatchStream::getDims
nvinfer1::Dims getDims() const override
Definition: BatchStream.h:93
polygraphy.tools.args.trt.runner.TrtRunnerArgs.parse
def parse(self, args)
Definition: tools/args/trt/runner.py:47
IRNNLayer
An RNN layer in a network definition.
polygraphy.tools.inspect.subtool.model.Model.add_parser_args
def add_parser_args(self, parser)
Definition: tools/Polygraphy/polygraphy/tools/inspect/subtool/model.py:35
nvinfer1::plugin::Reorg::Reorg
Reorg(int stride)
Definition: reorgPlugin.cpp:36
nvinfer1::plugin::ProposalLayer
Definition: proposalLayerPlugin.h:35
SampleOnnxMnistCoordConvAC::SampleUniquePtr
std::unique_ptr< T, samplesCommon::InferDeleter > SampleUniquePtr
Definition: sampleOnnxMnistCoordConvAC.cpp:55
nvinfer1::DimsHW::h
int32_t & h()
Get the height.
Definition: NvInfer.h:124
nvinfer1::plugin::CoordConvACPlugin
Definition: coordConvACPlugin.h:32
perf_varseqlen.DeviceBuffer
Definition: perf_varseqlen.py:28
bert::Fused_multihead_attention_params_v2::cu_seqlens
int * cu_seqlens
Definition: fused_multihead_attention_v2.h:65
sample::BuildOptions::enabledTactics
nvinfer1::TacticSources enabledTactics
Definition: sampleOptions.h:127
Logger::Logger
Logger(Severity severity=Severity::kWARNING)
Definition: pluginLogging.h:230
nvinfer1::plugin::NormalizePluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: normalizePlugin.cpp:277
nvinfer1::IPlugin::initialize
virtual int32_t initialize()=0
Initialize the layer for execution.
tools.test_inspect.TestInspectData.test_results
def test_results(self, opts)
Definition: test_inspect.py:293
tests.quant_instancenorm_test.TestQuantInstanceNorm1D.test_fake_quant_per_tensor
def test_fake_quant_per_tensor(self)
Definition: quant_instancenorm_test.py:63
nvinfer1::ILoopOutputLayer
An ILoopOutputLayer is the sole way to get output from a loop.
Definition: NvInfer.h:4571
polygraphy.backend.trt.runner.TrtRunner
Definition: backend/trt/runner.py:30
nvinfer1::plugin::RegionPluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: regionPlugin.h:122
sample::Logger::TestAtom
Opaque handle that holds logging information for a particular test.
Definition: samples/common/logging.h:266
nmtSample::SLPAttention::getInfo
std::string getInfo() override
Get the textual description of the component.
Definition: slpAttention.cpp:73
Args::pargsVec
std::vector< OutputArgs > pargsVec
Definition: sampleMovieLensMPS.cpp:185
nvinfer1::plugin::ProposalLayer::mBackgroundLabel
int mBackgroundLabel
Definition: proposalLayerPlugin.h:97
nvinfer1::plugin::PriorBox::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: priorBoxPlugin.cpp:239
nmtSample::BeamSearchPolicy::mBeamWidth
int mBeamWidth
Definition: beamSearchPolicy.h:68
SampleBuffer::buffer
uint8_t * buffer
Definition: sampleReformatFreeIO.cpp:191
nvinfer1::IInt8MinMaxCalibrator
MinMax Calibrator.
Definition: NvInfer.h:6093
BertInference::mOutputSize
size_t mOutputSize
Definition: bert_infer.h:350
tests.quant_pooling_test.TestQuantAdaptiveAvgPool2d.test_input_fake_quant
def test_input_fake_quant(self)
Definition: quant_pooling_test.py:249
nvinfer1::plugin::ProposalLayer::mType
nvinfer1::DataType mType
Definition: proposalLayerPlugin.h:110
SampleCharRNNv2::addLSTMLayers
nvinfer1::ILayer * addLSTMLayers(SampleCharRNNBase::SampleUniquePtr< nvinfer1::INetworkDefinition > &network) final
Add inputs to the TensorRT network and configure LSTM layers using network definition API.
Definition: sampleCharRNN.cpp:662
bert::xmmasM128
constexpr size_t xmmasM128
Definition: embLayerNormVarSeqlenPlugin.cpp:37
SampleConfig::mOutputFilename
std::string mOutputFilename
Definition: sampleConfig.h:42
SamplePlugin::runtimePluginFactory
PluginFactory runtimePluginFactory
Definition: samplePlugin.cpp:116
nvinfer1::plugin::GridAnchorGenerator::copyToDevice
Weights copyToDevice(const void *hostData, size_t count)
Definition: gridAnchorPlugin.cpp:254
nvinfer1::CalibrationAlgoType::kENTROPY_CALIBRATION
@ kENTROPY_CALIBRATION
pytorch_quantization.nn.modules._utils.QuantMixin._weight_quantizers
_weight_quantizers
Definition: _utils.py:84
model.fused_add_tanh_sigmoid_multiply
def fused_add_tanh_sigmoid_multiply(input_a, input_b, n_channels)
Definition: demo/Tacotron2/waveglow/model.py:23
bert::EmbLayerNormVarSeqlenPluginCreator::deserializePlugin
nvinfer1::IPluginV2 * deserializePlugin(const char *name, const void *serialData, size_t serialLength) override
Called during deserialization of plugin layer.
Definition: embLayerNormVarSeqlenPlugin.cpp:506
tests.quant_conv_test.TestQuantConv3D.test_fake_quant_quant_per_channel_other_prec
def test_fake_quant_quant_per_channel_other_prec(self)
Definition: quant_conv_test.py:482
nvinfer1::plugin::ProposalPlugin::mLayerName
const std::string mLayerName
Definition: proposalPlugin.h:107
nmtSample::LimitedSamplesDataReader::reset
void reset() override
Reset the reader position; the data reader is ready to read the data from the beginning again after t...
Definition: limitedSamplesDataReader.cpp:41
tests.quant_conv_test.TestQuantConv3D
Definition: quant_conv_test.py:460
EntropyCalibratorImpl::mStream
TBatchStream mStream
Definition: EntropyCalibrator.h:87
BertInference
Definition: bert_infer.h:33
half_float::detail::enable< T, half, half, expr >::type
T type
Definition: ieee_half.h:380
tests.quant_conv_test.TestQuantConv1D.test_weight_fake_quant_per_tensor
def test_weight_fake_quant_per_tensor(self)
Definition: quant_conv_test.py:288
nvinfer1::INetworkDefinition::addConvolutionNd
virtual IConvolutionLayer * addConvolutionNd(ITensor &input, int32_t nbOutputMaps, Dims kernelSize, Weights kernelWeights, Weights biasWeights)=0
Add a multi-dimension convolution layer to the network.
SampleUffFasterRcnnParams::nbCalBatches
int nbCalBatches
Definition: sampleUffFasterRCNN.cpp:80
tools.test_run.TestOther.test_save_load_outputs
def test_save_load_outputs(self, tmp_path)
Definition: test_run.py:243
nvonnxparser::IParser
An object for parsing ONNX models into a TensorRT network definition.
Definition: NvOnnxParser.h:119
sample::InferenceTime
Measurement times in milliseconds.
Definition: sampleReporting.h:34
nvinfer1::TopKOperation::kMAX
@ kMAX
Maximum of the elements.
TLTMaskRCNNConfig::RPN_NMS_THRESHOLD
static const float RPN_NMS_THRESHOLD
Definition: tlt_mrcnn_config.h:80
SampleBuffer::desc
BufferDesc desc
Definition: sampleReformatFreeIO.cpp:189
bert::cuSeqlensToPackedMask
void cuSeqlensToPackedMask(const uint32_t S, const uint32_t B, const uint32_t warps_m, const uint32_t warps_n, const uint32_t warps_k, const int *cuSeqlens, uint32_t *inputMaskX, cudaStream_t stream)
Semaphore::mSemName
const char * mSemName
Definition: sampleMovieLensMPS.cpp:114
sample::anonymous_namespace{sampleInference.cpp}::Iteration::moveNext
void moveNext()
Definition: sampleInference.cpp:382
polygraphy.backend.onnxrt.loader
Definition: backend/onnxrt/loader.py:1
nmtSample::Vocabulary::mSosId
int mSosId
Definition: vocabulary.h:76
pytorch_quantization.nn.modules.quant_rnn.QuantRNN
Definition: modules/quant_rnn.py:269
onnx_graphsurgeon.importers.onnx_importer.get_onnx_tensor_dtype
np.dtype get_onnx_tensor_dtype(Union[onnx.ValueInfoProto, onnx.TensorProto] onnx_tensor)
Definition: onnx_importer.py:59
nvinfer1::plugin::TRTException::line
int line
Definition: checkMacrosPlugin.h:95
builder_varseqlen.load_tf_weights
def load_tf_weights(inputbase, config)
Definition: builder_varseqlen.py:332
samplesCommon::roundUp
int roundUp(int m, int n)
Definition: samples/common/common.h:876
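The entry above carries no description. Judging only by the name and signature, it appears to round `m` up to a multiple of `n`. The following Python sketch illustrates that assumed behavior; the function name `round_up` and the error handling are hypothetical and are not taken from the sample sources.

```python
def round_up(m: int, n: int) -> int:
    """Round m up to the nearest multiple of n (assumed semantics)."""
    if n <= 0:
        raise ValueError("n must be positive")
    # Integer ceiling division, then scale back up to a multiple of n.
    return ((m + n - 1) // n) * n


print(round_up(10, 8))  # 16
print(round_up(16, 8))  # 16
```

Helpers of this kind are commonly used to pad buffer sizes or dimensions to an alignment boundary.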
samplesCommon::GenericBuffer::GenericBuffer
GenericBuffer(nvinfer1::DataType type=nvinfer1::DataType::kFLOAT)
Construct an empty buffer.
Definition: buffers.h:58
polygraphy.tools.args.onnx.loader.OnnxLoaderArgs.exclude_outputs
exclude_outputs
Definition: tools/args/onnx/loader.py:68
sample::InferenceTime::InferenceTime
InferenceTime()=default
tests.quant_conv_transposed_test.TestQuantConvTranspose2D.test_fake_quant_input
def test_fake_quant_input(self)
Definition: quant_conv_transposed_test.py:109
nvinfer1::LayerType::kREDUCE
@ kREDUCE
Reduce layer.
MultilevelProposeROIWorkSpace::preRefineBboxDims
nvinfer1::DimsHW preRefineBboxDims
Definition: maskRCNNKernels.h:163
nvinfer1::plugin::detail::find_log_2
int find_log_2(int x, bool round_up=false)
Definition: reducedMathPlugin.cpp:39
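No description is given for this helper. The signature suggests an integer base-2 logarithm that is rounded down by default and rounded up when `round_up` is set. The Python sketch below shows one way to compute that under this assumption; it is not the plugin's implementation, and the input check is an addition made for illustration.

```python
def find_log_2(x: int, round_up: bool = False) -> int:
    """Integer log2 of a positive x: floor by default, ceiling if round_up."""
    if x <= 0:
        raise ValueError("x must be positive")
    # bit_length() - 1 is the index of the highest set bit, i.e. floor(log2(x)).
    result = x.bit_length() - 1
    # x & (x - 1) is non-zero exactly when x is not a power of two.
    if round_up and (x & (x - 1)) != 0:
        result += 1
    return result


print(find_log_2(9))        # 3
print(find_log_2(9, True))  # 4
```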
SampleMovieLensArgs
Definition: sampleMovieLens.cpp:522
SampleConfig::mInputDataFormat
InputDataFormat mInputDataFormat
Definition: sampleConfig.h:56
nvinfer1::utility::validateTRTOptions
std::string validateTRTOptions(const std::vector< TRTOption > &options)
validateTRTOptions will return a string containing an error message if any options contain non-numeri...
Definition: getOptions.cpp:116
builder_varseqlen.BertConfig.use_fp16
use_fp16
Definition: builder_varseqlen.py:98
polygraphy.tools.args.tf.loader.TfLoaderArgs.dynamic_op
dynamic_op
Definition: tools/args/tf/loader.py:80
text.sequence_to_text
def sequence_to_text(sequence)
Definition: demo/Tacotron2/tacotron2/text/__init__.py:43
polygraphy.backend.trt.runner.TrtRunner.stream
stream
Definition: backend/trt/runner.py:101
nmtSample::DeviceBuffer::mBuffer
T * mBuffer
Definition: deviceBuffer.h:57
backend.onnxrt.test_runner.TestLoggerCallbacks.test_set_severity
def test_set_severity(self, sev)
Definition: onnxrt/test_runner.py:26
samplesCommon::BufferManager::dumpBuffer
void dumpBuffer(std::ostream &os, const std::string &tensorName)
Dump host buffer with specified tensorName to ostream.
Definition: buffers.h:325
Logger::reportPass
static int reportPass(const TestAtom &testAtom)
Definition: pluginLogging.h:368
tests.quant_conv_transposed_test.TestQuantConvTranspose1D.test_fake_quant_input
def test_fake_quant_input(self)
Definition: quant_conv_transposed_test.py:395
nvinfer1::plugin::SpecialSlice
Definition: specialSlicePlugin.h:34
polygraphy.backend.trt.calibrator.Calibrator
def Calibrator(data_loader, cache=None, BaseClass=trt.IInt8MinMaxCalibrator, batch_size=None)
Definition: tools/Polygraphy/polygraphy/backend/trt/calibrator.py:26
polygraphy.logger.logger.LoggerVerbosity.logger
logger
Definition: Polygraphy/polygraphy/logger/logger.py:60
nmtSample::SLPProjection::SLPProjection
SLPProjection(ComponentWeights::ptr weights)
Definition: slpProjection.cpp:25
nvinfer1::plugin::NMSPluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: nmsPlugin.cpp:287
SampleConfig::getModelDtype
nvinfer1::DataType getModelDtype() const
Get the Model Data Type.
Definition: sampleConfig.h:89
nvinfer1::plugin::PyramidROIAlign::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: pyramidROIAlignPlugin.cpp:117
bert::fused_multihead_attention_v2_int8_256_64_kernel_sm75_cubin_len
unsigned int fused_multihead_attention_v2_int8_256_64_kernel_sm75_cubin_len
Definition: fused_multihead_attention_v2_int8_256_64_kernel.sm75.cpp:13968
model.WaveGlow.n_flows
n_flows
Definition: demo/Tacotron2/waveglow/model.py:165
nvcaffeparser1::BlobNameToTensor::setTensorNames
void setTensorNames()
Definition: blobNameToTensor.h:51
builder_varseqlen.skipln
def skipln(prefix, config, init_dict, network, input_tensor, skip)
Definition: builder_varseqlen.py:183
AlgorithmCacheReader
Replicates the algorithm selection using a cache file.
Definition: sampleAlgorithmSelector.cpp:120
polygraphy.backend.trt.runner.TrtRunner.owns_context
owns_context
Definition: backend/trt/runner.py:81
nvinfer1::plugin::DetectionOutput::getWorkspaceSize
size_t getWorkspaceSize(int maxBatchSize) const override
Definition: nmsPlugin.cpp:96
half_float::detail::functions::fmod
static expr fmod(float x, float y)
Modulo implementation.
Definition: samples/common/half.h:1648
nvinfer1::plugin::GroupNormalizationPluginCreator::mFC
static PluginFieldCollection mFC
Definition: groupNormalizationPlugin.h:141
sample::anonymous_namespace{sampleEngines.cpp}::CaffeBufferShutter::~CaffeBufferShutter
~CaffeBufferShutter()
Definition: sampleEngines.cpp:45
polygraphy.tools.run.run.generate_summary
def generate_summary(model_file, runners, load_results)
Definition: run.py:56
nvcaffeparser1::IBinaryProtoBlob::getData
virtual const void * getData()=0
nvinfer1::plugin::PriorBox::initialize
int initialize() override
Initialize the layer for execution.
Definition: priorBoxPlugin.cpp:163
nvinfer1::plugin::MultilevelProposeROI::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: multilevelProposeROIPlugin.cpp:199
nvinfer1::plugin::LogStream::LogStream
LogStream()
Definition: checkMacrosPlugin.h:58
tests.quant_pooling_test.TestQuantAvgPool2d.test_input_variable_bits
def test_input_variable_bits(self)
Definition: quant_pooling_test.py:170
polygraphy.tools.args.tf.loader.TfLoaderArgs.minimum_segment_size
minimum_segment_size
Definition: tools/args/tf/loader.py:79
backend.onnx.test_runner.TestOnnxTfRunner.test_basic
def test_basic(self)
Definition: onnx/test_runner.py:28
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer.init_learn_amax
def init_learn_amax(self)
Definition: tensor_quantizer.py:245
nvinfer1::UnaryOperation::kSIN
@ kSIN
Sine.
nvinfer1::plugin::BatchedNMSPlugin
Definition: batchedNMSPlugin.h:32
sample::UffInput
Definition: sampleOptions.h:84
onnx_models.Model.inputs
inputs
Definition: onnx_models.py:37
helpers.calibrator.BertCalibrator.get_regression_cutoff
def get_regression_cutoff(self)
Definition: demo/BERT/helpers/calibrator.py:104
SampleUffMNIST::infer
bool infer()
Runs the TensorRT inference engine for this sample.
Definition: sampleUffMNIST.cpp:268
sample::TrtUniquePtr
std::unique_ptr< T, TrtDestroyer< T > > TrtUniquePtr
Definition: sampleUtils.h:466
tracer.hasTimestamp
def hasTimestamp(metrics)
Definition: tracer.py:64
nvinfer1::ErrorCode::kUNSPECIFIED_ERROR
@ kUNSPECIFIED_ERROR
An error that does not fall into any other category.
nvinfer1::plugin::CropAndResizePlugin::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: cropAndResizePlugin.cpp:103
nvinfer1::plugin::MultilevelProposeROI::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: multilevelProposeROIPlugin.cpp:138
onnx_graphsurgeon.ir.tensor.Tensor.shape
shape
Definition: tensor.py:87
samplesCommon::BufferManager::print
void print(std::ostream &os, void *buf, size_t bufSize, size_t rowCount)
Templated print function that dumps buffers of arbitrary type to std::ostream.
Definition: buffers.h:369
nvinfer1::plugin::MultilevelCropAndResize::check_valid_inputs
void check_valid_inputs(const nvinfer1::Dims *inputs, int nbInputDims)
Definition: multilevelCropAndResizePlugin.cpp:155
pytorch_quantization.nn.modules.quant_pooling.QuantAvgPool3d.__init__
def __init__(self, kernel_size, stride=None, padding=0, ceil_mode=False, count_include_pad=True, divisor_override=None, **kwargs)
Definition: quant_pooling.py:105
nvcaffeparser1::BinaryProtoBlob::getData
const void * getData() override
Definition: binaryProtoBlob.h:46
Logger::getTRTLogger
nvinfer1::ILogger & getTRTLogger()
Forward-compatible method for retrieving the nvinfer1::ILogger associated with this Logger.
Definition: pluginLogging.h:254
polygraphy.tools.inspect.inspect.Inspect.__init__
def __init__(self)
Definition: inspect.py:24
polygraphy.backend.onnx.util.get_shape
def get_shape(tensor)
Definition: onnx/util.py:117
std::numeric_limits< half_float::half >::min
static half_float::half min()
Smallest positive normal value.
Definition: samples/common/half.h:4216
polygraphy.tools.util.script.Script.add_loader
def add_loader(self, loader_str, loader_id, suffix=None)
Definition: script.py:160
nvinfer1::EnumMax< TripLimit >
constexpr int32_t EnumMax< TripLimit >()
Maximum number of elements in TripLimit enum.
Definition: NvInfer.h:4516
pytorch_quantization.calib.max.MaxCalibrator
Definition: max.py:25
bert::threadsPerCta256
constexpr size_t threadsPerCta256
Definition: embLayerNormVarSeqlenPlugin.cpp:33
polygraphy.logger
Definition: tools/Polygraphy/polygraphy/logger/__init__.py:1
nvinfer1::INetworkDefinition::addReduce
virtual IReduceLayer * addReduce(ITensor &input, ReduceOperation operation, uint32_t reduceAxes, bool keepDimensions)=0
Add a reduce layer to the network.
polygraphy.backend.trt.loader.NetworkFromOnnxPath.__init__
def __init__(self, path, explicit_precision=None)
Definition: backend/trt/loader.py:141
ParserOnnxConfig::setVerbosityLevel
virtual void setVerbosityLevel(nvonnxparser::IOnnxConfig::Verbosity verbosity)
Set to a specific verbosity level.
Definition: parserOnnxConfig.h:107
classification_flow.calibrate_model
def calibrate_model(model, model_name, data_loader, num_calib_batch, calibrator, hist_percentile, out_dir)
Definition: classification_flow.py:333
bert::QKVToContextInterleavedPluginCreator::mNamespace
std::string mNamespace
Definition: qkvToContextInt8InterleavedPlugin.h:125
tests.tensor_quant_test.TestFakeAffineTensorQuant.test_clip_gradient
def test_clip_gradient(self)
Definition: tensor_quant_test.py:364
trt_utils.engine_info
def engine_info(engine_filepath)
Definition: trt_utils.py:52
anonymous_namespace{cropAndResizePlugin.cpp}::CROP_AND_RESIZE_PLUGIN_VERSION
static const char * CROP_AND_RESIZE_PLUGIN_VERSION
Definition: cropAndResizePlugin.cpp:31
polygraphy.util.cuda.Cuda.MemcpyKind
Definition: cuda.py:25
polygraphy.tools.args.trt.loader.TrtLoaderArgs.add_trt_network_loader
def add_trt_network_loader(self, script)
Definition: tools/args/trt/loader.py:116
nvcaffeparser1::WeightType::kNVMEAN
@ kNVMEAN
util.test_misc.test_volume
def test_volume(case)
Definition: test_misc.py:29
Int8EntropyCalibrator2::mImpl
EntropyCalibratorImpl< TBatchStream > mImpl
Definition: EntropyCalibrator.h:132
FCPlugin::mCublas
cublasHandle_t mCublas
Definition: samples/opensource/samplePlugin/fcPlugin.h:316
nvinfer1::plugin::DetectionLayer::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: detectionLayerPlugin.cpp:175
SampleINT8
The SampleINT8 class implements the INT8 sample.
Definition: sampleINT8.cpp:58
onnx_graphsurgeon.ir.graph.Graph._get_node_id
def _get_node_id(self, node)
Definition: graph.py:174
nvinfer1::plugin::GroupNormalizationPlugin::supportsFormatCombination
bool supportsFormatCombination(int pos, const nvinfer1::PluginTensorDesc *inOut, int nbInputs, int nbOutputs) override
Definition: groupNormalizationPlugin.cpp:173
nvinfer1::plugin::FlattenConcatPluginCreator::deserializePlugin
IPluginV2Ext * deserializePlugin(const char *name, const void *serialData, size_t serialLength) override
Called during deserialization of plugin layer.
Definition: flattenConcat.cpp:339
nvinfer1::plugin::ResizeNearest
Definition: resizeNearestPlugin.h:34
SampleMLP::addMLPLayer
nvinfer1::ILayer * addMLPLayer(nvinfer1::INetworkDefinition *network, nvinfer1::ITensor &inputTensor, int32_t hiddenSize, nvinfer1::Weights wts, nvinfer1::Weights bias, nvinfer1::ActivationType actType, int idx)
Add an MLP layer.
Definition: sampleMLP.cpp:490
nvinfer1::plugin::GenerateDetectionPluginCreator::mIOUThreshold
float mIOUThreshold
Definition: generateDetectionPlugin.h:134
pytorch_quantization.nn.modules.quant_conv.QuantConv1d.padding_mode
padding_mode
Definition: quant_conv.py:208
nvinfer1::plugin::GroupNormalizationPlugin::GroupNormalizationPlugin
GroupNormalizationPlugin()=delete
nvinfer1::plugin::BatchedNMSBasePluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: batchedNMSPlugin.cpp:506
pytorch_quantization.tensor_quant.TensorQuantFunction
Definition: tensor_quant.py:236
tests.tensor_quantizer_test.TestTensorQuantizer.test_simple_run_no_fake
def test_simple_run_no_fake(self)
Definition: tensor_quantizer_test.py:47
sample::Bindings::dumpBindings
void dumpBindings(const nvinfer1::IExecutionContext &context, std::ostream &os) const
Definition: sampleUtils.h:395
bert::QKVToContextInterleavedPlugin::mHiddenSize
int mHiddenSize
Definition: qkvToContextInt8InterleavedPlugin.h:85
EntropyCalibratorImpl::readCalibrationCache
const void * readCalibrationCache(size_t &length)
Definition: EntropyCalibrator.h:66
nvinfer1::plugin::InstanceNormalizationPlugin::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: instanceNormalizationPlugin.cpp:160
text.cmudict.CMUDict.__init__
def __init__(self, file_or_path, keep_ambiguous=True)
Definition: cmudict.py:36
pytorch_quantization.nn.modules.quant_linear.QuantLinear.__init__
def __init__(self, in_features, out_features, bias=True, **kwargs)
Definition: quant_linear.py:62
nvinfer1::plugin::SplitPlugin::getWorkspaceSize
size_t getWorkspaceSize(const nvinfer1::PluginTensorDesc *, int, const nvinfer1::PluginTensorDesc *, int) const TRTNOEXCEPT override
Definition: split.h:114
anonymous_namespace{serialize.hpp}::Serializer< const char * >::serialized_size
static size_t serialized_size(const char *value)
Definition: serialize.hpp:68
nmtSample::ComponentWeights::mMetaData
std::vector< int > mMetaData
Definition: componentWeights.h:41
polygraphy.backend.pyt.loader
Definition: backend/pyt/loader.py:1
nvinfer1::plugin::DetectionLayerPluginCreator
Definition: detectionLayerPlugin.h:108
util.test_cuda.ResizeTestCase.old
old
Definition: test_cuda.py:27
UffPoolPluginV2::UffPoolPluginV2
UffPoolPluginV2(const void *data, size_t length)
Definition: sampleUffPluginV2Ext.cpp:354
SampleFasterRCNN::nonMaximumSuppression
std::vector< int > nonMaximumSuppression(std::vector< std::pair< float, int >> &scoreIndex, float *bbox, const int classNum, const int numClasses, const float nmsThreshold)
Performs non maximum suppression on final bounding boxes.
Definition: sampleFasterRCNN.cpp:419
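As a rough illustration of non-maximum suppression in general, the Python sketch below implements the usual greedy variant for a single class. It is not the sample's code: the `(x1, y1, x2, y2)` box layout, the IoU overlap measure, and the names `iou` and `non_maximum_suppression` are assumptions made for this example.

```python
from typing import List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2), assumed layout


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def non_maximum_suppression(
    scores: Sequence[float], boxes: Sequence[Box], nms_threshold: float
) -> List[int]:
    """Return indices of kept boxes, highest score first."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        # Keep a box only if it does not overlap too much with any kept box.
        if all(iou(boxes[i], boxes[j]) <= nms_threshold for j in keep):
            keep.append(i)
    return keep


boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
print(non_maximum_suppression(scores, boxes, 0.5))  # [0, 2]
```

The second box overlaps the first with an IoU of 81/119, which exceeds the 0.5 threshold, so it is suppressed.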
SampleFasterRCNN::teardown
bool teardown()
Cleans up any state created in the sample class.
Definition: sampleFasterRCNN.cpp:247
nvinfer1::IMatrixMultiplyLayer::~IMatrixMultiplyLayer
virtual ~IMatrixMultiplyLayer()
Definition: NvInfer.h:4224
sample::anonymous_namespace{sampleInference.cpp}::StreamType::kCOMPUTE
@ kCOMPUTE
tests.quant_conv_test.TestQuantConv2D.test_weight_fake_quant_per_tensor
def test_weight_fake_quant_per_tensor(self)
Definition: quant_conv_test.py:71
nvcaffeparser1::CaffeParser::destroy
void destroy() override
Destroy this ICaffeParser object.
Definition: caffeParser.h:58
generate.mul
def mul(self, a, b)
Definition: 07_creating_a_model_with_the_layer_api/generate.py:46
nvinfer1::plugin::detail::clz
int clz(int x)
Definition: reducedMathPlugin.cpp:25
sample::InferenceOptions::iterations
int iterations
Definition: sampleOptions.h:149
nvinfer1::plugin::BaseCreator::setPluginNamespace
void setPluginNamespace(const char *libNamespace) override
Set the namespace of the plugin creator based on the plugin library it belongs to.
Definition: plugin.h:63
bert::Fused_multihead_attention_params_v2::ignore_b1opt
bool ignore_b1opt
Definition: fused_multihead_attention_v2.h:69
half_float::detail::scalbln
half scalbln(half arg, long exp)
Multiply by power of two.
Definition: ieee_half.h:2659
nvinfer1::IPluginRegistry::getPluginCreator
virtual IPluginCreator * getPluginCreator(const char *pluginType, const char *pluginVersion, const char *pluginNamespace="") noexcept=0
Return plugin creator based on plugin type, version and namespace associated with plugin during netwo...
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer.disable
def disable(self)
Definition: tensor_quantizer.py:150
nvinfer1::plugin::CoordConvACPlugin::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: coordConvACPlugin.cpp:69
half_float::detail::expm1
expr expm1(half arg)
Exponential minus one.
Definition: ieee_half.h:2333
nvinfer1::plugin::FlattenConcat::mConcatAxisID
int mConcatAxisID
Definition: flattenConcat.h:102
tests.tensor_quant_test.TestQuantDescriptor.test_amax
def test_amax(self)
Definition: tensor_quant_test.py:310
SampleMNIST::SampleUniquePtr
std::unique_ptr< T, samplesCommon::InferDeleter > SampleUniquePtr
Definition: sampleMNIST.cpp:51
SampleMLP::SampleUniquePtr
std::unique_ptr< T, samplesCommon::InferDeleter > SampleUniquePtr
Definition: sampleMLP.cpp:60
generate.C
C
Definition: 07_creating_a_model_with_the_layer_api/generate.py:95
SampleINT8::calculateScore
int calculateScore(const samplesCommon::BufferManager &buffers, float *labels, int batchSize, int outputSize, int threshold)
Scores model.
Definition: sampleINT8.cpp:380
tools.test_run.TestTrt.test_trt_input_shape
def test_trt_input_shape(self)
Definition: test_run.py:102
preprocess_audio2mel.parse_args
def parse_args(parser)
Definition: preprocess_audio2mel.py:23
sample::Bindings::transferOutputToHost
void transferOutputToHost(TrtCudaStream &stream)
Definition: sampleUtils.h:350
MaskRCNNUtils::PPM::h
int h
Definition: sampleUffMaskRCNN.cpp:75
half_float::detail::round_half
uint16 round_half(uint16 value)
Round half-precision number to nearest integer value.
Definition: ieee_half.h:1041
nmtSample::SequenceProperties::ptr
std::shared_ptr< SequenceProperties > ptr
Definition: sequenceProperties.h:32
nvinfer1::IElementWiseLayer
An elementwise layer in a network definition.
Definition: NvInfer.h:2644
nvcaffeparser1::CaffeWeightFactory::isOK
bool isOK()
Definition: caffeWeightFactory.cpp:184
nvinfer1::plugin::GroupNormalizationPluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: groupNormalizationPlugin.cpp:265
tools.test_run.TestOnnxrt.test_onnx_rt_layerwise_outputs
def test_onnx_rt_layerwise_outputs(self)
Definition: test_run.py:202
UffPoolPluginV2::getWorkspaceSize
size_t getWorkspaceSize(int maxBatchSize) const override
Definition: sampleUffPluginV2Ext.cpp:419
nvinfer1::plugin::ResizeNearest::canBroadcastInputAcrossBatch
bool canBroadcastInputAcrossBatch(int inputIndex) const override
Definition: resizeNearestPlugin.cpp:234
polygraphy.logger.logger.LoggerIndent.__init__
def __init__(self, logger, indent)
Definition: Polygraphy/polygraphy/logger/logger.py:44
tools.test_args.TestParseMeta.test_parse_shape_single_dim
def test_parse_shape_single_dim(self)
Definition: test_args.py:14
samplesCommon::DeviceFree
Definition: buffers.h:188
EntropyCalibratorImpl::EntropyCalibratorImpl
EntropyCalibratorImpl(BatchStream &stream, int firstBatch, const std::string &networkName, const char *inputBlobName, bool readCache=true)
Definition: frcnnUtils.h:434
backend.trt.test_loader.TestSerializedEngineLoader
Definition: trt/test_loader.py:72
nvinfer1::plugin::Normalize::canBroadcastInputAcrossBatch
bool canBroadcastInputAcrossBatch(int inputIndex) const override
Definition: normalizePlugin.cpp:190
nvinfer1::plugin::DetectionLayer::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: detectionLayerPlugin.cpp:148
bert::QKVToContextInterleavedPlugin::mS
int mS
Definition: qkvToContextInt8InterleavedPlugin.h:81
SampleOnnxMnistCoordConvAC::build
bool build()
Function builds the network engine.
Definition: sampleOnnxMnistCoordConvAC.cpp:109
SampleOnnxMNIST::SampleOnnxMNIST
SampleOnnxMNIST(const samplesCommon::OnnxSampleParams &params)
Definition: sampleOnnxMNIST.cpp:52
sample::fillBuffer
void fillBuffer(void *buffer, int volume, T min, T max)
Definition: sampleUtils.h:165
builder.BertConfig.use_int8
use_int8
Definition: builder.py:98
sample::Profiler
Collect per-layer profile information, assuming times are reported in the same order.
Definition: sampleReporting.h:168
SampleCharRNNLoop::addLSTMCell
nvinfer1::ILayer * addLSTMCell(SampleUniquePtr< nvinfer1::INetworkDefinition > &network, const LstmIO &inputTensors, nvinfer1::ITensor *sequenceSize, const LstmParams &params, LstmIO &outputTensors)
Definition: sampleCharRNN.cpp:485
nvinfer1::plugin::GenerateDetectionPluginCreator::GenerateDetectionPluginCreator
GenerateDetectionPluginCreator()
Definition: generateDetectionPlugin.cpp:38
polygraphy.comparator.compare.OutputCompareResult.__init__
def __init__(self, passed, max_absdiff, max_reldiff)
Definition: compare.py:29
MNISTBatchStream::MNISTBatchStream
MNISTBatchStream(int batchSize, int maxBatches, const std::string &dataFile, const std::string &labelsFile, const std::vector< std::string > &directories)
Definition: BatchStream.h:43
sample::AllOptions::build
BuildOptions build
Definition: sampleOptions.h:189
inference_varseqlen.print_single_query
def print_single_query(eval_time_elapsed, prediction, nbest_json)
Definition: inference_varseqlen.py:209
polygraphy.backend.cntk.runner.CNTKRunner.inputs
inputs
Definition: backend/cntk/runner.py:34
SampleBuffer::dataWidth
int dataWidth
Definition: sampleReformatFreeIO.cpp:185
SampleINT8APIArgs::networkTensorsFileName
std::string networkTensorsFileName
Definition: sampleINT8API.cpp:655
MaskRCNNConfig::MAX_PRE_NMS_RESULTS
static const int MAX_PRE_NMS_RESULTS
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:76
nvcaffeparser1::CaffeWeightFactory::CaffeWeightFactory
CaffeWeightFactory(const trtcaffe::NetParameter &msg, nvinfer1::DataType dataType, std::vector< void * > &tmpAllocs, bool isInitialized)
Definition: caffeWeightFactory.cpp:59
nvinfer1::plugin::BatchedNMSPlugin::canBroadcastInputAcrossBatch
bool canBroadcastInputAcrossBatch(int inputIndex) const override
Definition: batchedNMSPlugin.cpp:464
sample::ReportingOptions::percentile
float percentile
Definition: sampleOptions.h:173
polygraphy.tools.args.trt_legacy.TrtLegacyArgs.add_to_script
def add_to_script(self, script)
Definition: tools/args/trt_legacy.py:67
nvinfer1::CUDADriverWrapper::cuFuncSetAttribute
CUresult cuFuncSetAttribute(CUfunction hfunc, CUfunction_attribute attrib, int value) const
Definition: cudaDriverWrapper.cpp:76
nvinfer1::plugin::CoordConvACPluginCreator::deserializePlugin
IPluginV2Ext * deserializePlugin(const char *name, const void *serialData, size_t serialLength) override
Called during deserialization of plugin layer.
Definition: coordConvACPlugin.cpp:202
polygraphy.tools.util.script.invoke_impl
def invoke_impl(type_str, *args, **kwargs)
Definition: script.py:44
tests.quant_pooling_test.TestQuantAdaptiveAvgPool2d.test_raise
def test_raise(self)
Definition: quant_pooling_test.py:241
samplesCommon::swapEndianness
T swapEndianness(const T &value)
Definition: samples/common/common.h:296
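The entry has no description. From the name, the function presumably reverses the byte order of a value. The Python sketch below shows that idea for unsigned integers of a fixed width; the name `swap_endianness` and the `width` parameter are hypothetical and only stand in for the C++ template parameter `T`.

```python
def swap_endianness(value: int, width: int = 4) -> int:
    """Reverse the byte order of an unsigned integer that is `width` bytes wide."""
    # Serialize as little-endian, then reinterpret the same bytes as big-endian.
    return int.from_bytes(value.to_bytes(width, "little"), "big")


print(hex(swap_endianness(0x12345678)))       # 0x78563412
print(hex(swap_endianness(0x1234, width=2)))  # 0x3412
```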
onnx_graphsurgeon.ir.graph.NodeIDAdder.__enter__
def __enter__(self)
Definition: graph.py:48
polygraphy.tools.surgeon.subtool.insert
Definition: insert.py:1
test_ir.TestConstant.test_can_get_dtype
def test_can_get_dtype(self)
Definition: test_ir.py:130
SampleUffSSD
The SampleUffSSD class implements the SSD sample.
Definition: sampleUffSSD.cpp:63
nvonnxparser::IParserError
An object containing information about an error.
Definition: NvOnnxParser.h:89
polygraphy.tools.args.tf.loader.TfLoaderArgs.add_to_script
def add_to_script(self, script, disable_outputs=None, suffix=None)
Definition: tools/args/tf/loader.py:83
nvinfer1::plugin::ProposalLayerPluginCreator::ProposalLayerPluginCreator
ProposalLayerPluginCreator()
Definition: proposalLayerPlugin.cpp:39
SampleINT8Params
The SampleINT8Params structure groups the additional parameters required by the INT8 sample.
Definition: sampleINT8.cpp:47
helpers.tokenization.WordpieceTokenizer.unk_token
unk_token
Definition: tokenization.py:335
nvinfer1::TacticSource::kCUBLAS_LT
@ kCUBLAS_LT
cuBLAS LT tactics
bert::fused_multihead_attention_int8_128_64_kernel_sm75_cu_o_len
unsigned int fused_multihead_attention_int8_128_64_kernel_sm75_cu_o_len
Definition: fused_multihead_attention_int8_128_64_kernel.sm75.cpp:3590
SimpleProfiler::Record
Definition: samples/common/common.h:141
polygraphy.logger.logger.Logger.finish
def finish(self, message, mode=LogMode.EACH)
Definition: Polygraphy/polygraphy/logger/logger.py:274
nvinfer1::IRaggedSoftMaxLayer::~IRaggedSoftMaxLayer
virtual ~IRaggedSoftMaxLayer()
Definition: NvInfer.h:4244
nvcaffeparser1::ICaffeParser::parseBuffers
virtual const IBlobNameToTensor * parseBuffers(const char *deployBuffer, std::size_t deployLength, const char *modelBuffer, std::size_t modelLength, nvinfer1::INetworkDefinition &network, nvinfer1::DataType weightType)=0
Parse a deploy prototxt and a binaryproto Caffe model from memory buffers to extract network definition a...
nvinfer1::plugin::LReluPluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: lReluPlugin.cpp:140
comparator.test_data_loader.TestDataLoader.test_range_min_max_equal
def test_range_min_max_equal(self)
Definition: test_data_loader.py:42
polygraphy.backend.onnx.runner.OnnxTfRunner.deactivate_impl
def deactivate_impl(self)
Definition: backend/onnx/runner.py:46
polygraphy.tools.args.comparator.ComparatorRunArgs
Definition: tools/args/comparator.py:23
half_float::detail::functions::isequal
static bool isequal(half x, half y)
Comparison implementation.
Definition: samples/common/half.h:2388
polygraphy.backend.trt.util.str_from_engine
def str_from_engine(engine)
Definition: trt/util.py:389
samplesCommon::CpuTimer::clock_type
Clock clock_type
Definition: samples/common/common.h:843
nvinfer1::plugin::SplitPlugin::enqueue
int enqueue(const PluginTensorDesc *inputDesc, const PluginTensorDesc *outputDesc, const void *const *inputs, void *const *outputs, void *workspace, cudaStream_t stream) TRTNOEXCEPT override
Execute the layer.
nvinfer1::DataType::kFLOAT
@ kFLOAT
32-bit floating point format.
tests.print_test.TestPrint.conv
conv
Definition: print_test.py:47
nvinfer1::plugin::reduced_divisor::reduced_divisor
reduced_divisor()
Definition: reducedMathPlugin.h:57
tools.test_args.TestParseMeta
Definition: test_args.py:6
polygraphy.backend.tf.runner.TfRunner.__init__
def __init__(self, sess, timeline_dir=None, name=None)
Definition: backend/tf/runner.py:34
test_ir.TensorBaseTests.test_can_convert_in_place_to_variable
def test_can_convert_in_place_to_variable(self)
Definition: test_ir.py:42
BertInference::run
void run(int profIdx, int batchSize, const void *inputIds, const void *segmentIds, const void *inputMask, int warmUps, int iterations)
Definition: bert_infer.h:297
polygraphy.comparator.compare.OutputCompareResult.max_absdiff
max_absdiff
Definition: compare.py:39
nvinfer1::plugin::ReorgPluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: reorgPlugin.h:112
bert::FusedMultiHeadAttentionKernelMetaInfoV1::mSM
unsigned int mSM
Definition: fused_multihead_attention.h:120
polygraphy.backend.trt.util.check_outputs_not_found
def check_outputs_not_found(not_found, available_outputs)
Definition: trt/util.py:190
half_float::detail::half_caster< half, U, R >::cast
static half cast(U arg)
Definition: samples/common/half.h:2622
sample::Logger::TestAtom::mName
std::string mName
Definition: samples/common/logging.h:282
builder.BertConfig.hidden_size
hidden_size
Definition: builder.py:93
infer.DeviceBuffer.buf
buf
Definition: infer.py:34
nvinfer1::DimsHW::DimsHW
DimsHW(int32_t height, int32_t width)
Construct a DimsHW given height and width.
Definition: NvInfer.h:113
half_float::detail::functions::erfc
static expr erfc(float arg)
Complementary error function implementation.
Definition: samples/common/half.h:2018
polygraphy.tools.args.trt.loader.TrtLoaderArgs.register
def register(self, maker)
Definition: tools/args/trt/loader.py:65
nvcaffeparser1::CaffeParser::allocMemory
T * allocMemory(int size=1)
Definition: caffeParser.h:70
sample::Parser
Definition: sampleEngines.h:33
polygraphy.backend.trt.loader.EngineFromNetwork.__init__
def __init__(self, network, config=None)
Definition: backend/trt/loader.py:386
LogStreamConsumerBuffer::LogStreamConsumerBuffer
LogStreamConsumerBuffer(std::ostream &stream, const std::string &prefix, bool shouldLog)
Definition: pluginLogging.h:35
nvinfer1::plugin::MultilevelProposeROI::mAnchorsCnt
std::vector< int > mAnchorsCnt
Definition: multilevelProposeROIPlugin.h:102
generate.shape
tuple shape
Definition: 03_isolating_a_subgraph/generate.py:24
UffPoolPluginV2Creator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: sampleUffPluginV2Ext.cpp:648
IBatchStream::getBatchSize
virtual int getBatchSize() const =0
SampleReformatFreeIO::mEngine
std::shared_ptr< nvinfer1::ICudaEngine > mEngine
The TensorRT engine used to run the network.
Definition: sampleReformatFreeIO.cpp:259
nvinfer1::Permutation
Definition: NvInfer.h:3726
nvinfer1::plugin::GroupNormalizationPlugin::_cudnn_handle
cudnnHandle_t _cudnn_handle
Definition: groupNormalizationPlugin.h:102
pytorch_quantization.nn.modules.quant_instancenorm.QuantInstanceNorm1d.__init__
def __init__(self, int num_features, float eps=1e-5, float momentum=0.1, bool affine=False, bool track_running_stats=False, **kwargs)
Definition: quant_instancenorm.py:36
nmtSample::Vocabulary::getStartSequenceId
int getStartSequenceId() override
Definition: vocabulary.cpp:93
bert::SkipLayerNormInterleavedPluginCreator
Definition: skipLayerNormInt8InterleavedPlugin.h:94
bert::fused_multihead_attention_v2_fp16_128_64_kernel_sm80_cubin
unsigned char fused_multihead_attention_v2_fp16_128_64_kernel_sm80_cubin[]
Definition: fused_multihead_attention_v2_fp16_128_64_kernel.sm80.cpp:19
ProposalWorkSpace::sortClassPosDims
nvinfer1::DimsHW sortClassPosDims
Definition: maskRCNNKernels.h:136
half_float::detail::functions::fpclassify
static int fpclassify(half arg)
Classification implementation.
Definition: samples/common/half.h:2330
MNISTBatchStream::mBatchCount
int mBatchCount
The batch that will be read on the next invocation of next()
Definition: BatchStream.h:147
dumpTFWts.getTRTType
def getTRTType(tensor)
Definition: dumpTFWts.py:54
helpers.tokenization.BertTokenizer.__init__
def __init__(self, vocab_file, do_lower_case=True)
Definition: tokenization.py:187
BatchStream::getBatch
float * getBatch() override
Definition: BatchStream.h:259
polygraphy.comparator.data_loader.DataLoader
Definition: comparator/data_loader.py:47
test_importers.TestOnnxImporter.test_import_graph
def test_import_graph(self, model)
Definition: test_importers.py:93
nvinfer1::plugin::MultilevelProposeROI::getWorkspaceSize
size_t getWorkspaceSize(int maxBatchSize) const override
Definition: multilevelProposeROIPlugin.cpp:308
std::numeric_limits< half_float::half >::max
static half_float::half max()
Largest finite value.
Definition: samples/common/half.h:4228
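For reference, the IEEE 754 binary16 values behind "smallest positive normal value" and "largest finite value" can be checked with Python's standard `struct` module, which supports the half-precision format code `e`. This is an independent sketch; the helper name `half_from_bits` is made up for the example and does not come from `half.h`.

```python
import struct


def half_from_bits(bits: int) -> float:
    """Interpret a 16-bit pattern as an IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<H", bits))[0]


# Biased exponent 1, mantissa 0: the smallest positive normal half.
smallest_normal = half_from_bits(0x0400)
# Biased exponent 30, mantissa all ones: the largest finite half.
largest_finite = half_from_bits(0x7BFF)

print(smallest_normal)  # 6.103515625e-05
print(largest_finite)   # 65504.0
```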
polygraphy.backend.base.loader.BaseLoadModel
Definition: backend/base/loader.py:17
polygraphy.tools.args.trt.runner
Definition: tools/args/trt/runner.py:1
pytorch_quantization.quant_modules._quant_entry
_quant_entry
Definition: quant_modules.py:23
nvinfer1::IActivationLayer::setAlpha
virtual void setAlpha(float alpha)=0
Set the alpha parameter (must be finite).
nvinfer1::plugin::InstanceNormalizationPluginCreator::createPlugin
IPluginV2DynamicExt * createPlugin(const char *name, const nvinfer1::PluginFieldCollection *fc) override
Return a plugin object.
Definition: instanceNormalizationPlugin.cpp:350
nvcaffeparser1::CaffeParser::setPluginFactory
void setPluginFactory(nvcaffeparser1::IPluginFactory *factory) override
Set the IPluginFactory used to create the user defined plugins.
Definition: caffeParser.h:48
test_examples.Artifact.__init__
def __init__(self, name, infer=True)
Definition: onnx-graphsurgeon/tests/test_examples.py:34
tests.quant_pooling_test.TestQuantAdaptiveAvgPool1d.test_raise
def test_raise(self)
Definition: quant_pooling_test.py:220
nvinfer1::IPluginV2Layer::~IPluginV2Layer
virtual ~IPluginV2Layer()
Definition: NvInfer.h:3468
pytorch_quantization.tensor_quant.ScaledQuantDescriptor.fake_quant
def fake_quant(self)
Definition: tensor_quant.py:116
nvinfer1::plugin::reduced_divisor::y
int y
Definition: reducedMathPlugin.h:98
onnx_graphsurgeon.logger.logger.Logger.verbose
def verbose(self, message, mode=LogMode.EACH)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:223
pytorch_quantization.nn.modules.quant_rnn.GRUCell.bias
bias
Definition: modules/quant_rnn.py:430
nvinfer1::plugin::gLogVerbose
LogStream< ILogger::Severity::kVERBOSE > gLogVerbose
Definition: checkMacrosPlugin.cpp:51
SampleINT8APIParams
The SampleINT8APIParams structure groups the additional parameters required by the INT8 API sample.
Definition: sampleINT8API.cpp:55
tests.quant_instancenorm_test.TestQuantInstanceNorm2D
Definition: quant_instancenorm_test.py:97
nvinfer1::LayerType::kCONVOLUTION
@ kCONVOLUTION
Convolution layer.
sample::TrtCudaStream::operator=
TrtCudaStream & operator=(const TrtCudaStream &)=delete
nvinfer1::BuilderFlags
uint32_t BuilderFlags
Represents a collection of one or more BuilderFlag values using binary OR operations,...
Definition: NvInfer.h:6375
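A minimal sketch of how a 32-bit flag mask of this kind can be composed, assuming each enumerator is a bit position that is shifted into the mask before being ORed in. The `Flag` enum, its values, and both helper functions are hypothetical stand-ins for illustration; they are not the TensorRT enumerators or API.

```python
from enum import IntEnum


class Flag(IntEnum):
    # Hypothetical enumerators: each value is a bit position, not a mask.
    FP16 = 0
    INT8 = 1
    DEBUG = 2


def make_flags(*flags):
    """OR together one bit per flag and keep the result within 32 bits."""
    mask = 0
    for flag in flags:
        mask |= 1 << int(flag)
    return mask & 0xFFFFFFFF


def has_flag(mask, flag):
    """Return True if the bit for `flag` is set in `mask`."""
    return bool(mask & (1 << int(flag)))


flags = make_flags(Flag.FP16, Flag.DEBUG)
```

Storing bit positions in the enum and shifting at the point of use keeps the enum values dense, which is why a separate mask type is needed to hold a combination of them.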
polygraphy.util.misc.send_on_queue
def send_on_queue(queue, obj)
Definition: Polygraphy/polygraphy/util/misc.py:211
bert::FusedMultiHeadAttentionXMMAKernelV2::FusedMultiHeadAttentionXMMAKernelV2
FusedMultiHeadAttentionXMMAKernelV2(const FusedMultiHeadAttentionKernelMetaInfoV2 *pMetaStart, unsigned int nMetaCount, Data_type type, unsigned int sm)
Definition: fused_multihead_attention_v2.h:426
polygraphy.backend.trt.loader.ModifyNetwork.__call__
def __call__(self)
Definition: backend/trt/loader.py:208
pytorch_quantization.nn.modules.quant_conv._QuantConvTransposeNd.__init__
def __init__(self, in_channels, out_channels, kernel_size, stride, padding, dilation, transposed, output_padding, groups, bias, padding_mode, quant_desc_input, quant_desc_weight)
Definition: quant_conv.py:245
helpers.tokenization.FullTokenizer.inv_vocab
inv_vocab
Definition: tokenization.py:165
nvinfer1::plugin::PyramidROIAlign::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: pyramidROIAlignPlugin.cpp:98
nvcaffeparser1::IPluginFactoryV2
Plugin factory used to configure plugins.
Definition: NvCaffeParser.h:137
nvinfer1::DimensionOperation::kLESS
@ kLESS
1 if first operand is less than second operand, 0 otherwise.
backend.trt.test_loader.TestNetworkFromOnnxPath.test_loader
def test_loader(self)
Definition: trt/test_loader.py:106
Logger::TestResult::kFAILED
@ kFAILED
The test failed.
nvinfer1::ElementWiseOperation::kPROD
@ kPROD
Product of the two elements.
nvinfer1::plugin::CropAndResizePluginCreator::mNamespace
std::string mNamespace
Definition: cropAndResizePlugin.h:125
SampleMNIST::mParams
samplesCommon::CaffeSampleParams mParams
The parameters for the sample.
Definition: sampleMNIST.cpp:96
data_functions.get_batch_to_gpu
def get_batch_to_gpu(model_name)
Definition: data_functions.py:49
nvinfer1::EnumMax< LoopOutput >
constexpr int32_t EnumMax< LoopOutput >()
Maximum number of elements in LoopOutput enum.
Definition: NvInfer.h:4501
anonymous_namespace{multilevelCropAndResizePlugin.cpp}::MULTILEVELCROPANDRESIZE_PLUGIN_NAME
const char * MULTILEVELCROPANDRESIZE_PLUGIN_NAME
Definition: multilevelCropAndResizePlugin.cpp:32
polygraphy.comparator.data_loader.DataLoaderCache.__getitem__
def __getitem__(self, iteration)
Definition: comparator/data_loader.py:205
nvinfer1::plugin::PriorBoxPluginCreator::PriorBoxPluginCreator
PriorBoxPluginCreator()
Definition: priorBoxPlugin.cpp:345
polygraphy.backend.tf
Definition: tools/Polygraphy/polygraphy/backend/tf/__init__.py:1
bert::FusedMultiHeadAttentionXMMAKernelV2::run
virtual void run(Fused_multihead_attention_params_v2 &params, cudaStream_t ss) const
Definition: fused_multihead_attention_v2.h:444
pytorch_quantization.calib.histogram.HistogramCalibrator.__str__
def __str__(self)
Definition: histogram.py:241
Logger::severityOstream
static std::ostream & severityOstream(Severity severity)
Returns an appropriate output stream (cout or cerr) to use with the given severity.
Definition: pluginLogging.h:431
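The stream selection described above can be sketched as follows. This is an illustration only: the `Severity` ordering (lower value means more severe) and the cut-off at the informational level are assumptions, and `severity_stream` is a hypothetical name, not part of the logger.

```python
import sys
from enum import IntEnum


class Severity(IntEnum):
    # Assumed ordering: lower value means more severe.
    INTERNAL_ERROR = 0
    ERROR = 1
    WARNING = 2
    INFO = 3
    VERBOSE = 4


def severity_stream(severity):
    """Send informational output to stdout and warnings or errors to stderr."""
    return sys.stdout if severity >= Severity.INFO else sys.stderr
```

Routing by severity lets a caller redirect diagnostics separately from normal output, for example with `2> errors.log` in a shell.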
nvinfer1::plugin::PluginCreatorRegistry::mRegistry
std::stack< std::unique_ptr< IPluginCreator > > mRegistry
Definition: InferPlugin.cpp:143
std::numeric_limits< half_float::half >::epsilon
static half_float::half epsilon()
Difference between one and next representable value.
Definition: samples/common/half.h:4234
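To make "difference between one and next representable value" concrete, the sketch below derives the IEEE 754 binary16 epsilon and the largest finite value from raw bit patterns. It uses Python's standard `struct` module (format code `e`) rather than the half_float library, and the helper names are hypothetical.

```python
import struct


def half_from_bits(bits):
    """Interpret a 16-bit pattern as an IEEE 754 binary16 value."""
    return struct.unpack("<e", struct.pack("<H", bits))[0]


def bits_from_half(value):
    """Return the 16-bit pattern of a value stored as binary16."""
    return struct.unpack("<H", struct.pack("<e", value))[0]


one_bits = bits_from_half(1.0)

# Incrementing the bit pattern of 1.0 gives the next representable value.
epsilon = half_from_bits(one_bits + 1) - 1.0

# Largest finite value: maximum non-special exponent, all mantissa bits set.
largest_finite = half_from_bits(0x7BFF)
```

With a 10-bit mantissa, the step above 1.0 is 2^-10, which is why half precision carries only about three decimal digits.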
SampleUffFasterRcnn::visualize_boxes
void visualize_boxes(int img_num, int class_num, std::vector< float > &pred_boxes, std::vector< float > &pred_probs, std::vector< int > &pred_cls_ids, std::vector< int > &box_num_per_img, std::vector< vPPM > &ppms)
Helper function to dump bbox-overlayed images as PPM files.
Definition: sampleUffFasterRCNN.cpp:687
polygraphy.backend.tf.loader.ModifyGraph.outputs
outputs
Definition: backend/tf/loader.py:287
sample::InferenceOptions::batch
int batch
Definition: sampleOptions.h:148
nvinfer1::plugin::CropAndResizePlugin::isOutputBroadcastAcrossBatch
bool isOutputBroadcastAcrossBatch(int outputIndex, const bool *inputIsBroadcasted, int nbInputs) const override
Definition: cropAndResizePlugin.cpp:207
RefineDetectionWorkSpace::sortNMSMarkDims
nvinfer1::DimsHW sortNMSMarkDims
Definition: maskRCNNKernels.h:105
nmtSample::DataReader::~DataReader
~DataReader() override=default
builder.load_tf_weights
def load_tf_weights(inputbase, config)
Definition: builder.py:358
nvinfer1::plugin::throwCudnnError
void throwCudnnError(const char *file, const char *function, int line, int status, const char *msg)
Definition: checkMacrosPlugin.cpp:87
polygraphy.comparator.struct.RunResults.keys
def keys(self)
Definition: comparator/struct.py:69
nvinfer1::plugin::GroupNormalizationPluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: groupNormalizationPlugin.h:142
samplesCommon::ManagedBuffer::hostBuffer
HostBuffer hostBuffer
Definition: buffers.h:226
nvinfer1::INetworkDefinition::addParametricReLU
virtual IParametricReLULayer * addParametricReLU(ITensor &input, ITensor &slope) noexcept=0
Add a parametric ReLU layer to the network.
builder.skipln
def skipln(prefix, config, init_dict, network, input_tensor, skip, bias=None)
Definition: builder.py:178
tests.utils.assert_min_mse
def assert_min_mse(a, b, tol=1e-20)
Definition: tools/pytorch-quantization/tests/utils.py:110
MaskRCNNUtils::BBoxInfo
Definition: sampleUffMaskRCNN.cpp:62
nvuffparser::FieldType::kFLOAT
@ kFLOAT
FP32 field type.
half_float::detail::cosh
expr cosh(half arg)
Hyperbolic cosine.
Definition: ieee_half.h:2480
replace.replace_with_clip
def replace_with_clip(self, inputs, outputs)
Definition: replace.py:26
nvinfer1::UnaryOperation::kERF
@ kERF
Gauss error function.
sample::ModelFormat::kUFF
@ kUFF
onnx_graphsurgeon.logger.logger.Logger.timestamp
timestamp
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:106
onnx_graphsurgeon.importers.onnx_importer.OnnxImporter.import_tensor
Tensor import_tensor(Union[onnx.ValueInfoProto, onnx.TensorProto] onnx_tensor)
Definition: onnx_importer.py:80
nvinfer1::plugin::GridAnchorGenerator::mDeviceHeights
Weights * mDeviceHeights
Definition: gridAnchorPlugin.h:98
half_float::detail::functions::nexttoward
static half nexttoward(half from, long double to)
Enumeration implementation.
Definition: samples/common/half.h:2305
nmtSample::LimitedSamplesDataReader::gMaxSamplesToRead
int gMaxSamplesToRead
Definition: limitedSamplesDataReader.h:41
MaskRCNNConfig::RPN_ANCHOR_RATIOS
static const float RPN_ANCHOR_RATIOS[]
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:67
SampleMovieLens::constructNetwork
void constructNetwork(SampleUniquePtr< nvinfer1::IBuilder > &builder, SampleUniquePtr< nvinfer1::INetworkDefinition > &network, SampleUniquePtr< nvinfer1::IBuilderConfig > &config, SampleUniquePtr< nvuffparser::IUffParser > &parser)
Parses a UFF model for an MLP NCF model, creates a TensorRT network, and builds a TensorRT engine.
Definition: sampleMovieLens.cpp:210
SampleDynamicReshape::mInput
samplesCommon::ManagedBuffer mInput
Host and device buffers for the input.
Definition: sampleDynamicReshape.cpp:88
nvinfer1::EngineCapability
EngineCapability
List of supported engine capability flows.
Definition: NvInferRuntime.h:49
nvinfer1::Dims4::Dims4
Dims4(int32_t d0, int32_t d1, int32_t d2, int32_t d3)
Construct a Dims4 from 4 elements.
Definition: NvInfer.h:310
pytorch_quantization.nn.modules.quant_rnn.QuantRNNBase.check_forward_args
def check_forward_args(self, input, hidden, batch_sizes)
Definition: modules/quant_rnn.py:152
bert::fused_multihead_attention_fp16_96_64_kernel_sm75_cu_o_len
unsigned int fused_multihead_attention_fp16_96_64_kernel_sm75_cu_o_len
Definition: fused_multihead_attention_fp16_96_64_kernel.sm75.cpp:2409
polygraphy.tools.args.trt_legacy.TrtLegacyArgs.caffe_model
caffe_model
Definition: tools/args/trt_legacy.py:60
half_float::detail::copysign
half copysign(half x, half y)
Take sign.
Definition: ieee_half.h:2702
RefineDetectionWorkSpace::argMaxBboxDims
nvinfer1::DimsHW argMaxBboxDims
Definition: maskRCNNKernels.h:98
inference_c.inference
def inference(features, tokens)
Definition: inference_c.py:131
polygraphy.backend.trt_legacy.LoadNetworkFromCaffe.batch_size
batch_size
Definition: backend/trt_legacy.py:157
nvinfer1::TensorFormat::kCHW2
@ kCHW2
Two wide channel vectorized row major format.
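A short sketch of what a two-wide channel-vectorized layout implies for addressing. It assumes a single image is stored as a row-major array of shape [ceil(C/2)][H][W][2], so that coordinate (c, h, w) maps to subscript [c/2][h][w][c%2]; both function names are hypothetical.

```python
def chw2_offset(c, h, w, C, H, W):
    """Element offset of (c, h, w) in one image stored as [ceil(C/2)][H][W][2]."""
    assert 0 <= c < C and 0 <= h < H and 0 <= w < W
    return (((c // 2) * H + h) * W + w) * 2 + (c % 2)


def chw2_size(C, H, W):
    """Number of elements allocated, including padding when C is odd."""
    return ((C + 1) // 2) * H * W * 2
```

Under this assumption, channels 0 and 1 of the same pixel are adjacent in memory, and an odd channel count leaves one padded slot per pixel in the last channel pair.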
onnx_graphsurgeon.ir.tensor.Tensor
Definition: tensor.py:24
test_ir.TensorBaseTests.tensor
tensor
Definition: test_ir.py:53
SampleCharRNNParams::outputSize
int outputSize
Definition: sampleCharRNN.cpp:121
tools.test_run.TestTrt.test_trt_int8
def test_trt_int8(self)
Definition: test_run.py:97
nvinfer1::plugin::ResizeNearest::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: resizeNearestPlugin.cpp:187
nvinfer1::UnaryOperation::kLOG
@ kLOG
Log (base e).
onnx_graphsurgeon.logger.logger.Logger.info
def info(self, message, mode=LogMode.EACH)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:231
samplesCommon::HostAllocator
Definition: buffers.h:197
nvinfer1::plugin::DetectionOutput::DetectionOutput
DetectionOutput(DetectionOutputParameters param)
Definition: nmsPlugin.cpp:38
pytorch_quantization.calib.histogram.HistogramCalibrator.reset
def reset(self)
Definition: histogram.py:87
nmtSample::BLEUScoreWriter::mTranslationLength
size_t mTranslationLength
Definition: bleuScoreWriter.h:55
polygraphy.backend.onnx.util.check_outputs_not_found
def check_outputs_not_found(not_found, all_outputs)
Definition: onnx/util.py:62
SampleINT8::infer
bool infer(std::vector< float > &score, int firstScoreBatch, int nbScoreBatches)
Runs the TensorRT inference engine for this sample.
Definition: sampleINT8.cpp:271
test_ir.TestNode.node
node
Definition: test_ir.py:138
generate.B
float B
Definition: 07_creating_a_model_with_the_layer_api/generate.py:88
nvinfer1::plugin::CoordConvACPlugin::getWorkspaceSize
size_t getWorkspaceSize(int maxBatchSize) const override
Definition: coordConvACPlugin.cpp:83
nvinfer1::plugin::gLogWarning
LogStream< ILogger::Severity::kWARNING > gLogWarning
Definition: checkMacrosPlugin.cpp:49
tools.test_run.TestLogging
Definition: test_run.py:45
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer._disabled
_disabled
Definition: tensor_quantizer.py:83
nvinfer1::plugin::InstanceNormalizationPlugin::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: instanceNormalizationPlugin.cpp:218
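The reported size has to match, byte for byte, what the serialization routine actually writes. The sketch below shows that contract with Python's `struct` module. The field layout (an epsilon, a channel count, then one scale per channel) is a hypothetical example, not the actual state of any plugin listed here.

```python
import struct

# Hypothetical plugin state: float32 epsilon, int32 channel count,
# followed by one float32 scale per channel. "<" disables padding.
HEADER = "<fi"


def serialization_size(num_channels):
    """Bytes needed for the header plus the per-channel scales."""
    return struct.calcsize(HEADER) + num_channels * struct.calcsize("<f")


def serialize(epsilon, scales):
    """Pack the state in exactly the layout that serialization_size assumes."""
    header = struct.pack(HEADER, epsilon, len(scales))
    body = struct.pack("<%df" % len(scales), *scales)
    return header + body


buffer = serialize(1e-5, [1.0, 0.5, 0.25])
```

Deriving the size from the same format strings used for packing keeps the two functions from drifting apart when a field is added.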
tests.calibrator_test.TestHistogramCalibrator
Definition: calibrator_test.py:94
inference.inference
def inference(features, tokens)
Definition: BERT/inference.py:160
nvinfer1::IPlugin::getSerializationSize
virtual size_t getSerializationSize()=0
Find the size of the serialization buffer required.
nvinfer1::plugin::DetectionOutput
Definition: nmsPlugin.h:31
helpers.calibrator.BertCalibrator.batch_size
batch_size
Definition: demo/BERT/helpers/calibrator.py:38
UffPoolPluginV2::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: sampleUffPluginV2Ext.cpp:537
bert::SkipLayerNormInterleavedPlugin::mBetaDev
bert::cuda_unique_ptr< void > mBetaDev
Definition: skipLayerNormInt8InterleavedPlugin.h:75
tools.test_surgeon.TestSurgeonSanitize.test_sanity
def test_sanity(self)
Definition: test_surgeon.py:81
polygraphy.tools.args.data_loader.DataLoaderArgs.iterations
iterations
Definition: tools/args/data_loader.py:62
SampleMovieLens::infer
bool infer()
Runs the TensorRT inference engine for this sample.
Definition: sampleMovieLens.cpp:277
bert::fused_multihead_attention_fp16_96_64_kernel_sm80_cu_o
unsigned char fused_multihead_attention_fp16_96_64_kernel_sm80_cu_o[]
Definition: fused_multihead_attention_fp16_96_64_kernel.sm80.cpp:19
nvinfer1::IPluginExt::configureWithFormat
virtual void configureWithFormat(const Dims *inputDims, int32_t nbInputs, const Dims *outputDims, int32_t nbOutputs, DataType type, PluginFormat format, int32_t maxBatchSize)=0
Configure the layer.
test_ir.TestVariable.output_node
output_node
Definition: test_ir.py:114
UffPoolPluginV2Creator
Definition: sampleUffPluginV2Ext.cpp:645
SampleConfig::mMaxNCalibBatch
int64_t mMaxNCalibBatch
Definition: sampleConfig.h:48
nvinfer1::plugin::BatchedNMSDynamicPlugin::setPluginNamespace
void setPluginNamespace(const char *libNamespace) override
Set the namespace that this plugin object belongs to.
Definition: batchedNMSPlugin.cpp:419
nvinfer1::plugin::BatchedNMSDynamicPlugin::clone
IPluginV2DynamicExt * clone() const override
Clone the plugin object.
Definition: batchedNMSPlugin.cpp:397
nvinfer1::plugin::TRTException::name
const char * name
Definition: checkMacrosPlugin.h:98
tracer.main
def main()
Definition: tracer.py:101
nvinfer1::plugin::PyramidROIAlignPluginCreator::mPooledSize
int mPooledSize
Definition: pyramidROIAlignPlugin.h:123
model.Decoder.parse_decoder_inputs
def parse_decoder_inputs(self, decoder_inputs)
Definition: demo/Tacotron2/tacotron2/model.py:341
pytorch_quantization.nn.modules.quant_rnn.QuantRNNBase.num_layers
num_layers
Definition: modules/quant_rnn.py:48
pytorch_quantization.nn.modules._utils.QuantMixin._fake_quant
_fake_quant
Definition: _utils.py:68
SampleCharRNNBase::weightsMemory
std::vector< SampleUniquePtr< nvinfer1::IHostMemory > > weightsMemory
Definition: sampleCharRNN.cpp:186
pytorch_quantization.quant_modules.deactivate
def deactivate()
Definition: quant_modules.py:138
nvcaffeparser1::IPluginFactoryV2::createPlugin
virtual nvinfer1::IPluginV2 * createPlugin(const char *layerName, const nvinfer1::Weights *weights, int32_t nbWeights, const char *libNamespace="")=0
Creates a plugin.
model.Encoder.infer
def infer(self, x, input_lengths)
Definition: demo/Tacotron2/tacotron2/model.py:220
sample::LogStreamConsumerBuffer::LogStreamConsumerBuffer
LogStreamConsumerBuffer(LogStreamConsumerBuffer &&other)
Definition: samples/common/logging.h:44
nvinfer1::LayerType::kLRN
@ kLRN
LRN layer.
polygraphy.backend.trt.util.unmark_outputs
def unmark_outputs(network, outputs)
Definition: trt/util.py:252
BatchStream
Custom BatchStream class for Faster-RCNN because we use variable input dimensions and different image...
Definition: BatchStream.h:154
MaskRCNNConfig::DETECTION_MAX_INSTANCES
static const int DETECTION_MAX_INSTANCES
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:40
BufferDesc::dataWidth
int dataWidth
Definition: sampleReformatFreeIO.cpp:118
nvcaffeparser1::parseSigmoid
nvinfer1::ILayer * parseSigmoid(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &, BlobNameToTensor &tensors)
Definition: parseSigmoid.cpp:23
bert::fused_multihead_attention_int8_128_64_kernel_sm80_cu_o_len
unsigned int fused_multihead_attention_int8_128_64_kernel_sm80_cu_o_len
Definition: fused_multihead_attention_int8_128_64_kernel.sm80.cpp:3267
nvinfer1::INetworkDefinition::addScale
virtual IScaleLayer * addScale(ITensor &input, ScaleMode mode, Weights shift, Weights scale, Weights power)=0
Add a Scale layer to the network.
polygraphy.tools.args.model
Definition: tools/Polygraphy/polygraphy/tools/args/model.py:1
bert::fused_multihead_attention_v2_int8_192_64_kernel_sm75_cubin_len
unsigned int fused_multihead_attention_v2_int8_192_64_kernel_sm75_cubin_len
Definition: fused_multihead_attention_v2_int8_192_64_kernel.sm75.cpp:20325
backend.onnxrt.test_runner.TestOnnxrtRunner
Definition: onnxrt/test_runner.py:30
test_examples.infer_model
def infer_model(path)
Definition: onnx-graphsurgeon/tests/test_examples.py:70
nmtSample::SoftmaxLikelihood::SoftmaxLikelihoodCombinationOperator::init
float init() const override
Definition: softmaxLikelihood.cpp:88
nvinfer1::EnumMax< UnaryOperation >
constexpr int32_t EnumMax< UnaryOperation >()
Maximum number of elements in UnaryOperation enum.
Definition: NvInfer.h:3505
SampleGoogleNet::build
bool build()
Builds the network engine.
Definition: sampleGoogleNet.cpp:92
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer.enable
def enable(self)
Definition: tensor_quantizer.py:154
samplesCommon::GenericBuffer::mType
nvinfer1::DataType mType
Definition: buffers.h:173
MaskRCNNUtils::RawDetection::y1
float y1
Definition: sampleUffMaskRCNN.cpp:54
backend.trt.test_runner.TestTrtRunner
Definition: trt/test_runner.py:35
sample::anonymous_namespace{sampleInference.cpp}::EnqueueGraph::operator()
void operator()(TrtCudaStream &stream) const
Definition: sampleInference.cpp:242
nvinfer1::plugin::MultilevelProposeROI::clone
IPluginV2Ext * clone() const override
Clone the plugin object.
Definition: multilevelProposeROIPlugin.cpp:221
bert::QKVToContextInterleavedPlugin::mB
int mB
Definition: qkvToContextInt8InterleavedPlugin.h:82
conf.setup
def setup(app)
Definition: onnx-graphsurgeon/docs/conf.py:86
helpers.tokenization.BertTokenizer.convert_tokens_to_ids
def convert_tokens_to_ids(self, tokens)
Definition: tokenization.py:201
tests.quant_linear_test.TestQuantLinear.test_weight_fake_per_tensor
def test_weight_fake_per_tensor(self)
Definition: quant_linear_test.py:53
polygraphy.backend.onnx.loader
Definition: backend/onnx/loader.py:1
nvinfer1::plugin::FlattenConcat::canBroadcastInputAcrossBatch
bool canBroadcastInputAcrossBatch(int inputIndex) const override
Definition: flattenConcat.cpp:204
samplesCommon::HostMemory::mData
void * mData
Definition: samples/common/common.h:329
SampleINT8API::setDynamicRange
bool setDynamicRange(SampleUniquePtr< nvinfer1::INetworkDefinition > &network)
Sets custom dynamic range for network tensors.
Definition: sampleINT8API.cpp:299
tools.test_surgeon.TestSurgeonInsert.test_input_is_output
def test_input_is_output(self)
Definition: test_surgeon.py:58
nvinfer1::plugin::reduced_divisor::mul_coeff
unsigned int mul_coeff
Definition: reducedMathPlugin.h:96
export_tacotron2_onnx.DecoderIter.decode
def decode(self, decoder_input, in_attention_hidden, in_attention_cell, in_decoder_hidden, in_decoder_cell, in_attention_weights, in_attention_weights_cum, in_attention_context, memory, processed_memory, mask)
Definition: export_tacotron2_onnx.py:132
prn_utils.combineDescriptions
def combineDescriptions(prolog, features, descriptions)
Definition: prn_utils.py:29
pytorch_quantization.nn.modules._utils.QuantInputMixin.input_quantizer
def input_quantizer(self)
Definition: _utils.py:134
MaskRCNNConfig::RPN_ANCHOR_SCALES
static const std::vector< float > RPN_ANCHOR_SCALES
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:63
onnx_graphsurgeon.importers.base_importer.BaseImporter.import_graph
Graph import_graph(graph)
Definition: base_importer.py:21
nvinfer1::plugin::DetectionLayer::mAnchorsCnt
int mAnchorsCnt
Definition: detectionLayerPlugin.h:100
pytorch_quantization.nn.modules.quant_rnn.GRUCell.weight_ih
weight_ih
Definition: modules/quant_rnn.py:431
nvinfer1::plugin::ResizeNearest::getOutputDataType
DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: resizeNearestPlugin.cpp:218
half_float::detail::functions::isunordered
static bool isunordered(half x, half y)
Comparison implementation.
Definition: samples/common/half.h:2470
sample::InferenceTime::compute
float compute
Definition: sampleReporting.h:54
perf.main
def main()
Definition: perf.py:39
polygraphy.tools.args.trt.loader.TrtLoaderArgs.add_trt_config_loader
def add_trt_config_loader(self, script, data_loader_name)
Definition: tools/args/trt/loader.py:160
samplesCommon::TimerBase::seconds
float seconds() const noexcept
Definition: samples/common/common.h:794
nvinfer1::EnumMax< DimensionOperation >
constexpr int32_t EnumMax< DimensionOperation >()
Maximum number of elements in DimensionOperation enum.
Definition: NvInferRuntime.h:310
example.calib_data
def calib_data()
Definition: Polygraphy/examples/api/04_int8_calibration_in_tensorrt/example.py:36
nvinfer1::IPluginV2::configureWithFormat
virtual void configureWithFormat(const Dims *inputDims, int32_t nbInputs, const Dims *outputDims, int32_t nbOutputs, DataType type, PluginFormat format, int32_t maxBatchSize)=0
Configure the layer.
nvinfer1::IAlgorithmSelector::~IAlgorithmSelector
virtual ~IAlgorithmSelector()
Definition: NvInfer.h:6336
nvinfer1::plugin::RPROIPlugin::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: nvFasterRCNNPlugin.cpp:229
nvinfer1::plugin::SplitPluginCreator::createPlugin
nvinfer1::IPluginV2DynamicExt * createPlugin(const char *, const nvinfer1::PluginFieldCollection *)
Return a plugin object.
Definition: split.h:158
nmtSample::SequenceProperties::SequenceProperties
SequenceProperties()=default
nvcaffeparser1::parsePermute
nvinfer1::ILayer * parsePermute(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &, BlobNameToTensor &tensors)
Definition: parsePermute.cpp:23
samplesCommon::InferDeleter
Definition: samples/common/common.h:375
backend.onnx.test_loader.TestOnnxFileLoader
Definition: onnx/test_loader.py:35
nvinfer1::plugin::BatchedNMSPlugin::boxesSize
int boxesSize
Definition: batchedNMSPlugin.h:68
nvinfer1::IRefitter::getAll
virtual int32_t getAll(int32_t size, const char **layerNames, WeightsRole *roles)=0
Get description of all weights that could be refit.
pytorch_quantization.tensor_quant.ScaledQuantDescriptor.calib_method
def calib_method(self)
Definition: tensor_quant.py:140
nvinfer1::plugin::GenerateDetection::check_valid_inputs
void check_valid_inputs(const nvinfer1::Dims *inputs, int nbInputDims)
Definition: generateDetectionPlugin.cpp:234
half_float::detail::functions::log10
static expr log10(float arg)
Common logarithm implementation.
Definition: samples/common/half.h:1807
nvinfer1::plugin::DetectionOutput::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: nmsPlugin.cpp:81
nvinfer1::CUDADriverWrapper::_cuModuleGetFunction
CUresult(* _cuModuleGetFunction)(CUfunction *, CUmodule, const char *)
Definition: cudaDriverWrapper.h:77
polygraphy.tools.args.trt_legacy.TrtLegacyArgs.model_args
model_args
Definition: tools/args/trt_legacy.py:44
polygraphy.tools.args.onnx.runner.OnnxtfRunnerArgs.register
def register(self, maker)
Definition: tools/args/onnx/runner.py:21
nvinfer1::IMatrixMultiplyLayer
Layer that represents a Matrix Multiplication.
Definition: NvInfer.h:4186
SampleUffFasterRcnnParams
Definition: sampleUffFasterRCNN.cpp:59
polygraphy.tools.args.trt_legacy.TrtLegacyArgs.parse
def parse(self, args)
Definition: tools/args/trt_legacy.py:58
half_float::detail::functions::isless
static bool isless(half x, half y)
Comparison implementation.
Definition: samples/common/half.h:2432
nvinfer1::plugin::GroupNormalizationPlugin::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: groupNormalizationPlugin.cpp:67
sample::operator+=
InferenceTime operator+=(InferenceTime &a, const InferenceTime &b)
Definition: sampleReporting.h:107
nvinfer1::plugin::ResizeNearest::initialize
int initialize() override
Initialize the layer for execution.
Definition: resizeNearestPlugin.cpp:114
nmtSample::Vocabulary::mEosStr
static const std::string mEosStr
Definition: vocabulary.h:70
nvcaffeparser1::readTextProto
bool readTextProto(trtcaffe::NetParameter *net, const char *file)
Definition: readProto.h:64
nvinfer1::EnumMax< RNNInputMode >
constexpr int32_t EnumMax< RNNInputMode >()
Maximum number of elements in RNNInputMode enum.
Definition: NvInfer.h:2849
nvinfer1::plugin::ProposalLayer::mParam
RefineNMSParameters mParam
Definition: proposalLayerPlugin.h:111
half_float::detail::functions::logb
static half logb(half arg)
Exponent implementation.
Definition: samples/common/half.h:2257
nvinfer1::plugin::ProposalPluginCreator::~ProposalPluginCreator
~ProposalPluginCreator() override
Definition: proposalPlugin.cpp:332
nvinfer1::plugin::GroupNormalizationPlugin::initialize
int initialize() override
Initialize the layer for execution.
Definition: groupNormalizationPlugin.cpp:55
onnx_graphsurgeon.ir.tensor.Variable.copy
def copy(self)
Definition: tensor.py:177
sample::TrtCudaBuffer::mPtr
void * mPtr
Definition: sampleDevice.h:270
Args::userToExpectedItemProbMap
std::map< int32_t, std::vector< std::pair< int32_t, float > > > userToExpectedItemProbMap
Definition: sampleMovieLensMPS.cpp:183
nmtSample::BLEUScoreWriter::mMaxOrder
int mMaxOrder
Definition: bleuScoreWriter.h:56
BertInference::prepare
void prepare(int profIdx, int batchSize)
Definition: bert_infer.h:158
bert::TFusedMultiHeadAttentionXMMAKernel::run
virtual void run(TKernelParam &params, cudaStream_t ss) const
Definition: fused_multihead_attention.h:258
nvinfer1::plugin::DetectionOutputParameters::confidenceThreshold
float confidenceThreshold
Definition: NvInferPluginUtils.h:185
nmtSample::MultiplicativeAlignment::addAttentionKeys
void addAttentionKeys(nvinfer1::INetworkDefinition *network, nvinfer1::ITensor *memoryStates, nvinfer1::ITensor **attentionKeys) override
Adds the attention keys calculation (from source memory states) to the network.
Definition: multiplicativeAlignment.cpp:48
half_float::detail::isless
bool isless(half x, half y)
Comparison for less than.
Definition: ieee_half.h:2795
nvinfer1::ICudaEngine::serialize
virtual IHostMemory * serialize() const noexcept=0
Serialize the network to a stream.
tests.fixtures.models.QuantLeNet.fc1
fc1
Definition: tools/pytorch-quantization/tests/fixtures/models.py:49
nvinfer1::plugin::CropAndResizePlugin::clone
nvinfer1::IPluginV2Ext * clone() const override
Clone the plugin object.
Definition: cropAndResizePlugin.cpp:181
backend.trt.test_loader.TestConfigLoader.test_strict_types
def test_strict_types(self, identity_builder_network, flag)
Definition: trt/test_loader.py:193
nvinfer1::plugin::Region::W
int W
Definition: regionPlugin.h:97
onnx_graphsurgeon.util.misc.SynchronizedList.__setitem__
def __setitem__(self, index, elem)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/util/misc.py:91
SampleCharRNNLoop::LstmIO
Definition: sampleCharRNN.cpp:241
helpers.calibrator.BertCalibrator.read_calibration_cache
def read_calibration_cache(self)
Definition: demo/BERT/helpers/calibrator.py:89
nmtSample::LikelihoodCombinationOperator::ptr
std::shared_ptr< LikelihoodCombinationOperator > ptr
Definition: likelihoodCombinationOperator.h:27
pytorch_quantization.nn.modules.quant_conv.QuantConv3d.padding_mode
padding_mode
Definition: quant_conv.py:164
half_float::detail::functions::sin
static expr sin(float arg)
Sine implementation.
Definition: samples/common/half.h:1886
onnx_graphsurgeon.ir.tensor.Tensor.dtype
dtype
Definition: tensor.py:86
half_float::detail::tanh
expr tanh(half arg)
Hyperbolic tangent.
Definition: ieee_half.h:2487
half_float::detail::half_caster< T, expr, R >::cast_impl
static T cast_impl(float arg, true_type)
Definition: samples/common/half.h:2672
nvinfer1::plugin::DetectionLayerPluginCreator::mKeepTopK
int mKeepTopK
Definition: detectionLayerPlugin.h:128
polygraphy.backend.onnx.loader.ModifyOnnx.exclude_outputs
exclude_outputs
Definition: backend/onnx/loader.py:126
test_importers.TestOnnxImporter.test_import_graph_tensor_map_preserved
def test_import_graph_tensor_map_preserved(self)
Definition: test_importers.py:105
nvcaffeparser1::BinaryProtoBlob::~BinaryProtoBlob
~BinaryProtoBlob() override
Definition: binaryProtoBlob.h:56
polygraphy.backend.trt_legacy.LoadNetworkFromUff.__init__
def __init__(self, uff_loader, uff_order=None)
Definition: backend/trt_legacy.py:81
polygraphy.logger.logger.Logger.logging_indent
logging_indent
Definition: Polygraphy/polygraphy/logger/logger.py:127
nvinfer1::plugin::GridAnchorGenerator::configurePlugin
void configurePlugin(const Dims *inputDims, int nbInputs, const Dims *outputDims, int nbOutputs, const DataType *inputTypes, const DataType *outputTypes, const bool *inputIsBroadcast, const bool *outputIsBroadcast, PluginFormat floatFormat, int maxBatchSize) override
Definition: gridAnchorPlugin.cpp:322
export_tacotron2_onnx.parse_args
def parse_args(parser)
Definition: export_tacotron2_onnx.py:29
nvinfer1::plugin::RPROIPluginCreator::createPlugin
IPluginV2Ext * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: nvFasterRCNNPlugin.cpp:354
pytorch_quantization.nn.modules.quant_pooling.QuantMaxPool1d
Definition: quant_pooling.py:36
onnx_graphsurgeon.ir.graph.UnseenTensor
Definition: graph.py:29
nvinfer1::IHostMemory::type
virtual DataType type() const noexcept=0
The type of the memory that was allocated.
nvinfer1::plugin::CropAndResizePluginCreator::deserializePlugin
IPluginV2Ext * deserializePlugin(const char *name, const void *serialData, size_t serialLength) override
Called during deserialization of plugin layer.
Definition: cropAndResizePlugin.cpp:295
nvinfer1::utility::TRTParsedArgs::positionalArgs
std::vector< std::string > positionalArgs
Definition: getOptions.h:61
bert::QKVToContextInterleavedPlugin::mSM
int mSM
Definition: qkvToContextInt8InterleavedPlugin.h:83
nvuffparser::shutdownProtobufLibrary
void shutdownProtobufLibrary(void)
Shuts down protocol buffers library.
SampleSSD::teardown
bool teardown()
Cleans up any state created in the sample class.
Definition: sampleSSD.cpp:264
nvinfer1::IExecutionContext::enqueue
virtual bool enqueue(int32_t batchSize, void **bindings, cudaStream_t stream, cudaEvent_t *inputConsumed) noexcept=0
Asynchronously execute inference on a batch.
half_float::detail::functions::erf
static expr erf(float arg)
Error function implementation.
Definition: samples/common/half.h:2006
Int8EntropyCalibrator2::mImpl
EntropyCalibratorImpl mImpl
Definition: frcnnUtils.h:537
nvinfer1::plugin::PriorBoxParameters::aspectRatios
float * aspectRatios
Definition: NvInferPluginUtils.h:93
polygraphy.backend.trt_legacy.LoadNetworkFromCaffe.outputs
outputs
Definition: backend/trt_legacy.py:155
tools.test_run.TestOnnxrt.test_onnx_rt_custom_outputs
def test_onnx_rt_custom_outputs(self)
Definition: test_run.py:198
FCPlugin::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: samples/opensource/samplePlugin/fcPlugin.h:95
backend.trt.test_runner.TestTrtRunner.test_empty_tensor_with_dynamic_input_shape_tensor
def test_empty_tensor_with_dynamic_input_shape_tensor(self)
Definition: trt/test_runner.py:99
polygraphy.logger.logger.LoggerVerbosity.__init__
def __init__(self, logger, severity)
Definition: Polygraphy/polygraphy/logger/logger.py:59
bert::TFusedMultiHeadAttentionXMMAKernel::loadXMMAKernels
void loadXMMAKernels()
Definition: fused_multihead_attention.h:212
comparator.test_postprocess.TestTopK.test_basic
def test_basic(self)
Definition: test_postprocess.py:21
nvonnxparser::IOnnxConfig::destroy
virtual void destroy()=0
Destroy IOnnxConfig object.
nvinfer1::plugin::FlattenConcatPluginCreator::mIgnoreBatch
bool mIgnoreBatch
Definition: flattenConcat.h:127
nvinfer1::plugin::ResizeNearest::enqueue
int enqueue(int batch_size, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: resizeNearestPlugin.cpp:197
nvinfer1::plugin::CropAndResizePlugin::mCropWidth
size_t mCropWidth
Definition: cropAndResizePlugin.h:101
nvinfer1::IAlgorithmContext::getNbInputs
virtual int32_t getNbInputs() const =0
Return number of inputs of the algorithm.
sample::anonymous_namespace{sampleInference.cpp}::SyncStruct::gpuStart
TrtCudaEvent gpuStart
Definition: sampleInference.cpp:170
nvinfer1::plugin::NormalizePluginCreator::createPlugin
IPluginV2Ext * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: normalizePlugin.cpp:282
test_importers.TestOnnxImporter.test_import_tensor_unknown_metadata
def test_import_tensor_unknown_metadata(self)
Definition: test_importers.py:54
nvinfer1::plugin::GridAnchorGenerator::getWorkspaceSize
size_t getWorkspaceSize(int maxBatchSize) const override
Definition: gridAnchorPlugin.cpp:193
utils.load_wav_to_torch
def load_wav_to_torch(full_path)
Definition: demo/Tacotron2/common/utils.py:31
onnx_graphsurgeon.util.misc.SynchronizedList.__add__
def __add__(self, List[object] other_list)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/util/misc.py:129
onnx_graphsurgeon.ir.node.Node.__init__
def __init__(self, str op, str name=None, Dict[str, object] attrs=None, List["Tensor"] inputs=None, List["Tensor"] outputs=None)
Definition: node.py:25
nmtSample::DataReader::read
virtual int read(int samplesToRead, int maxInputSequenceLength, int *hInputData, int *hActualInputSequenceLengths)=0
Reads the batch of samples/sequences.
polygraphy.tools.surgeon.subtool.sanitize.Sanitize.add_parser_args
def add_parser_args(self, parser)
Definition: sanitize.py:27
SampleMovieLensParams::numUsers
int32_t numUsers
Definition: sampleMovieLens.cpp:65
tests.quant_rnn_test.TestQuantLSTMCell
Definition: quant_rnn_test.py:55
util.test_cuda.TestDeviceBuffer.test_device_buffer_resize
def test_device_buffer_resize(self, shapes)
Definition: test_cuda.py:41
FCPlugin::mDataType
nvinfer1::DataType mDataType
Definition: samples/opensource/samplePlugin/fcPlugin.h:311
LogStreamConsumer::LogStreamConsumer
LogStreamConsumer(LogStreamConsumer &&other)
Definition: pluginLogging.h:142
nvinfer1::plugin::RPROIParams
RPROIParams is used to create the RPROIPlugin instance.
Definition: NvInferPluginUtils.h:116
nmtSample::TextReader::read
int read(int samplesToRead, int maxInputSequenceLength, int *hInputData, int *hActualInputSequenceLengths) override
Reads the batch of samples/sequences.
Definition: textReader.cpp:32
nvinfer1::plugin::DetectionOutputParameters::topK
int32_t topK
Definition: NvInferPluginUtils.h:184
polygraphy.backend.trt_legacy.TrtLegacyRunner.infer_impl
def infer_impl(self, feed_dict)
Definition: backend/trt_legacy.py:334
nvinfer1::plugin::LReluPluginCreator::mFC
static PluginFieldCollection mFC
Definition: lReluPlugin.h:94
nvinfer1::DimensionType
DimensionType
The type of data encoded across this dimension.
Definition: NvInferRuntimeCommon.h:165
polygraphy.backend.pyt.runner.PytRunner.activate_impl
def activate_impl(self)
Definition: backend/pyt/runner.py:43
nvinfer1::RNNGateType::kHIDDEN
@ kHIDDEN
Hidden gate (h).
polygraphy.util.misc.is_shape_dynamic
def is_shape_dynamic(shape)
Definition: Polygraphy/polygraphy/util/misc.py:143
sample::BuildOptions::help
static void help(std::ostream &out)
Definition: sampleOptions.cpp:1167
builder.emb_layernorm
def emb_layernorm(builder, network, config, weights_dict, builder_config, sequence_lengths, batch_sizes)
Definition: builder.py:528
polygraphy.comparator.compare.CompareFunc
Definition: compare.py:58
nvinfer1::plugin::PriorBox::mParam
PriorBoxParameters mParam
Definition: priorBoxPlugin.h:92
ConcatTopKWorkSpace
Definition: maskRCNNKernels.h:189
polygraphy.tools.args.trt.loader.TrtLoaderArgs.trt_opt_shapes
trt_opt_shapes
Definition: tools/args/trt/loader.py:94
nvinfer1::plugin::ProposalLayerPluginCreator::deserializePlugin
IPluginV2Ext * deserializePlugin(const char *name, const void *data, size_t length) override
Called during deserialization of plugin layer.
Definition: proposalLayerPlugin.cpp:98
half_float::detail::fmax
half fmax(half x, half y)
Maximum of half expressions.
Definition: ieee_half.h:2289
SampleMLP::mParams
SampleMLPParams mParams
The parameters for the sample.
Definition: sampleMLP.cpp:85
nvinfer1::plugin::CropAndResizePlugin::mInputHeight
size_t mInputHeight
Definition: cropAndResizePlugin.h:101
half_float::detail::binary_specialized
Wrapper for binary half-precision functions needing specialization for individual argument types.
Definition: ieee_half.h:1051
samplesCommon::writeASCIIFile
bool writeASCIIFile(const std::string &fileName, const std::vector< T > &in)
Definition: samples/common/common.h:467
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer._if_quant
_if_quant
Definition: tensor_quantizer.py:84
nvinfer1::plugin::MultilevelCropAndResize::setPluginNamespace
void setPluginNamespace(const char *libNamespace) override
Set the namespace that this plugin object belongs to.
Definition: multilevelCropAndResizePlugin.cpp:145
tests.tensor_quant_test.TestFakeTensorQuant.test_per_channel_scale
def test_per_channel_scale(self)
Definition: tensor_quant_test.py:151
nvinfer1::plugin::PyramidROIAlign::configurePlugin
void configurePlugin(const Dims *inputDims, int nbInputs, const Dims *outputDims, int nbOutputs, const DataType *inputTypes, const DataType *outputTypes, const bool *inputIsBroadcast, const bool *outputIsBroadcast, PluginFormat floatFormat, int maxBatchSize) override
Definition: pyramidROIAlignPlugin.cpp:266
nvinfer1::plugin::GenerateDetection::mParam
RefineNMSParameters mParam
Definition: generateDetectionPlugin.h:104
nvinfer1::plugin::SplitPlugin::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
half_float::detail::functions::islessgreater
static bool islessgreater(half x, half y)
Comparison implementation.
Definition: samples/common/half.h:2456
tests.tensor_quantizer_test.TestTensorQuantizer.test_scale_amax
def test_scale_amax(self)
Definition: tensor_quantizer_test.py:117
nvinfer1::ReduceOperation::kSUM
@ kSUM
nvinfer1::plugin::reduced_divisor
Definition: reducedMathPlugin.h:54
bert::QKVToContextInterleavedPlugin::clone
nvinfer1::IPluginV2DynamicExt * clone() const override
Clone the plugin object.
Definition: qkvToContextInt8InterleavedPlugin.cpp:93
pytorch_quantization.nn.modules.quant_conv.QuantConvTranspose1d.forward
def forward(self, input, output_size=None)
Definition: quant_conv.py:297
nvinfer1::plugin::ProposalLayer::isOutputBroadcastAcrossBatch
bool isOutputBroadcastAcrossBatch(int outputIndex, const bool *inputIsBroadcasted, int nbInputs) const override
Definition: proposalLayerPlugin.cpp:337
comparator.test_comparator.TestComparator.test_multirun_outputs_are_different
def test_multirun_outputs_are_different(self)
Definition: test_comparator.py:122
nvinfer1::IAlgorithmContext::~IAlgorithmContext
virtual ~IAlgorithmContext()
Definition: NvInfer.h:6255
SampleMLP::verifyOutput
bool verifyOutput(const samplesCommon::BufferManager &buffers)
Classifies digits and verifies the result.
Definition: sampleMLP.cpp:330
nvinfer1::plugin::PyramidROIAlign::mFeatureLength
int mFeatureLength
Definition: pyramidROIAlignPlugin.h:96
nvinfer1::plugin::GenerateDetectionPluginCreator::mFC
static PluginFieldCollection mFC
Definition: generateDetectionPlugin.h:130
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer.enable_clip
def enable_clip(self)
Definition: tensor_quantizer.py:163
nvinfer1::plugin::CoordConvACPlugin::setPluginNamespace
void setPluginNamespace(const char *pluginNamespace) override
Set the namespace that this plugin object belongs to.
Definition: coordConvACPlugin.cpp:150
pytorch_quantization.nn.modules.clip.Clip.forward
def forward(self, inputs)
Definition: clip.py:56
bert::QKVToContextInterleavedPlugin::configurePlugin
void configurePlugin(const nvinfer1::DynamicPluginTensorDesc *in, int nbInputs, const nvinfer1::DynamicPluginTensorDesc *out, int nbOutputs) override
Definition: qkvToContextInt8InterleavedPlugin.cpp:147
UffPoolPluginV2::mMode
cudnnPoolingMode_t mMode
Definition: sampleUffPluginV2Ext.cpp:635
helpers.calibrator.BertCalibrator.cache_file
cache_file
Definition: demo/BERT/helpers/calibrator.py:33
model.Decoder.decoder_rnn_dim
decoder_rnn_dim
Definition: demo/Tacotron2/tacotron2/model.py:246
polygraphy.backend.trt.loader.ModifyNetwork
Definition: backend/trt/loader.py:183
nvinfer1::plugin::GroupNormalizationPlugin::getOutputDimensions
DimsExprs getOutputDimensions(int index, const nvinfer1::DimsExprs *inputs, int nbInputDims, nvinfer1::IExprBuilder &exprBuilder) override
Definition: groupNormalizationPlugin.cpp:82
SampleCharRNNWeightNames::names
std::unordered_set< std::string > names
Definition: sampleCharRNN.cpp:82
nvinfer1::IBuilderConfig
Holds properties for configuring a builder to produce an engine.
Definition: NvInfer.h:6464
sample::anonymous_namespace{sampleInference.cpp}::Iteration::mActive
std::vector< bool > mActive
Definition: sampleInference.cpp:466
SampleFasterRCNN::kIMG_W
static const int kIMG_W
Definition: sampleFasterRCNN.cpp:89
polygraphy.backend.onnx.runner.OnnxTfRunner.get_input_metadata
def get_input_metadata(self)
Definition: backend/onnx/runner.py:62
bert::Fused_multihead_attention_params::enable_i2f_trick
bool enable_i2f_trick
Definition: fused_multihead_attention.h:85
UffPoolPluginV2::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: sampleUffPluginV2Ext.cpp:542
nvinfer1::plugin::SplitPlugin::configurePlugin
void configurePlugin(const nvinfer1::DynamicPluginTensorDesc *in, int nbInputs, const nvinfer1::DynamicPluginTensorDesc *out, int nbOutputs) override
polygraphy.tools.args.trt.loader.TrtLoaderArgs.parse
def parse(self, args)
Definition: tools/args/trt/loader.py:84
polygraphy.comparator.comparator.Comparator.validate
def validate(run_results, check_finite=None, check_nan=None, fail_fast=None)
Definition: comparator/comparator.py:281
polygraphy.tools.args.comparator.ComparatorCompareArgs.validate
validate
Definition: tools/args/comparator.py:107
sample::SystemOptions
Definition: sampleOptions.h:134
sample::TrtCudaGraph::launch
void launch(TrtCudaStream &stream)
Definition: sampleDevice.h:192
nmtSample::LSTMEncoder::mWeights
ComponentWeights::ptr mWeights
Definition: lstmEncoder.h:50
models.get_model_config
def get_model_config(model_name, args)
Definition: demo/Tacotron2/models.py:85
pytorch_quantization.nn.modules.quant_bert.QuantBertSelfAttention.matmul_a_input_quantizer
matmul_a_input_quantizer
Definition: quant_bert.py:76
FCPlugin::convertAndCopyToBuffer
void convertAndCopyToBuffer(char *&buffer, const nvinfer1::Weights &weights)
Definition: samples/opensource/samplePlugin/fcPlugin.h:277
nvinfer1::plugin::MultilevelProposeROI::MultilevelProposeROI
MultilevelProposeROI(int prenms_topk, int keep_topk, float fg_threshold, float iou_threshold, const nvinfer1::Dims image_size)
Definition: multilevelProposeROIPlugin.cpp:111
samplesCommon::BufferManager::mEngine
std::shared_ptr< nvinfer1::ICudaEngine > mEngine
The pointer to the engine.
Definition: buffers.h:455
test_examples.Example.artifacts
artifacts
Definition: Polygraphy/tests/test_examples.py:57
nvinfer1::LayerType::kUNARY
@ kUNARY
UnaryOp operation Layer.
tests.quant_conv_transposed_test.TestQuantConvTranspose3D.test_fake_quant_per_channel_bias
def test_fake_quant_per_channel_bias(self)
Definition: quant_conv_transposed_test.py:286
nvinfer1::plugin::DetectionLayer::supportsFormat
bool supportsFormat(DataType type, PluginFormat format) const override
Check format support.
Definition: detectionLayerPlugin.cpp:143
polygraphy.backend.pyt.runner.PytRunner.infer_impl
def infer_impl(self, feed_dict)
Definition: backend/pyt/runner.py:48
SampleFasterRCNN
The SampleFasterRCNN class implements the FasterRCNN sample.
Definition: sampleFasterRCNN.cpp:55
bert::EmbLayerNormVarSeqlenPlugin::setPluginNamespace
void setPluginNamespace(const char *pluginNamespace) override
Set the namespace that this plugin object belongs to.
Definition: embLayerNormVarSeqlenPlugin.cpp:407
SampleFasterRCNNParams
The SampleFasterRCNNParams structure groups the additional parameters required by the FasterRCNN samp...
Definition: sampleFasterRCNN.cpp:45
sample::Logger::Logger
Logger(Severity severity=Severity::kWARNING)
Definition: samples/common/logging.h:209
ParserOnnxConfig
Configuration Manager Class Concrete Implementation.
Definition: parserOnnxConfig.h:40
nmtSample::LSTMEncoder::mGateKernelWeights
std::vector< nvinfer1::Weights > mGateKernelWeights
Definition: lstmEncoder.h:51
polygraphy.logger.logger.Logger.start
def start(self, message, mode=LogMode.EACH)
Definition: Polygraphy/polygraphy/logger/logger.py:270
UffPoolPluginV2Creator::mPluginName
std::string mPluginName
Definition: sampleUffPluginV2Ext.cpp:690
nvinfer1::UnaryOperation::kNEG
@ kNEG
Negation.
nvinfer1::plugin::GenerateDetection::getOutputDataType
DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: generateDetectionPlugin.cpp:296
nvinfer1::plugin::CoordConvACPlugin::mPluginNamespace
const char * mPluginNamespace
Definition: coordConvACPlugin.h:88
tests.quant_conv_test.TestQuantConv2D.test_set_default_quant_desc
def test_set_default_quant_desc(self)
Definition: quant_conv_test.py:237
isolate.dtype
dtype
Definition: isolate.py:54
half_float::detail::binary
const binary_t binary
Tag for binary construction.
Definition: ieee_half.h:349
polygraphy.backend.tf.util.get_graph_output_names
def get_graph_output_names(graph)
Definition: tf/util.py:157
test_ir.build_two_layer_graph
def build_two_layer_graph()
Definition: test_ir.py:347
polygraphy.backend.trt.runner.TrtRunner.__init__
def __init__(self, engine, name=None)
Definition: backend/trt/runner.py:34
polygraphy.comparator.struct.AccuracyResult
Definition: comparator/struct.py:128
polygraphy.comparator.data_loader
Definition: comparator/data_loader.py:1
nvinfer1::plugin::DetectionLayer::destroy
void destroy() override
Destroy the plugin object.
Definition: detectionLayerPlugin.cpp:138
nmtSample::BeamSearchPolicy::readGeneratedResult
void readGeneratedResult(int sampleCount, int maxOutputSequenceLength, int *hOutputData, int *hActualOutputSequenceLengths)
Definition: beamSearchPolicy.cpp:129
nvinfer1::plugin::CropAndResizePluginCreator
Definition: cropAndResizePlugin.h:105
OutputParams::expectedPredictedMaxRatingItem
int32_t expectedPredictedMaxRatingItem
Definition: sampleMovieLens.cpp:52
perf.DeviceBuffer.buf
buf
Definition: perf.py:30
nvcaffeparser1::CaffeWeightFactory::mDataType
nvinfer1::DataType mDataType
Definition: caffeWeightFactory.h:61
sample::BaseModelOptions
Definition: sampleOptions.h:74
nvinfer1::plugin::RPROIPluginCreator::RPROIPluginCreator
RPROIPluginCreator()
Definition: nvFasterRCNNPlugin.cpp:311
polygraphy.tools.args.onnx.runner.OnnxtfRunnerArgs.check_registered
def check_registered(self)
Definition: tools/args/onnx/runner.py:28
polygraphy.backend.tf.loader.SaveGraph.__call__
def __call__(self)
Definition: backend/tf/loader.py:328
nvinfer1::EnumMax< TacticSource >
constexpr int32_t EnumMax< TacticSource >()
Maximum number of tactic sources in TacticSource enum.
Definition: NvInfer.h:6446
nvinfer1::ElementWiseOperation::kLESS
@ kLESS
Check if element in first tensor is less than corresponding element in second tensor.
SampleMovieLens::readNextLine
std::string readNextLine(std::ifstream &file, char delim)
Helper function to read the next line of the MovieLens dataset .csv file and return the contents of t...
Definition: sampleMovieLens.cpp:361
SampleMNISTAPI::mNumber
int mNumber
The number to classify.
Definition: sampleMNISTAPI.cpp:89
polygraphy.util.misc.unique_list
def unique_list(sequence)
Definition: Polygraphy/polygraphy/util/misc.py:60
nvinfer1::plugin::GridAnchorParameters::aspectRatios
float * aspectRatios
Definition: NvInferPluginUtils.h:146
trt_utils.is_shape_dynamic
def is_shape_dynamic(shape)
Definition: trt_utils.py:24
polygraphy.backend.tf.runner.TfRunner.run_options
run_options
Definition: backend/tf/runner.py:55
onnx_graphsurgeon.logger.logger.Logger.logger_callbacks
logger_callbacks
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:108
pytorch_quantization.calib.calibrator
Definition: tools/pytorch-quantization/pytorch_quantization/calib/calibrator.py:1
half_float::detail::functions::atanh
static expr atanh(float arg)
Hyperbolic area tangent implementation.
Definition: samples/common/half.h:1994
nmtSample::SoftmaxLikelihood::SoftmaxLikelihoodCombinationOperator::~SoftmaxLikelihoodCombinationOperator
~SoftmaxLikelihoodCombinationOperator() override=default
sample::Bindings::dumpOutputs
void dumpOutputs(const nvinfer1::IExecutionContext &context, std::ostream &os) const
Definition: sampleUtils.h:389
nvinfer1::plugin::GroupNormalizationPluginCreator::createPlugin
IPluginV2DynamicExt * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: groupNormalizationPlugin.cpp:285
polygraphy.tools.surgeon.subtool.base.BaseSurgeonSubtool
Definition: surgeon/subtool/base.py:27
SampleErrorRecorder::indexCheck
bool indexCheck(int32_t index) const noexcept
Definition: ErrorRecorder.h:117
nvinfer1::plugin::BatchedNMSPlugin::destroy
void destroy() override
Destroy the plugin object.
Definition: batchedNMSPlugin.cpp:375
nvinfer1::PluginRegistrar::PluginRegistrar
PluginRegistrar()
Definition: NvInferRuntimeCommon.h:1400
tools.test_run.TestTrt.test_trt_plugins
def test_trt_plugins(self)
Definition: test_run.py:65
export_waveglow_onnx.convert_convinv_1d_to_2d
def convert_convinv_1d_to_2d(convinv)
Definition: export_waveglow_onnx.py:42
SampleMLPParams::inputH
int inputH
The input height.
Definition: sampleMLP.cpp:47
polygraphy.backend.base.runner.BaseRunner.activate_impl
def activate_impl(self)
Definition: backend/base/runner.py:75
backend.trt.test_loader.TestModifyNetwork.test_unmark_shape_outputs
def test_unmark_shape_outputs(self, reshape_network)
Definition: trt/test_loader.py:154
polygraphy.util.misc.num_dynamic_dimensions
def num_dynamic_dimensions(shape)
Definition: Polygraphy/polygraphy/util/misc.py:139
helpers.calibrator.BertCalibrator
Definition: demo/BERT/helpers/calibrator.py:27
test_ir.TestNode.test_o
def test_o(self)
Definition: test_ir.py:205
pytorch_quantization.nn._functions.quant_rnn.VariableRecurrentReverse
def VariableRecurrentReverse(inner)
Definition: _functions/quant_rnn.py:198
SampleConfig::setReferenceFileName
void setReferenceFileName(const char *referenceFilename)
Definition: sampleConfig.h:245
BatchStream::BatchStream
BatchStream(int batchSize, int maxBatches, std::string prefix, std::string suffix, std::vector< std::string > directories)
Definition: BatchStream.h:157
pytorch_quantization.calib.max.MaxCalibrator.collect
def collect(self, x)
Definition: max.py:50
nvinfer1::plugin::MultilevelProposeROI::mParam
RefineNMSParameters mParam
Definition: multilevelProposeROIPlugin.h:115
bert::EmbLayerNormVarSeqlenPluginCreator::mNamespace
std::string mNamespace
Definition: embLayerNormVarSeqlenPlugin.h:137
nvinfer1::plugin::CropAndResizePlugin::CropAndResizePlugin
CropAndResizePlugin()=delete
nvinfer1::plugin::Region::Region
Region(RegionParameters params)
Definition: regionPlugin.cpp:79
nvcaffeparser1::gParseTable
static std::unordered_map< std::string, LayerParseFn > gParseTable
Definition: opParsers.h:75
nvinfer1::ILayer::getName
virtual const char * getName() const =0
Return the name of a layer.
samplesCommon::BBox::x2
float x2
Definition: samples/common/common.h:669
bert::fused_multihead_attention_v2_fp16_128_64_kernel_sm80_cubin_len
unsigned int fused_multihead_attention_v2_fp16_128_64_kernel_sm80_cubin_len
Definition: fused_multihead_attention_v2_fp16_128_64_kernel.sm80.cpp:6078
SampleErrorRecorder::reportError
bool reportError(ErrorCode val, IErrorRecorder::ErrorDesc desc) noexcept final
Report an error to the error recorder with the corresponding enum and description.
Definition: ErrorRecorder.h:85
polygraphy.tools.args.base.BaseArgs
Definition: args/base.py:17
nvinfer1::PluginVersion::kV2
@ kV2
generate.concat
def concat(self, inputs, axis=0)
Definition: 09_shape_operations_with_the_layer_api/generate.py:46
polygraphy.backend.base
Definition: tools/Polygraphy/polygraphy/backend/base/__init__.py:1
nvcaffeparser1::CaffeWeightFactory::operator()
virtual nvinfer1::Weights operator()(const std::string &layerName, WeightType weightType)
Definition: caffeWeightFactory.cpp:148
polygraphy.tools.precision.precision.STCheckerBase.check_network
def check_network(self, suffix)
Definition: precision.py:118
polygraphy.tools.args.onnx.loader.OnnxLoaderArgs.add_onnx_loader
def add_onnx_loader(self, script, disable_outputs=None, suffix=None)
Definition: tools/args/onnx/loader.py:88
data_function.TextMelLoader.max_wav_value
max_wav_value
Definition: tacotron2/data_function.py:35
sample::anonymous_namespace{sampleOptions.cpp}::printBatch
std::ostream & printBatch(std::ostream &os, int maxBatch)
Definition: sampleOptions.cpp:327
tests.quant_pooling_test.TestQuantMaxPool3d
Definition: quant_pooling_test.py:107
nvinfer1::plugin::PyramidROIAlignPluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: pyramidROIAlignPlugin.cpp:48
nvinfer1::plugin::MultilevelProposeROI::destroy
void destroy() override
Destroy the plugin object.
Definition: multilevelProposeROIPlugin.cpp:201
sample::anonymous_namespace{sampleEngines.cpp}::RndInt8Calibrator::mCalibrationCache
std::vector< char > mCalibrationCache
Definition: sampleEngines.cpp:187
nvinfer1::plugin::PyramidROIAlignPluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: pyramidROIAlignPlugin.cpp:43
MaskRCNNUtils::preprocessPPM
void preprocessPPM(PPM< uint8_t > &src, PPM< uint8_t > &dst, int target_h, int target_w)
Definition: sampleUffMaskRCNN.cpp:161
SampleCharRNNBase::loadWeights
std::map< std::string, nvinfer1::Weights > loadWeights(const std::string file)
Load requested weights from a formatted file into a map.
Definition: sampleCharRNN.cpp:354
nvinfer1::plugin::FlattenConcat::supportsFormat
bool supportsFormat(DataType type, PluginFormat format) const override
Check format support.
Definition: flattenConcat.cpp:265
tests.tensor_quant_test.TestFakeTensorQuant.test_cuda_ext_inplace
def test_cuda_ext_inplace(self)
Definition: tensor_quant_test.py:230
polygraphy.comparator.data_loader.DataLoader.user_input_metadata
user_input_metadata
Definition: comparator/data_loader.py:88
half_float::detail::cbrt
expr cbrt(half arg)
Cubic root.
Definition: ieee_half.h:2386
sample::TrtCudaGraph::mGraph
cudaGraph_t mGraph
Definition: sampleDevice.h:205
nvinfer1::plugin::Region::supportsFormat
bool supportsFormat(DataType type, PluginFormat format) const override
Check format support.
Definition: regionPlugin.cpp:371
pytorch_quantization.nn.modules._utils.QuantMixin._weight_quantizer
_weight_quantizer
Definition: _utils.py:81
nmtSample::Encoder::Encoder
Encoder()=default
onnx_graphsurgeon.logger.logger.LoggerSuppress.severity
severity
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:45
Logger::reportWaive
static int reportWaive(const TestAtom &testAtom)
Definition: pluginLogging.h:380
Logger::reportTestEnd
static void reportTestEnd(const TestAtom &testAtom, TestResult result)
Report that a test has ended.
Definition: pluginLogging.h:361
sample::anonymous_namespace{sampleInference.cpp}::EnqueueImplicit::operator()
void operator()(TrtCudaStream &stream) const
Definition: sampleInference.cpp:201
onnx_graphsurgeon.util.misc.SynchronizedList.__iadd__
def __iadd__(self, List[object] other_list)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/util/misc.py:133
nvinfer1::plugin::Reorg::H
int H
Definition: reorgPlugin.h:87
nvinfer1::plugin::BatchTilePlugin::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: batchTilePlugin.cpp:152
TLTMaskRCNNConfig::MODEL_NAME
static const std::string MODEL_NAME
Definition: tlt_mrcnn_config.h:170
nvinfer1::plugin::MultilevelProposeROI::configurePlugin
void configurePlugin(const Dims *inputDims, int nbInputs, const Dims *outputDims, int nbOutputs, const DataType *inputTypes, const DataType *outputTypes, const bool *inputIsBroadcast, const bool *outputIsBroadcast, PluginFormat floatFormat, int maxBatchSize) override
Definition: multilevelProposeROIPlugin.cpp:449
bert::fused_multihead_attention_v2_int8_384_64_kernel_sm80_cubin_len
unsigned int fused_multihead_attention_v2_int8_384_64_kernel_sm80_cubin_len
Definition: fused_multihead_attention_v2_int8_384_64_kernel.sm80.cpp:17244
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer.enable_calib
def enable_calib(self)
Definition: tensor_quantizer.py:176
nvinfer1::utils::reshapeWeights
bool reshapeWeights(const Weights &input, const int32_t *shape, const int32_t *shapeOrder, void *data, int32_t nbDims)
Reformat the input weights of the given shape based on the new order of dimensions.
polygraphy.backend.tf.loader.CreateConfig.allow_growth
allow_growth
Definition: backend/tf/loader.py:370
nvinfer1::IResizeLayer::~IResizeLayer
virtual ~IResizeLayer()
Definition: NvInfer.h:4483
sample::printTiming
void printTiming(const std::vector< InferenceTime > &timings, int runsPerAvg, std::ostream &os)
Print a timing trace.
Definition: sampleReporting.cpp:85
SampleOnnxMnistCoordConvAC
The SampleOnnxMnistCoordConvAC class implements the ONNX MNIST sample.
Definition: sampleOnnxMnistCoordConvAC.cpp:52
nvinfer1::plugin::LReluPluginCreator::negSlope
float negSlope
Definition: lReluPlugin.h:95
UffPoolPluginV2::mCudnn
cudnnHandle_t mCudnn
Definition: sampleUffPluginV2Ext.cpp:631
sample::anonymous_namespace{sampleOptions.cpp}::checkEraseRepeatedOption
bool checkEraseRepeatedOption(Arguments &arguments, const std::string &option, std::vector< T > &values)
Definition: sampleOptions.cpp:204
nvinfer1::plugin::MultilevelProposeROIPluginCreator
Definition: multilevelProposeROIPlugin.h:120
nvinfer1::plugin::BatchedNMSPlugin::mClipBoxes
bool mClipBoxes
Definition: batchedNMSPlugin.h:72
anonymous_namespace{serialize.hpp}::Serializer< T, typename std::enable_if< std::is_arithmetic< T >::value||std::is_enum< T >::value||std::is_pod< T >::value >::type >::serialized_size
static size_t serialized_size(T const &value)
Definition: serialize.hpp:47
sample::anonymous_namespace{sampleInference.cpp}::EventType::kCOMPUTE_E
@ kCOMPUTE_E
sample::defaultAvgTiming
constexpr int defaultAvgTiming
Definition: sampleOptions.h:38
bert::FusedMultiHeadAttentionKernelMetaInfoV1::mCubin
const unsigned char * mCubin
Definition: fused_multihead_attention.h:121
SampleSSD::verifyOutput
bool verifyOutput(const samplesCommon::BufferManager &buffers)
Filters output detections and verifies results.
Definition: sampleSSD.cpp:316
Logger::mReportableSeverity
Severity mReportableSeverity
Definition: demo/BERT/infer_c/logging.h:75
profiler.totalData
def totalData(features, profile)
Definition: profiler.py:53
polygraphy.tools.args.logger.LoggerArgs.log_format
log_format
Definition: Polygraphy/polygraphy/tools/args/logger.py:34
tests.functional_test.TestClip.test_backward
def test_backward(self)
Definition: functional_test.py:69
text.cleaners.lowercase
def lowercase(text)
Definition: cleaners.py:71
nvinfer1::PluginFieldCollection::nbFields
int32_t nbFields
Number of PluginField entries.
Definition: NvInferRuntimeCommon.h:878
nvcaffeparser1::IPluginFactory
Plugin factory used to configure plugins.
Definition: NvCaffeParser.h:88
backend.tf.test_loader.TestFrozenGraphLoader.test_load_pb
def test_load_pb(self)
Definition: tf/test_loader.py:45
helpers.calibrator.BertCalibrator.write_calibration_cache
def write_calibration_cache(self, cache)
Definition: demo/BERT/helpers/calibrator.py:95
nvinfer1::plugin::GridAnchorGenerator::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: gridAnchorPlugin.cpp:191
MaskRCNNConfig::MODEL_OUTPUTS
static const std::vector< std::string > MODEL_OUTPUTS
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:173
nvinfer1::plugin::GridAnchorGenerator::clone
IPluginV2Ext * clone() const override
Clone the plugin object.
Definition: gridAnchorPlugin.cpp:344
pytorch_quantization.calib.max.MaxCalibrator.reset
def reset(self)
Definition: max.py:84
tests.calibrator_test.TestMSECalibrator
Definition: calibrator_test.py:196
comparator.test_struct.run_results
def run_results()
Definition: comparator/test_struct.py:14
nvonnxparser::ErrorCode::kMODEL_DESERIALIZE_FAILED
@ kMODEL_DESERIALIZE_FAILED
nvcaffeparser1::ICaffeParser::setPluginFactoryV2
virtual void setPluginFactoryV2(IPluginFactoryV2 *factory)=0
Set the IPluginFactoryV2 used to create the user defined pluginV2 objects.
train.validate
def validate(model, criterion, valset, epoch, batch_iter, batch_size, world_size, collate_fn, distributed_run, rank, batch_to_gpu)
Definition: train.py:260
SampleCharRNNBase::SampleCharRNNBase
SampleCharRNNBase(const SampleCharRNNParams &params)
Definition: sampleCharRNN.cpp:148
pytorch_quantization.nn._functions.quant_rnn.VariableRecurrent
def VariableRecurrent(inner)
Definition: _functions/quant_rnn.py:156
polygraphy.tools.args.trt_legacy.TrtLegacyArgs.trt_runner_args
trt_runner_args
Definition: tools/args/trt_legacy.py:50
nvinfer1::UnaryOperation::kACOS
@ kACOS
Inverse cosine.
ProposalWorkSpace::sortNMSMarkDims
nvinfer1::DimsHW sortNMSMarkDims
Definition: maskRCNNKernels.h:137
nmtSample::BLEUScoreWriter::getInfo
std::string getInfo() override
Get the textual description of the component.
Definition: bleuScoreWriter.cpp:224
polygraphy.backend.trt_legacy.TrtLegacyRunner.stream
stream
Definition: backend/trt_legacy.py:312
bert::DATA_TYPE_FP16
@ DATA_TYPE_FP16
Definition: fused_multihead_attention_common.h:27
SampleUffFasterRcnnParams::outputProposalName
std::string outputProposalName
Definition: sampleUffFasterRCNN.cpp:65
nvinfer1::plugin::CoordConvACPlugin::CoordConvACPlugin
CoordConvACPlugin()
Definition: coordConvACPlugin.cpp:34
tests.tensor_quant_test.TestFakeTensorQuant.test_full_range
def test_full_range(self)
Definition: tensor_quant_test.py:259
nvinfer1::CUDADriverWrapper
Definition: cudaDriverWrapper.h:30
tests.quant_conv_test.TestQuantConv2D.test_in_feature_fake_quant
def test_in_feature_fake_quant(self)
Definition: quant_conv_test.py:112
half_float::detail::functions::acos
static expr acos(float arg)
Arc cosine implementation.
Definition: samples/common/half.h:1918
polygraphy.util.misc.shapes_match
def shapes_match(shape0, shape1)
Definition: Polygraphy/polygraphy/util/misc.py:157
half_float::detail::enable< T, expr, expr, expr >::type
T type
Definition: ieee_half.h:386
tests.classification_flow_test.TestClassificationFlow.test_resnet50
def test_resnet50(self, request, pytestconfig)
Definition: classification_flow_test.py:29
sample::anonymous_namespace{sampleOptions.cpp}::insertShapesBuild
void insertShapesBuild(std::unordered_map< std::string, ShapeRange > &shapes, nvinfer1::OptProfileSelector selector, const std::string &name, const std::vector< int > &dims)
Definition: sampleOptions.cpp:217
text.cleaners.expand_abbreviations
def expand_abbreviations(text)
Definition: cleaners.py:61
polygraphy.tools.args.tf2onnx.loader.Tf2OnnxLoaderArgs.check_registered
def check_registered(self)
Definition: tools/args/tf2onnx/loader.py:36
SampleCharRNNWeightNames::RNNW_L1_NAME
const std::string RNNW_L1_NAME
Definition: sampleCharRNN.cpp:75
nvinfer1::UnaryOperation::kCOS
@ kCOS
Cosine.
model.Tacotron2.__init__
def __init__(self, mask_padding, n_mel_channels, n_symbols, symbols_embedding_dim, encoder_kernel_size, encoder_n_convolutions, encoder_embedding_dim, attention_rnn_dim, attention_dim, attention_location_n_filters, attention_location_kernel_size, n_frames_per_step, decoder_rnn_dim, prenet_dim, max_decoder_steps, gate_threshold, p_attention_dropout, p_decoder_dropout, postnet_embedding_dim, postnet_kernel_size, postnet_n_convolutions, decoder_no_early_stopping)
Definition: demo/Tacotron2/tacotron2/model.py:588
layers.TacotronSTFT
Definition: layers.py:57
nmtSample::DataWriter::generateText
static std::string generateText(int sequenceLength, const int *currentOutputData, Vocabulary::ptr vocabulary)
Definition: dataWriter.cpp:23
bert::SkipLayerNormInterleavedPlugin::SkipLayerNormInterleavedPlugin
SkipLayerNormInterleavedPlugin()=delete
tools.test_inspect.TestInspectData
Definition: test_inspect.py:291
nvinfer1::IExprBuilder::operation
virtual const IDimensionExpr * operation(DimensionOperation op, const IDimensionExpr &first, const IDimensionExpr &second)=0
Return pointer to IDimensionExpr that represents the given operation applied to first and second.
nvinfer1::IAlgorithmContext
Describes the context and requirements, that could be fulfilled by one or more instances of IAlgorith...
Definition: NvInfer.h:6227
SampleOnnxMNIST::build
bool build()
Function builds the network engine.
Definition: sampleOnnxMNIST.cpp:103
half_float::detail::half_caster
Helper class for half casts.
Definition: ieee_half.h:1052
polygraphy.comparator.data_loader.DataLoaderCache.input_metadata
input_metadata
Definition: comparator/data_loader.py:274
pytorch_quantization.nn.modules.quant_rnn.QuantRNNBase.dropout
dropout
Definition: modules/quant_rnn.py:51
nvinfer1::plugin::CoordConvACPlugin::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: coordConvACPlugin.cpp:129
pytorch_quantization.nn.modules.quant_rnn.QuantRNN.__init__
def __init__(self, *args, **kwargs)
Definition: modules/quant_rnn.py:274
BertInference::mSeqLength
const int mSeqLength
Definition: bert_infer.h:337
Logger
Class which manages logging of TensorRT tools and samples.
Definition: demo/BERT/infer_c/logging.h:28
nvinfer1::IAlgorithmIOInfo::~IAlgorithmIOInfo
virtual ~IAlgorithmIOInfo()
Definition: NvInfer.h:6188
polygraphy.backend.trt.loader.EngineFromNetwork._network
_network
Definition: backend/trt/loader.py:406
nvinfer1::plugin::BatchTilePlugin::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: batchTilePlugin.cpp:178
tests.tensor_quant_test.TestTensorQuant.test_clip_gradient
def test_clip_gradient(self)
Definition: tensor_quant_test.py:115
builder.bert_model
def bert_model(config, init_dict, network, input_tensor, input_mask)
Definition: builder.py:323
nvcaffeparser1::WeightType::kNVVARIANCE
@ kNVVARIANCE
nvinfer1::plugin::GenerateDetectionPluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: generateDetectionPlugin.cpp:61
nvinfer1::plugin::RPROIParams::spatialScale
float spatialScale
Definition: NvInferPluginUtils.h:127
nvinfer1::plugin::PyramidROIAlignPluginCreator::~PyramidROIAlignPluginCreator
~PyramidROIAlignPluginCreator()
Definition: pyramidROIAlignPlugin.h:109
nvinfer1::plugin::ProposalLayer::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: proposalLayerPlugin.cpp:255
onnx_graphsurgeon.util.misc.SynchronizedList._remove_from_elem
def _remove_from_elem(self, elem)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/util/misc.py:81
SampleUffSSDParams::uffFileName
std::string uffFileName
The file name of the UFF model to use.
Definition: sampleUffSSD.cpp:50
nvinfer1::plugin::Normalize::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: normalizePlugin.cpp:123
nvinfer1::IPluginV2DynamicExt::getWorkspaceSize
virtual size_t getWorkspaceSize(const PluginTensorDesc *inputs, int32_t nbInputs, const PluginTensorDesc *outputs, int32_t nbOutputs) const =0
Find the workspace size required by the layer.
Logger::reportTest
static int reportTest(const TestAtom &testAtom, bool pass)
Definition: pluginLogging.h:386
SampleSSDParams::calibrationBatches
std::string calibrationBatches
The path to calibration batches.
Definition: sampleSSD.cpp:53
utils
Definition: demo/Tacotron2/common/utils.py:1
model.Invertible1x1Conv
Definition: demo/Tacotron2/waveglow/model.py:32
bert::fused_multihead_attention_v2_fp16_64_64_kernel_sm80_cubin_len
unsigned int fused_multihead_attention_v2_fp16_64_64_kernel_sm80_cubin_len
Definition: fused_multihead_attention_v2_fp16_64_64_kernel.sm80.cpp:1653
polygraphy.backend.onnx.runner.OnnxTfRunner
Definition: backend/onnx/runner.py:25
test_examples.Example.path
path
Definition: Polygraphy/tests/test_examples.py:56
nvinfer1::IExecutionContext::~IExecutionContext
virtual ~IExecutionContext() noexcept
Definition: NvInferRuntime.h:1660
nvinfer1::plugin::Normalize::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: normalizePlugin.cpp:100
nvinfer1::IAlgorithmIOInfo::getTensorFormat
virtual TensorFormat getTensorFormat() const =0
Return TensorFormat of the input/output of algorithm.
polygraphy.backend.onnxrt.runner.OnnxrtRunner.deactivate_impl
def deactivate_impl(self)
Definition: backend/onnxrt/runner.py:43
sample::InferenceOptions::help
static void help(std::ostream &out)
Definition: sampleOptions.cpp:1241
nvcaffeparser1::CaffeWeightFactory::mTmpAllocs
std::vector< void * > & mTmpAllocs
Definition: caffeWeightFactory.h:60
pytorch_quantization.nn.modules.quant_bert.QuantBertSelfAttention.value
value
Definition: quant_bert.py:66
polygraphy.backend.trt.calibrator
Definition: tools/Polygraphy/polygraphy/backend/trt/calibrator.py:1
half_float::detail::unary_specialized< expr >::fabs
static expr fabs(float arg)
Definition: samples/common/half.h:2536
comparator.test_data_loader.TestDataLoaderCache.test_can_cast_dtype
def test_can_cast_dtype(self)
Definition: test_data_loader.py:99
polygraphy.comparator.struct.AccuracyResult._get_runner_pair
def _get_runner_pair(self, runner_pair)
Definition: comparator/struct.py:169
polygraphy.tools.args.trt_legacy.TrtLegacyArgs
Definition: tools/args/trt_legacy.py:24
classification_flow.compute_amax
def compute_amax(model, **kwargs)
Definition: classification_flow.py:400
MaskRCNNConfig::FPN_CLASSIF_FC_LAYERS_SIZE
static const int FPN_CLASSIF_FC_LAYERS_SIZE
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:54
nvinfer1::plugin::ProposalLayerPluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: proposalLayerPlugin.h:138
sample::Logger::severityOstream
static std::ostream & severityOstream(Severity severity)
returns an appropriate output stream (cout or cerr) to use with the given severity
Definition: samples/common/logging.h:410
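The brief above says only that the function picks cout or cerr based on severity. A minimal Python sketch of that kind of selection follows. The threshold (informational and verbose messages to standard output, everything more severe to standard error) is an assumption made for illustration, as are the names Severity and severity_stream; the enum order mirrors nvinfer1::ILogger::Severity.

```python
import sys
from enum import IntEnum


class Severity(IntEnum):
    # Lower value means more severe, mirroring nvinfer1::ILogger::Severity.
    INTERNAL_ERROR = 0
    ERROR = 1
    WARNING = 2
    INFO = 3
    VERBOSE = 4


def severity_stream(severity):
    """Pick an output stream for a message of the given severity.

    Assumed rule (not confirmed by the index entry): INFO and VERBOSE
    go to stdout, anything more severe goes to stderr.
    """
    return sys.stdout if severity >= Severity.INFO else sys.stderr


if __name__ == "__main__":
    print("engine built", file=severity_stream(Severity.INFO))
    print("missing weights", file=severity_stream(Severity.ERROR))
```

Routing warnings and errors to a separate stream keeps them visible when normal output is redirected to a file.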
SampleAlgorithmSelector::build
bool build(IAlgorithmSelector *selector)
Builds the network engine.
Definition: sampleAlgorithmSelector.cpp:375
nvinfer1::plugin::GridAnchorParameters::maxSize
float maxSize
Definition: NvInferPluginUtils.h:145
half_float::detail::functions::cos
static expr cos(float arg)
Cosine implementation.
Definition: samples/common/half.h:1894
polygraphy.tools.args.data_loader.DataLoaderArgs.load_inputs
load_inputs
Definition: tools/args/data_loader.py:63
polygraphy.tools.util.script.Script.loader_count
loader_count
Definition: script.py:138
polygraphy.tools.precision.precision.STCheckerBase.add_parser_args
def add_parser_args(self, parser, mode=True)
Definition: precision.py:43
FCPlugin::write
void write(char *&buffer, const T &val)
Definition: samples/opensource/samplePlugin/fcPlugin.h:230
nvinfer1::RNNOperation::kGRU
@ kGRU
Three-gate network consisting of Gated Recurrent Units.
inference_c.print_single_query
def print_single_query(eval_time_elapsed, prediction, nbest_json)
Definition: inference_c.py:182
nvinfer1::UnaryOperation::kNOT
@ kNOT
Logical NOT.
model.WaveGlow.convinv
convinv
Definition: demo/Tacotron2/waveglow/model.py:170
tests.calibrator_test.TestHistogramCalibrator.test_grow
def test_grow(self, verbose)
Definition: calibrator_test.py:96
nvinfer1::plugin::DetectionOutput::enqueue
int enqueue(int batchSize, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: nmsPlugin.cpp:103
nmtSample::SoftmaxLikelihood::SoftmaxLikelihoodCombinationOperator::smallerThanMinimalLikelihood
float smallerThanMinimalLikelihood() const override
Definition: softmaxLikelihood.cpp:93
polygraphy.common.exception
Definition: Polygraphy/polygraphy/common/exception.py:1
SampleINT8::teardown
bool teardown()
Cleans up any state created in the sample class.
Definition: sampleINT8.cpp:357
nvinfer1::plugin::GridAnchorGenerator::mNumPriors
int * mNumPriors
Definition: gridAnchorPlugin.h:97
SamplePlugin::mInputDims
nvinfer1::Dims mInputDims
The dimensions of the input to the network.
Definition: samplePlugin.cpp:114
nvinfer1::IAlgorithmVariant::~IAlgorithmVariant
virtual ~IAlgorithmVariant()
Definition: NvInfer.h:6216
setup.main
def main()
Definition: onnx-graphsurgeon/setup.py:33
comparator.test_compare.TestBasicCompareFunc.test_non_matching_outputs
def test_non_matching_outputs(self)
Definition: test_compare.py:86
polygraphy.tools.util.script.Script.append_preimport
def append_preimport(self, line)
Definition: script.py:201
nmtSample::MultiplicativeAlignment::getSourceStatesSize
int getSourceStatesSize() override
get the size of the source states
Definition: multiplicativeAlignment.cpp:66
models.meta.Model.check_runner
check_runner
Definition: meta.py:34
sample::TrtCudaBuffer::get
void * get() const
Definition: sampleDevice.h:264
nvinfer1::plugin::FlattenConcat::setPluginNamespace
void setPluginNamespace(const char *pluginNamespace) override
Set the namespace that this plugin object belongs to.
Definition: flattenConcat.cpp:210
bert::embSkipLayerNorm2
int embSkipLayerNorm2(cudaStream_t stream, int ld, int B, int S, const int *inputIds, const int *tokenIds, const int *cuSeqlens, const float *beta, const float *gamma, const T *wordEmb, const T *posEmb, const T *tokEmb, T *output)
half_float::detail::conditional
Conditional type.
Definition: ieee_half.h:296
BatchStream::mBatch
std::vector< float > mBatch
Data for the batch.
Definition: BatchStream.h:379
polygraphy.tools.base
Definition: tools/Polygraphy/polygraphy/tools/base/__init__.py:1
UffPoolPluginV2Creator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: sampleUffPluginV2Ext.cpp:658
anonymous_namespace{regionPlugin.cpp}::safeFree
void safeFree(T *ptr)
Definition: regionPlugin.cpp:31
sample::HostAllocator
Definition: sampleDevice.h:289
bert::EmbLayerNormVarSeqlenPlugin::mPosVocabSize
size_t mPosVocabSize
Definition: embLayerNormVarSeqlenPlugin.h:95
test_infer.main
def main()
Definition: test_infer.py:110
test_func.TestExtend.test_extend_1_args_0_rv
def test_extend_1_args_0_rv(self)
Definition: test_func.py:69
sample::ReportingOptions::help
static void help(std::ostream &out)
Definition: sampleOptions.cpp:1278
samplesCommon::DeviceFree::operator()
void operator()(void *ptr) const
Definition: buffers.h:191
sample::LayerProfile::name
std::string name
Definition: sampleReporting.h:160
pytorch_quantization.tensor_quant.ScaledQuantDescriptor._learn_amax
_learn_amax
Definition: tensor_quant.py:88
MaskRCNNConfig::CLASS_NAMES
static const std::vector< std::string > CLASS_NAMES
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:86
nvinfer1::PluginFieldType::kFLOAT32
@ kFLOAT32
FP32 field type.
polygraphy.tools.util.script.Script.prefix
prefix
Definition: script.py:141
nvinfer1::plugin::SplitPluginCreator::~SplitPluginCreator
~SplitPluginCreator()
Definition: split.h:140
nvcaffeparser1::CaffeParser::getErrorRecorder
nvinfer1::IErrorRecorder * getErrorRecorder() const override
get the ErrorRecorder assigned to this interface.
Definition: caffeParser.h:60
nvcaffeparser1::parseConcat
nvinfer1::ILayer * parseConcat(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &, BlobNameToTensor &tensors)
Definition: parseConcat.cpp:23
models.meta.no_check_implemented
def no_check_implemented(runner)
Definition: meta.py:82
half_float::detail::lrint
long lrint(half arg)
Nearest integer using half's internal rounding mode.
Definition: ieee_half.h:2600
bert::QKVToContextInterleavedPlugin::getOutputDataType
nvinfer1::DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: qkvToContextInt8InterleavedPlugin.cpp:159
SampleMovieLensArgs::batchSize
int batchSize
Definition: sampleMovieLens.cpp:525
nvinfer1::plugin::ProposalLayer::mAnchorBoxesDevice
std::shared_ptr< CudaBind< float > > mAnchorBoxesDevice
Definition: proposalLayerPlugin.h:106
tests.quant_pooling_test.TestQuantAdaptiveAvgPool3d.test_input_fake_quant
def test_input_fake_quant(self)
Definition: quant_pooling_test.py:297
nvinfer1::plugin::BatchedNMSPlugin::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: batchedNMSPlugin.cpp:414
nvinfer1::CUDADriverWrapper::_cuGetErrorName
CUresult(* _cuGetErrorName)(CUresult, const char **)
Definition: cudaDriverWrapper.h:70
bert::Fused_multihead_attention_params_v2::clear
void clear()
Definition: fused_multihead_attention_v2.h:73
polygraphy.backend.base.loader
Definition: backend/base/loader.py:1
nvinfer1::plugin::ResizeNearest::mNameSpace
std::string mNameSpace
Definition: resizeNearestPlugin.h:93
MultilevelProposeROIWorkSpace::sortClassLabelDims
nvinfer1::DimsHW sortClassLabelDims
Definition: maskRCNNKernels.h:168
nvinfer1::DimsExprs
Definition: NvInferRuntime.h:375
nvcaffeparser1::CaffeParser::~CaffeParser
~CaffeParser() override
Definition: caffeParser.cpp:32
SampleReformatFreeIO::mInputDims
nvinfer1::Dims mInputDims
The dimensions of the input to the network.
Definition: sampleReformatFreeIO.cpp:264
SampleDynamicReshape::infer
bool infer()
Runs inference using TensorRT on a random image.
Definition: sampleDynamicReshape.cpp:321
xy_t::x
int x
Definition: maskRCNNKernels.h:258
nmtSample::Alignment::addToModel
virtual void addToModel(nvinfer1::INetworkDefinition *network, nvinfer1::ITensor *attentionKeys, nvinfer1::ITensor *queryStates, nvinfer1::ITensor **alignmentScores)=0
add the alignment scores calculation to the network
polygraphy.backend.trt.util.str_from_config
def str_from_config(config)
Definition: trt/util.py:266
nvinfer1::plugin::ProposalLayer::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: proposalLayerPlugin.cpp:153
bert::EmbLayerNormVarSeqlenPlugin::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: embLayerNormVarSeqlenPlugin.cpp:357
sample::anonymous_namespace{sampleInference.cpp}::EventType::kCOMPUTE_S
@ kCOMPUTE_S
polygraphy.tools.args.data_loader.DataLoaderArgs.check_registered
def check_registered(self)
Definition: tools/args/data_loader.py:49
sample::Logger::TestAtom::TestAtom
TestAtom(TestAtom &&)=default
nvinfer1::ElementWiseOperation::kMAX
@ kMAX
Maximum of the two elements.
nvinfer1::plugin::PyramidROIAlign::initialize
int initialize() override
Initialize the layer for execution.
Definition: pyramidROIAlignPlugin.cpp:93
polygraphy.tools.util.script.Script.append_prefix
def append_prefix(self, line)
Definition: script.py:211
nvinfer1::plugin::BasePlugin
Definition: plugin.h:44
nvinfer1::plugin::InstanceNormalizationPlugin::~InstanceNormalizationPlugin
~InstanceNormalizationPlugin() override
Definition: instanceNormalizationPlugin.cpp:137
train.load_checkpoint
def load_checkpoint(model, optimizer, epoch, config, amp_run, filepath, local_rank)
Definition: train.py:229
SampleMLP
The SampleMLP class implements the MNIST API sample.
Definition: sampleMLP.cpp:57
sample::toDims
nvinfer1::Dims toDims(const std::vector< int > &vec)
Definition: sampleUtils.h:151
nvinfer1::plugin::GenerateDetection::mRegWeightDevice
std::shared_ptr< CudaBind< float > > mRegWeightDevice
Definition: generateDetectionPlugin.h:105
SampleUffMNIST::verifyOutput
bool verifyOutput(const samplesCommon::BufferManager &buffers, const std::string &outputTensorName, int groundTruthDigit) const
Verifies that the output is correct and prints it.
Definition: sampleUffMNIST.cpp:225
SampleCharRNNBase::build
bool build()
Builds the network engine.
Definition: sampleCharRNN.cpp:284
anonymous_namespace{proposalLayerPlugin.cpp}::PROPOSALLAYER_PLUGIN_NAME
const char * PROPOSALLAYER_PLUGIN_NAME
Definition: proposalLayerPlugin.cpp:33
Logger::TestAtom::mStarted
bool mStarted
Definition: pluginLogging.h:302
polygraphy.util.misc.is_valid_shape_override
def is_valid_shape_override(new_shape, original_shape)
Definition: Polygraphy/polygraphy/util/misc.py:147
polygraphy.util.misc.Compressed
Compression and Serialization.
Definition: Polygraphy/polygraphy/util/misc.py:175
nvinfer1::IConstantLayer::getWeights
virtual Weights getWeights() const =0
Get the weights for the layer.
layers
Definition: layers.py:1
data_function.MelAudioLoader.max_wav_value
max_wav_value
Definition: waveglow/data_function.py:31
nvinfer1::plugin::NMSParameters
The NMSParameters are used by the BatchedNMSPlugin for performing the non_max_suppression operation o...
Definition: NvInferPluginUtils.h:247
nmtSample::TextWriter::TextWriter
TextWriter(std::shared_ptr< std::ostream > textOnput, Vocabulary::ptr vocabulary)
Definition: textWriter.cpp:24
nvinfer1::plugin::FlattenConcat::mCopySize
std::vector< size_t > mCopySize
Definition: flattenConcat.h:99
bert::fused_multihead_attention_fp16_384_64_kernel_sm80_cu_o_len
unsigned int fused_multihead_attention_fp16_384_64_kernel_sm80_cu_o_len
Definition: fused_multihead_attention_fp16_384_64_kernel.sm80.cpp:1664
SampleConfig::setFailurePercentage
void setFailurePercentage(float f)
Definition: sampleConfig.h:291
SampleINT8API::mInputDims
nvinfer1::Dims mInputDims
The dimensions of the input to the network.
Definition: sampleINT8API.cpp:110
samplesCommon::PPM
Definition: samples/common/common.h:652
anonymous_namespace{multilevelProposeROIPlugin.cpp}::MULTILEVELPROPOSEROI_PLUGIN_VERSION
const char * MULTILEVELPROPOSEROI_PLUGIN_VERSION
Definition: multilevelProposeROIPlugin.cpp:34
BertInference::mEnableVariableLen
bool mEnableVariableLen
Definition: bert_infer.h:343
nvinfer1::plugin::ProposalPlugin::mRpnStdScaling
float mRpnStdScaling
Definition: proposalPlugin.h:113
inference.MeasureTime.key
key
Definition: Tacotron2/inference.py:167
polygraphy.backend.trt.runner.TrtRunner.host_output_buffers
host_output_buffers
Definition: backend/trt/runner.py:100
nvinfer1::ISliceLayer::~ISliceLayer
virtual ~ISliceLayer()
Definition: NvInfer.h:4029
half_float::half::half
half(detail::binary_t, detail::uint16 bits)
Constructor.
Definition: samples/common/half.h:1559
backend.tf.test_loader.TestSaveGraph.test_save_tensorboard
def test_save_tensorboard(self)
Definition: tf/test_loader.py:68
nvinfer1::plugin::SplitPlugin::_d_segment_offsets
thrust::device_vector< int > _d_segment_offsets
Definition: split.h:44
SampleMaskRCNN
Definition: sampleUffMaskRCNN.cpp:310
sample::BaseModelOptions::parse
void parse(Arguments &arguments) override
Definition: sampleOptions.cpp:406
nvinfer1::plugin::PriorBoxPluginCreator::mPluginAttributes
static std::vector< PluginField > mPluginAttributes
Definition: priorBoxPlugin.h:117
pytorch_quantization.nn.modules._utils.QuantMixin.init_quantizer
def init_quantizer(self, quant_desc_input, quant_desc_weight, num_layers=None)
Definition: _utils.py:56
tests.license_test.TestLicense
Definition: license_test.py:26
BatchStream::getDims
nvinfer1::Dims getDims() const override
Definition: BatchStream.h:279
UffPoolPluginV2::write
void write(char *&buffer, const T &val) const
Definition: sampleUffPluginV2Ext.cpp:580
nvinfer1::plugin::Region::setPluginNamespace
void setPluginNamespace(const char *pluginNamespace) override
Set the namespace that this plugin object belongs to.
Definition: regionPlugin.cpp:414
polygraphy.util.misc.try_call
def try_call(func, *args, **kwargs)
Definition: Polygraphy/polygraphy/util/misc.py:267
data_function.TextMelLoader.text_cleaners
text_cleaners
Definition: tacotron2/data_function.py:34
nmtSample::accumulateBLEU
void accumulateBLEU(const std::vector< Segment_t > &referenceSamples, const std::vector< Segment_t > &outputSamples, int maxOrder, size_t &referenceLength, size_t &translationLength, std::vector< size_t > &matchesByOrder, std::vector< size_t > &possibleMatchesByOrder)
Definition: bleuScoreWriter.cpp:119
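The signature above suggests a routine that accumulates BLEU statistics (sequence lengths plus matched and possible n-gram counts per order) over pairs of reference and output segments. The Python sketch below shows one common way to accumulate such counts. It is an illustration under that assumption, not the code in bleuScoreWriter.cpp, and the names ngram_counts and accumulate_bleu are hypothetical.

```python
from collections import Counter


def ngram_counts(tokens, order):
    """Count every contiguous n-gram of the given order in a token list."""
    return Counter(
        tuple(tokens[i:i + order]) for i in range(len(tokens) - order + 1)
    )


def accumulate_bleu(references, outputs, max_order):
    """Accumulate BLEU statistics over paired reference/output segments.

    Returns (reference_length, translation_length,
             matches_by_order, possible_matches_by_order).
    """
    reference_length = 0
    translation_length = 0
    matches_by_order = [0] * max_order
    possible_matches_by_order = [0] * max_order

    for ref, out in zip(references, outputs):
        reference_length += len(ref)
        translation_length += len(out)
        for order in range(1, max_order + 1):
            # Clipped matches: an n-gram counts at most as often as it
            # appears in the reference (Counter intersection takes the min).
            overlap = ngram_counts(ref, order) & ngram_counts(out, order)
            matches_by_order[order - 1] += sum(overlap.values())
            # Number of n-grams of this order present in the output.
            possible_matches_by_order[order - 1] += max(len(out) - order + 1, 0)

    return (reference_length, translation_length,
            matches_by_order, possible_matches_by_order)


if __name__ == "__main__":
    refs = [["the", "cat", "sat"]]
    outs = [["the", "cat", "ran"]]
    print(accumulate_bleu(refs, outs, max_order=2))
```

The per-order precision is then matches_by_order[i] / possible_matches_by_order[i], and the two lengths feed the brevity penalty.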
backend.tf.test_loader.TestFrozenGraphLoader
Definition: tf/test_loader.py:34
bert::anonymous_namespace{embLayerNormVarSeqlenPlugin.cpp}::EMB_LAYER_NORM_VAR_SEQLEN_VERSION
static const char * EMB_LAYER_NORM_VAR_SEQLEN_VERSION
Definition: embLayerNormVarSeqlenPlugin.cpp:47
pytorch_quantization.nn.modules.quant_rnn.QuantRNNBase.mode
mode
Definition: modules/quant_rnn.py:45
models.meta.model_path
def model_path(name)
Definition: meta.py:25
sample::InferenceOptions::spin
bool spin
Definition: sampleOptions.h:156
nvinfer1::plugin::BatchedNMSDynamicPlugin::initialize
int initialize() override
Initialize the layer for execution.
Definition: batchedNMSPlugin.cpp:90
nvinfer1::plugin::ProposalPlugin::mNamespace
std::string mNamespace
Definition: proposalPlugin.h:108
bert::EmbLayerNormVarSeqlenPluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: embLayerNormVarSeqlenPlugin.cpp:430
test_importers.TestOnnxImporter.test_import_node
def test_import_node(self)
Definition: test_importers.py:63
nvinfer1::INetworkDefinition::destroy
virtual void destroy()=0
Destroy this INetworkDefinition object.
polygraphy.backend.trt.util.get_output_metadata
def get_output_metadata(network)
Definition: trt/util.py:88
train.get_last_checkpoint_filename
def get_last_checkpoint_filename(output_dir, model_name)
Definition: train.py:219
inference_trt.parse_args
def parse_args(parser)
Definition: inference_trt.py:36
xy_t::y
int y
Definition: maskRCNNKernels.h:257
nmtSample::BeamSearchPolicy::initialize
void initialize(int sampleCount, int *maxOutputSequenceLengths)
Definition: beamSearchPolicy.cpp:37
sample::anonymous_namespace{sampleOptions.cpp}::stringToValue
T stringToValue(const std::string &option)
Definition: sampleOptions.cpp:55
ParserOnnxConfig::getModelFileName
virtual const char * getModelFileName() const
Get the Model FileName.
Definition: parserOnnxConfig.h:87
polygraphy.backend.onnx.runner.OnnxTfRunner.infer_impl
def infer_impl(self, feed_dict)
Definition: backend/onnx/runner.py:50
nvinfer1::plugin::FlattenConcat::mPluginNamespace
std::string mPluginNamespace
Definition: flattenConcat.h:104
nvinfer1::TripLimit::kCOUNT
@ kCOUNT
Tensor is scalar of type kINT32 that contains the trip count.
onnx_models.nested_dup_names
def nested_dup_names()
Definition: onnx_models.py:185
polygraphy.backend.trt.runner.TrtRunner.activate_impl
def activate_impl(self)
Definition: backend/trt/runner.py:59
tools.test_surgeon.TestSurgeonInsert.check_insert_model
def check_insert_model(self, path, expected_node_ops, expected_graph_input_names, expected_graph_output_names)
Definition: test_surgeon.py:46
nvinfer1::plugin::GridAnchorGenerator::enqueue
int enqueue(int batchSize, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: gridAnchorPlugin.cpp:198
nvinfer1::plugin::FlattenConcat::initialize
int initialize() override
Initialize the layer for execution.
Definition: flattenConcat.cpp:121
nvinfer1::plugin::RPROIPluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: nvFasterRCNNPlugin.cpp:349
export_tacotron2_onnx.Postnet.forward
def forward(self, mel_outputs)
Definition: export_tacotron2_onnx.py:81
nvinfer1::plugin::MultilevelProposeROI::mIOUThreshold
float mIOUThreshold
Definition: multilevelProposeROIPlugin.h:99
nvinfer1::INetworkDefinition::getOutput
virtual ITensor * getOutput(int32_t index) const =0
Get the output tensor specified by the given index.
nvinfer1::plugin::PyramidROIAlign::supportsFormat
bool supportsFormat(DataType type, PluginFormat format) const override
Check format support.
Definition: pyramidROIAlignPlugin.cpp:112
tests.quant_conv_transposed_test.TestQuantConvTranspose3D.test_against_unquantized
def test_against_unquantized(self)
Definition: quant_conv_transposed_test.py:307
TLTMaskRCNNConfig::DETECTION_REG_WEIGHTS
static const float DETECTION_REG_WEIGHTS[]
Definition: tlt_mrcnn_config.h:37
polygraphy.backend.tf.util.get_output_metadata
def get_output_metadata(graph, layerwise=False)
Definition: tf/util.py:94
helpers.tokenization.load_vocab
def load_vocab(vocab_file)
Definition: tokenization.py:120
tests.quant_instancenorm_test.TestQuantInstanceNorm1D.test_fake_quant_per_channel
def test_fake_quant_per_channel(self)
Definition: quant_instancenorm_test.py:79
profiler.findAndRemove
def findAndRemove(profile, name)
Definition: profiler.py:70
polygraphy.tools.base.tool.Tool.__init__
def __init__(self, name)
Definition: tool.py:24
nmtSample::Projection::Projection
Projection()=default
model.Decoder.max_decoder_steps
max_decoder_steps
Definition: demo/Tacotron2/tacotron2/model.py:248
SampleConfig::getModelFileName
const char * getModelFileName() const
Get the Model FileName.
Definition: sampleConfig.h:104
test_ir.TestGraph.test_layer_with_arrays
def test_layer_with_arrays(self)
Definition: test_ir.py:532
model.Decoder.early_stopping
early_stopping
Definition: demo/Tacotron2/tacotron2/model.py:252
polygraphy.tools.args.trt.loader.TrtLoaderArgs.get_trt_config_loader
def get_trt_config_loader(self, data_loader)
Definition: tools/args/trt/loader.py:209
nmtSample::Encoder::addToModel
virtual void addToModel(nvinfer1::INetworkDefinition *network, int maxInputSequenceLength, nvinfer1::ITensor *inputEmbeddedData, nvinfer1::ITensor *actualInputSequenceLengths, nvinfer1::ITensor **inputStates, nvinfer1::ITensor **memoryStates, nvinfer1::ITensor **lastTimestepStates)=0
add the memory and last timestep states to the network lastTimestepHiddenStates is the pointer to the...
TLTMaskRCNNConfig::CLASS_NAMES
static const std::vector< std::string > CLASS_NAMES
Definition: tlt_mrcnn_config.h:86
nmtSample::MultiplicativeAlignment
alignment scores from Luong attention mechanism
Definition: multiplicativeAlignment.h:31
polygraphy.backend.cntk
Definition: tools/Polygraphy/polygraphy/backend/cntk/__init__.py:1
helpers.tokenization.whitespace_tokenize
def whitespace_tokenize(text)
Definition: tokenization.py:151
bert::Fused_multihead_attention_params_v2::qkv_ptr
void * qkv_ptr
Definition: fused_multihead_attention_v2.h:28
polygraphy.tools.base.tool.Tool.run
def run(self, args)
Definition: tool.py:68
nvcaffeparser1::readBinaryProto
bool readBinaryProto(trtcaffe::NetParameter *net, const char *file, size_t bufSize)
Definition: readProto.h:37
tests.integration_test.TestNetwork.test_simple_build
def test_simple_build(self)
Definition: integration_test.py:46
backend.trt.test_runner.TestTrtRunner.test_multithreaded_runners_from_engine
def test_multithreaded_runners_from_engine(self)
Definition: trt/test_runner.py:66
sample::TrtCudaEvent::record
void record(const TrtCudaStream &stream)
Definition: sampleDevice.h:134
backend.base.test_runner.test_infer_raises_if_runner_inactive
def test_infer_raises_if_runner_inactive()
Definition: base/test_runner.py:8
nvinfer1::ErrorCode::kINVALID_STATE
@ kINVALID_STATE
TensorRT was put into a bad state by an incorrect sequence of function calls.
polygraphy.util.misc.decompress
def decompress(compressed)
Definition: Polygraphy/polygraphy/util/misc.py:192
FCPlugin::getSerializationSize
virtual size_t getSerializationSize() override
Find the size of the serialization buffer required.
Definition: samples/opensource/samplePlugin/fcPlugin.h:204
nvinfer1::plugin::MultilevelProposeROI::isOutputBroadcastAcrossBatch
bool isOutputBroadcastAcrossBatch(int outputIndex, const bool *inputIsBroadcasted, int nbInputs) const override
Definition: multilevelProposeROIPlugin.cpp:436
nvcaffeparser1::CaffeParser::mPluginFactory
nvcaffeparser1::IPluginFactory * mPluginFactory
Definition: caffeParser.h:87
FCPlugin::type2size
size_t type2size(nvinfer1::DataType type)
Definition: samples/opensource/samplePlugin/fcPlugin.h:224
nvinfer1::IRuntime::setDLACore
virtual void setDLACore(int32_t dlaCore) noexcept=0
Set the DLA core that the deserialized engine must execute on.
nvinfer1::IPoolingLayer::~IPoolingLayer
virtual ~IPoolingLayer()
Definition: NvInfer.h:1862
half_float::half::operator+=
detail::enable< half &, T >::type operator+=(T rhs)
Arithmetic assignment.
Definition: samples/common/half.h:1441
AlgorithmCacheReader::choiceMap
std::unordered_map< std::string, AlgorithmCacheItem > choiceMap
Definition: sampleAlgorithmSelector.cpp:243
polygraphy.comparator.comparator.Comparator.compare_accuracy
def compare_accuracy(run_results, fail_fast=False, comparisons=None, compare_func=None)
Definition: comparator/comparator.py:214
model.Attention.forward
def forward(self, attention_hidden_state, memory, processed_memory, attention_weights_cat, mask)
Definition: demo/Tacotron2/tacotron2/model.py:85
half_float::detail::enable< T, expr, expr, half >::type
T type
Definition: ieee_half.h:385
test_examples.load_commands_from_readme
def load_commands_from_readme(readme)
Definition: onnx-graphsurgeon/tests/test_examples.py:52
nmtSample::SLPEmbedder::mResizedKernelWeights
std::vector< float > mResizedKernelWeights
Definition: slpEmbedder.h:55
bert::QKVToContextInterleavedPlugin::mNumHeads
int mNumHeads
Definition: qkvToContextInt8InterleavedPlugin.h:86
tests.print_test.TestPrint.quant_linear
quant_linear
Definition: print_test.py:50
sample::Logger::mReportableSeverity
Severity mReportableSeverity
Definition: samples/common/logging.h:441
polygraphy.util.cuda.Stream.free
def free(self)
Definition: cuda.py:95
pytorch_quantization.nn.modules.quant_conv._QuantConvNd._quant
def _quant(self, input)
Definition: quant_conv.py:73
sample::Logger::reportTestEnd
static void reportTestEnd(const TestAtom &testAtom, TestResult result)
Report that a test has ended.
Definition: samples/common/logging.h:340
nvinfer1::PoolingType::kAVERAGE
@ kAVERAGE
samplesCommon::PPM::fileName
std::string fileName
Definition: samples/common/common.h:654
sample::BuildOptions::int8
bool int8
Definition: sampleOptions.h:115
half_float::detail::log10
expr log10(half arg)
Common logarithm.
Definition: ieee_half.h:2354
comparator.test_comparator.TestComparator.test_segfault_does_not_hang
def test_segfault_does_not_hang(self)
Definition: test_comparator.py:110
nvinfer1::plugin::Region::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: regionPlugin.cpp:388
half_float::detail::functions::exp
static expr exp(float arg)
Exponential implementation.
Definition: samples/common/half.h:1767
ConcatTopKWorkSpace::concatedScoreDims
nvinfer1::DimsHW concatedScoreDims
Definition: maskRCNNKernels.h:195
nvinfer1::IPluginV2IOExt::configurePlugin
virtual void configurePlugin(const PluginTensorDesc *in, int32_t nbInput, const PluginTensorDesc *out, int32_t nbOutput)=0
Configure the layer.
nmtSample::SoftmaxLikelihood::getLikelihoodCombinationOperator
LikelihoodCombinationOperator::ptr getLikelihoodCombinationOperator() const override
Definition: softmaxLikelihood.cpp:98
nvinfer1::EnumMax< SliceMode >
constexpr int32_t EnumMax< SliceMode >()
Maximum number of elements in SliceMode enum.
Definition: NvInfer.h:3889
polygraphy.tools.args.onnx.runner.OnnxtfRunnerArgs.add_to_script
def add_to_script(self, script)
Definition: tools/args/onnx/runner.py:32
nvinfer1::IPluginRegistry::setErrorRecorder
virtual void setErrorRecorder(IErrorRecorder *recorder) noexcept=0
Set the ErrorRecorder for this interface.
bert::EmbLayerNormVarSeqlenPlugin::EmbLayerNormVarSeqlenPlugin
EmbLayerNormVarSeqlenPlugin()=delete
samplesCommon::Args::loadEngine
std::string loadEngine
Definition: argsParser.h:88
nvinfer1::IBuilderConfig::~IBuilderConfig
virtual ~IBuilderConfig()
Definition: NvInfer.h:6736
nvcaffeparser1::BlobNameToTensor::mError
bool mError
Definition: blobNameToTensor.h:68
nvinfer1::ILRNLayer::~ILRNLayer
virtual ~ILRNLayer()
Definition: NvInfer.h:2060
Int8EntropyCalibrator2::Int8EntropyCalibrator2
Int8EntropyCalibrator2(BatchStream &stream, int firstBatch, const char *networkName, const char *inputBlobName, bool readCache=true)
Definition: frcnnUtils.h:510
nmtSample::DebugUtil::mPlugins
static std::list< DumpTensorPlugin::ptr > mPlugins
Definition: debugUtil.h:80
nvinfer1::ILogger::~ILogger
virtual ~ILogger()
Definition: NvInferRuntimeCommon.h:1101
bert::Fused_multihead_attention_params::h
int h
Definition: fused_multihead_attention.h:79
pytorch_quantization.calib.calibrator._Calibrator.__repr__
def __repr__(self)
Definition: tools/pytorch-quantization/pytorch_quantization/calib/calibrator.py:56
nvinfer1::IHostMemory::data
virtual void * data() const noexcept=0
A pointer to the raw data that is owned by the library.
nvcaffeparser1::ICaffeParser::parse
virtual const IBlobNameToTensor * parse(const char *deploy, const char *model, nvinfer1::INetworkDefinition &network, nvinfer1::DataType weightType)=0
Parse a prototxt file and a binaryproto Caffe model to extract network definition and weights associa...
FrcnnArgs::inputImages
std::vector< std::string > inputImages
Definition: frcnnUtils.h:63
nvinfer1::plugin::BatchedNMSPlugin::scoresSize
int scoresSize
Definition: batchedNMSPlugin.h:69
sample::Profiler::getTotalTime
float getTotalTime() const
Definition: sampleReporting.h:182
sample::BuildOptions::fp16
bool fp16
Definition: sampleOptions.h:114
nmtSample::LikelihoodCombinationOperator::LikelihoodCombinationOperator
LikelihoodCombinationOperator()=default
sample::UffInput::NHWC
bool NHWC
Definition: sampleOptions.h:87
nvinfer1::ITensor::dynamicRangeIsSet
virtual bool dynamicRangeIsSet() const =0
Query whether dynamic range is set.
nvcaffeparser1::createCaffeParser
ICaffeParser * createCaffeParser()
Creates a ICaffeParser object.
Definition: NvCaffeParser.cpp:32
bert::SkipLayerNormInterleavedPluginCreator::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin creator object.
Definition: skipLayerNormInt8InterleavedPlugin.cpp:344
sample::ReportingOptions::avgs
int avgs
Definition: sampleOptions.h:172
nvinfer1::plugin::SpecialSlice::check_valid_inputs
void check_valid_inputs(const nvinfer1::Dims *inputs, int nbInputDims)
Definition: specialSlicePlugin.cpp:137
pytorch_quantization.calib.max.MaxCalibrator._calib_amax
_calib_amax
Definition: max.py:42
polygraphy.backend.trt.loader.ShapeTuple.max
max
Definition: backend/trt/loader.py:247
onnx_graphsurgeon.exporters.onnx_exporter.OnnxExporter.export_value_info_proto
onnx.ValueInfoProto export_value_info_proto(Tensor tensor, bool do_type_check)
Definition: onnx_exporter.py:39
nvinfer1::IGpuAllocator
Application-implemented class for controlling allocation on the GPU.
Definition: NvInferRuntimeCommon.h:1033
nvinfer1::plugin::DetectionOutput::canBroadcastInputAcrossBatch
bool canBroadcastInputAcrossBatch(int inputIndex) const override
Definition: nmsPlugin.cpp:202
polygraphy.backend.tf.loader.ModifyGraph.__init__
def __init__(self, graph, outputs=None)
Definition: backend/tf/loader.py:271
BatchStream::mFileCount
int mFileCount
Definition: BatchStream.h:376
nvinfer1::ITensor::getType
virtual DataType getType() const =0
Get the data type of a tensor.
nvinfer1::plugin::BatchedNMSPlugin::initialize
int initialize() override
Initialize the layer for execution.
Definition: batchedNMSPlugin.cpp:85
polygraphy.backend.trt_legacy.TrtLegacyRunner.HostDeviceMem.__init__
def __init__(self, host_mem, device_mem)
Definition: backend/trt_legacy.py:180
polygraphy.backend.onnx.loader.OnnxFromTfGraph.opset
opset
Definition: backend/onnx/loader.py:68
bert::SkipLayerNormInterleavedPluginCreator::mFC
static nvinfer1::PluginFieldCollection mFC
Definition: skipLayerNormInt8InterleavedPlugin.h:114
SampleUffFasterRcnnParams::outputRegName
std::string outputRegName
Definition: sampleUffFasterRCNN.cpp:64
SampleMLP::mWeightMap
std::map< std::string, std::pair< nvinfer1::Dims, nvinfer1::Weights > > mWeightMap
The weight name to weight value map.
Definition: sampleMLP.cpp:90
nvinfer1::EnumMax< PluginType >
constexpr int32_t EnumMax< PluginType >()
Maximum number of elements in PluginType enum.
Definition: NvInferPluginUtils.h:53
polygraphy.tools.args.trt.runner.TrtRunnerArgs.save_engine
save_engine
Definition: tools/args/trt/runner.py:48
nvinfer1::plugin::SplitPlugin::getOutputDataType
nvinfer1::DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
samplesCommon::toLower
std::string toLower(const std::string &inp)
Definition: samples/common/common.h:494
SampleConfig
Definition: sampleConfig.h:26
SampleINT8::isSupported
bool isSupported(DataType dataType)
Checks if the platform supports the data type.
Definition: sampleINT8.cpp:178
loss_function.Tacotron2Loss
Definition: tacotron2/loss_function.py:20
polygraphy.tools.run.run.Run
Definition: run.py:95
model.LocationLayer
Definition: demo/Tacotron2/tacotron2/model.py:29
bert::TFusedMultiHeadAttentionXMMAKernel::mModules
std::unordered_map< const unsigned char *, CUmodule > mModules
Definition: fused_multihead_attention.h:281
samplesCommon::GenericBuffer::resize
void resize(const nvinfer1::Dims &dims)
Overload of resize that accepts Dims.
Definition: buffers.h:161
test_ir.TestNodeIO.tensors
tensors
Definition: test_ir.py:223
nvinfer1::plugin::Region::num
int num
Definition: regionPlugin.h:93
SampleUffFasterRcnnParams::inputImages
std::vector< std::string > inputImages
Definition: sampleUffFasterRCNN.cpp:67
nvonnxparser::IOnnxConfig::setPrintLayerInfo
virtual void setPrintLayerInfo(bool)=0
Set whether the layer information will be printed.
nvinfer1::IConvolutionLayer::~IConvolutionLayer
virtual ~IConvolutionLayer()
Definition: NvInfer.h:1368
bert::fused_multihead_attention_v2_int8_128_64_kernel_sm80_cubin
unsigned char fused_multihead_attention_v2_int8_128_64_kernel_sm80_cubin[]
Definition: fused_multihead_attention_v2_int8_128_64_kernel.sm80.cpp:19
polygraphy.backend.trt.loader.CreateNetwork
Definition: backend/trt/loader.py:60
polygraphy.backend.trt_legacy.LoadUffFile.__init__
def __init__(self, path, shapes, outputs)
Definition: backend/trt_legacy.py:36
sample::Logger::TestResult
TestResult
Represents the state of a given test.
Definition: samples/common/logging.h:218
nvinfer1::plugin::PyramidROIAlign::mFeatureMapCount
static const int mFeatureMapCount
Definition: pyramidROIAlignPlugin.h:95
nmtSample::Vocabulary::getId
int getId(const std::string &token) const
get the ID of the token
Definition: vocabulary.cpp:42
onnx_graphsurgeon.ir.tensor.Tensor.__class__
__class__
Definition: tensor.py:69
MNISTBatchStream::mMaxBatches
int mMaxBatches
Definition: BatchStream.h:148
nmtSample::Embedder
projects 1-hot vectors (represented as a vector of indices) into a dense embedding space
Definition: embedder.h:32
nvcaffeparser1::IPluginFactoryExt::isPluginExt
virtual bool isPluginExt(const char *layerName)=0
A user-implemented function that determines if a layer configuration is provided by an IPluginExt.
SampleDynamicReshape::mOutput
samplesCommon::ManagedBuffer mOutput
Host buffer for the output.
Definition: sampleDynamicReshape.cpp:91
sample::TrtCudaStream::~TrtCudaStream
~TrtCudaStream()
Definition: sampleDevice.h:73
polygraphy.backend.trt.loader.NetworkFromOnnxBytes
Definition: backend/trt/loader.py:111
polygraphy.util.cuda.Cuda.create_stream
def create_stream(self)
Definition: cuda.py:45
half_float::detail::functions::signbit
static bool signbit(half arg)
Sign bit implementation.
Definition: samples/common/half.h:2378
MaskRCNNUtils::PPM::max
int max
Definition: sampleUffMaskRCNN.cpp:75
polygraphy.tools.args.logger.LoggerArgs.get_logger
def get_logger(self)
Definition: Polygraphy/polygraphy/tools/args/logger.py:71
test_importers.TestOnnxImporter
Definition: test_importers.py:32
nvinfer1::plugin::LReLU::mBatchDim
int mBatchDim
Definition: lReluPlugin.h:73
anonymous_namespace{detectionLayerPlugin.cpp}::DETECTIONLAYER_PLUGIN_NAME
const char * DETECTIONLAYER_PLUGIN_NAME
Definition: detectionLayerPlugin.cpp:29
half_float::detail::cos
expr cos(half arg)
Cosine function.
Definition: ieee_half.h:2424
model.Encoder.forward
def forward(self, x, input_lengths)
Definition: demo/Tacotron2/tacotron2/model.py:200
audio_processing.griffin_lim
def griffin_lim(magnitudes, stft_fn, n_iters=30)
Definition: audio_processing.py:75
half_float::detail::acosh
expr acosh(half arg)
Hyperbolic area cosine.
Definition: ieee_half.h:2501
nvinfer1::plugin::FlattenConcat::enqueue
int enqueue(int batchSize, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: flattenConcat.cpp:133
option::val
int val
Definition: getopt.h:79
builder_varseqlen.set_output_name
def set_output_name(layer, prefix, name, out_idx=0)
Definition: builder_varseqlen.py:114
nvinfer1::plugin::FlattenConcatPluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: flattenConcat.cpp:311
polygraphy.tools.args.model.ModelArgs.add_to_parser
def add_to_parser(self, parser)
Definition: tools/Polygraphy/polygraphy/tools/args/model.py:33
pytorch_quantization.calib.histogram.HistogramCalibrator._compute_amax_entropy
def _compute_amax_entropy(self, stride, start_bin)
Definition: histogram.py:92
nvinfer1::plugin::InstanceNormalizationPlugin::getWorkspaceSize
size_t getWorkspaceSize(const nvinfer1::PluginTensorDesc *inputs, int nbInputs, const nvinfer1::PluginTensorDesc *outputs, int nbOutputs) const override
Definition: instanceNormalizationPlugin.cpp:166
data_function.TextMelCollate.__call__
def __call__(self, batch)
Definition: tacotron2/data_function.py:89
nmtSample::Alignment::~Alignment
~Alignment() override=default
nvinfer1::IErrorRecorder::RefCount
int32_t RefCount
A typedef of a 32bit integer for reference counting.
Definition: NvInferRuntimeCommon.h:1244
EntropyCalibratorImpl::~EntropyCalibratorImpl
virtual ~EntropyCalibratorImpl()
Definition: EntropyCalibrator.h:44
backend.trt.test_loader.identity_builder_network
def identity_builder_network()
Definition: trt/test_loader.py:41
nvinfer1::plugin::CropAndResizePlugin::enqueue
int enqueue(int batchSize, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: cropAndResizePlugin.cpp:131
nvinfer1::plugin::MultilevelCropAndResize::mFeatureMapCount
static const int mFeatureMapCount
Definition: multilevelCropAndResizePlugin.h:95
polygraphy.logger.logger.Logger.letter
letter
Definition: Polygraphy/polygraphy/logger/logger.py:131
polygraphy.backend.onnx.util.get_output_metadata
def get_output_metadata(graph)
Definition: onnx/util.py:164
half_float::detail::log2
expr log2(half arg)
Binary logarithm.
Definition: ieee_half.h:2368
text._arpabet_to_sequence
def _arpabet_to_sequence(text)
Definition: demo/Tacotron2/tacotron2/text/__init__.py:69
nvinfer1::plugin::MultilevelProposeROIPluginCreator::mIOUThreshold
float mIOUThreshold
Definition: multilevelProposeROIPlugin.h:142
nvuffparser::FieldCollection::nbFields
int32_t nbFields
Definition: NvUffParser.h:89
nvinfer1::plugin::DetectionOutput::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: nmsPlugin.cpp:67
model.Decoder.attention_layer
attention_layer
Definition: demo/Tacotron2/tacotron2/model.py:262
polygraphy.backend.tf.loader.ModifyGraph._graph
_graph
Definition: backend/tf/loader.py:286
SampleConfig::mReferenceFilename
std::string mReferenceFilename
Definition: sampleConfig.h:41
SampleDynamicReshape::SampleUniquePtr
std::unique_ptr< T, samplesCommon::InferDeleter > SampleUniquePtr
Definition: sampleDynamicReshape.cpp:48
sample::SystemOptions::device
int device
Definition: sampleOptions.h:136
data_function.MelAudioLoader.segment_length
segment_length
Definition: waveglow/data_function.py:37
nvinfer1::INetworkDefinition::addFullyConnected
virtual IFullyConnectedLayer * addFullyConnected(ITensor &input, int32_t nbOutputs, Weights kernelWeights, Weights biasWeights)=0
Add a fully connected layer to the network.
nvinfer1::EngineCapability::kSAFE_DLA
@ kSAFE_DLA
Safety restricted capability, TensorRT flow that can only run on DLA devices via NvMediaDLA APIs.
tests.quant_pooling_test.TestQuantAvgPool2d
Definition: quant_pooling_test.py:149
MaskRCNNConfig::MODEL_NAME
static const std::string MODEL_NAME
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:170
nvinfer1::plugin::BatchedNMSPlugin::mPrecision
DataType mPrecision
Definition: batchedNMSPlugin.h:73
samplesCommon::PPM::max
int max
Definition: samples/common/common.h:655
bert::fused_multihead_attention_fp16_128_64_kernel_sm75_cu_o_len
unsigned int fused_multihead_attention_fp16_128_64_kernel_sm75_cu_o_len
Definition: fused_multihead_attention_fp16_128_64_kernel.sm75.cpp:2702
SampleINT8API::SampleINT8API
SampleINT8API(const SampleINT8APIParams &params)
Definition: sampleINT8API.cpp:83
classification_flow.main
def main(cmdline_args)
Definition: classification_flow.py:182
nvinfer1::plugin::Reorg::stride
int stride
Definition: reorgPlugin.h:88
sample::Bindings::dumpBindingDimensions
void dumpBindingDimensions(int binding, const nvinfer1::IExecutionContext &context, std::ostream &os) const
Definition: sampleUtils.h:371
test_ir.TestGraph.test_topologically_sort
def test_topologically_sort(self, toposort_test_case)
Definition: test_ir.py:618
SampleMLP::transposeWeights
void transposeWeights(nvinfer1::Weights &wts, int hiddenSize)
Transpose weights.
Definition: sampleMLP.cpp:461
nvinfer1::IRuntime::getNbDLACores
virtual int32_t getNbDLACores() const noexcept=0
Returns number of DLA hardware cores accessible.
sample::LogStreamConsumer
Convenience object used to facilitate use of C++ stream syntax when logging messages....
Definition: samples/common/logging.h:132
parserutils::getCHW
nvinfer1::DimsCHW getCHW(const nvinfer1::Dims &d)
Definition: parserUtils.h:108
nvinfer1::plugin::PyramidROIAlign::mThresh
float mThresh
Definition: pyramidROIAlignPlugin.h:99
nvinfer1::plugin::RPROIParams::nmsMaxOut
int32_t nmsMaxOut
Definition: NvInferPluginUtils.h:122
polygraphy.tools.inspect.subtool.data.Data
Definition: data.py:23
polygraphy.tools.util.script.Script.add_runner
def add_runner(self, runner_str)
Definition: script.py:191
nvcaffeparser1::IBinaryProtoBlob
Object used to store and query data extracted from a binaryproto file using the ICaffeParser.
Definition: NvCaffeParser.h:71
builder.onnx_to_trt_name
def onnx_to_trt_name(onnx_name)
Definition: builder.py:440
nvinfer1::plugin::BaseCreator::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin creator object.
Definition: plugin.h:68
sample::gLogger
Logger gLogger
Definition: logger.cpp:22
nvinfer1::plugin::PriorBox::supportsFormat
bool supportsFormat(DataType type, PluginFormat format) const override
Check format support.
Definition: priorBoxPlugin.cpp:209
nvinfer1::plugin::detectionInferenceWorkspaceSize
size_t detectionInferenceWorkspaceSize(bool shareLocation, int N, int C1, int C2, int numClasses, int numPredsPerClass, int topK, DataType DT_BBOX, DataType DT_SCORE)
Definition: kernel.cpp:39
test_api.TestApi.test_export
def test_export(self)
Definition: test_api.py:35
denoiser.Denoiser.stft
stft
Definition: denoiser.py:30
SimpleProfiler::mProfile
std::map< std::string, Record > mProfile
Definition: samples/common/common.h:222
polygraphy.tools.args.trt.runner.TrtRunnerArgs._write
_write
Definition: tools/args/trt/runner.py:23
polygraphy.backend.base.runner.BaseRunner.infer
def infer(self, feed_dict)
Definition: backend/base/runner.py:104
SampleCharRNNBase::mParams
SampleCharRNNParams mParams
Definition: sampleCharRNN.cpp:187
nvinfer1::plugin::MultilevelCropAndResizePluginCreator::mFC
static PluginFieldCollection mFC
Definition: multilevelCropAndResizePlugin.h:123
polygraphy.backend.trt.loader.CreateConfig
Definition: backend/trt/loader.py:299
sample::LogStreamConsumerBuffer::~LogStreamConsumerBuffer
~LogStreamConsumerBuffer()
Definition: samples/common/logging.h:51
polygraphy.tools.base.tool.Tool.setup_parser
def setup_parser(self, subparsers)
Definition: tool.py:47
polygraphy.tools.args.logger.LoggerArgs.parse
def parse(self, args)
Definition: Polygraphy/polygraphy/tools/args/logger.py:31
pytorch_quantization.optim
Definition: tools/pytorch-quantization/pytorch_quantization/optim/__init__.py:1
nvinfer1::IPluginV2DynamicExt::getOutputDimensions
virtual DimsExprs getOutputDimensions(int32_t outputIndex, const DimsExprs *inputs, int32_t nbInputs, IExprBuilder &exprBuilder)=0
Get expressions for computing dimensions of an output tensor from dimensions of the input tensors.
anonymous_namespace{regionPlugin.cpp}::SoftmaxTreeDeleter::operator()
void operator()(softmaxTree *smTree) const
Definition: regionPlugin.cpp:48
data_function.MelAudioLoader.audiopaths_and_text
audiopaths_and_text
Definition: waveglow/data_function.py:30
anonymous_namespace{split.h}::SPLIT_PLUGIN_VERSION
constexpr const char * SPLIT_PLUGIN_VERSION
Definition: split.h:30
nvinfer1::plugin::GridAnchorGenerator::mParam
std::vector< GridAnchorParameters > mParam
Definition: gridAnchorPlugin.h:96
nvinfer1::plugin::CoordConvACPlugin::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: coordConvACPlugin.cpp:94
onnx_graphsurgeon.util.misc.SynchronizedList.clear
def clear(self)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/util/misc.py:123
tests.tensor_quantizer_test.TestTensorQuantizer
Definition: tensor_quantizer_test.py:36
BatchStream::getLabels
float * getLabels() override
Definition: BatchStream.h:264
sample::anonymous_namespace{sampleOptions.cpp}::printPrecision
std::ostream & printPrecision(std::ostream &os, const BuildOptions &options)
Definition: sampleOptions.cpp:372
sample::MirroredBuffer::getDeviceBuffer
void * getDeviceBuffer() const
Definition: sampleDevice.h:323
onnx_models.Model.outputs
outputs
Definition: onnx_models.py:38
nvinfer1::IPluginV2IOExt
Plugin class for user-implemented layers.
Definition: NvInferRuntimeCommon.h:716
nvinfer1::plugin::BatchedNMSDynamicPlugin::setClipParam
void setClipParam(bool clip)
Definition: batchedNMSPlugin.cpp:454
backend.trt.test_loader.TestModifyNetwork.test_mark_layerwise
def test_mark_layerwise(self, identity_identity_network)
Definition: trt/test_loader.py:121
sample::BuildOptions::maxBatch
int maxBatch
Definition: sampleOptions.h:108
half_float::detail::half_caster< T, expr, R >::cast
static T cast(expr arg)
Definition: samples/common/half.h:2666
model.WaveGlow.forward
def forward(self, forward_input)
Definition: demo/Tacotron2/waveglow/model.py:186
nvinfer1::plugin::RPROIParams::iouThreshold
float iouThreshold
Definition: NvInferPluginUtils.h:125
SampleFasterRCNN::constructNetwork
void constructNetwork(SampleUniquePtr< nvcaffeparser1::ICaffeParser > &parser, SampleUniquePtr< nvinfer1::IBuilder > &builder, SampleUniquePtr< nvinfer1::INetworkDefinition > &network, SampleUniquePtr< nvinfer1::IBuilderConfig > &config)
Parses a Caffe model for FasterRCNN and creates a TensorRT network.
Definition: sampleFasterRCNN.cpp:181
nvinfer1::PluginType::kANCHORGENERATOR
@ kANCHORGENERATOR
SSD Grid Anchor Generator.
samplesCommon::BufferManager::copyOutputToHost
void copyOutputToHost()
Copy the contents of output device buffers to output host buffers synchronously.
Definition: buffers.h:403
nvinfer1::INetworkDefinition::addActivation
virtual IActivationLayer * addActivation(ITensor &input, ActivationType type)=0
Add an activation layer to the network.
nvinfer1::plugin::GroupNormalizationPlugin::mNbGroups
int mNbGroups
Definition: groupNormalizationPlugin.h:99
SampleMLP::build
bool build()
Function builds the network engine.
Definition: sampleMLP.cpp:142
nmtSample::LSTMEncoder::getMemoryStatesSize
int getMemoryStatesSize() override
get the size of the memory state vector
Definition: lstmEncoder.cpp:100
test_ir.TestGraph.test_register
def test_register(self)
Definition: test_ir.py:461
nvinfer1::plugin::ProposalPlugin::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: proposalPlugin.cpp:155
anonymous_namespace{gridAnchorPlugin.cpp}::GRID_ANCHOR_PLUGIN_VERSION
const char * GRID_ANCHOR_PLUGIN_VERSION
Definition: gridAnchorPlugin.cpp:30
polygraphy.comparator.data_loader.DataLoaderCache
Definition: comparator/data_loader.py:198
polygraphy.tools.registry.MissingTool.__doc__
__doc__
Definition: registry.py:33
nvinfer1::plugin::DetectionLayer::mKeepTopK
int mKeepTopK
Definition: detectionLayerPlugin.h:95
nvinfer1::EnumMax
constexpr int32_t EnumMax()
Forward declare IGpuAllocator for use in other interfaces.
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer.unsigned
def unsigned(self)
Definition: tensor_quantizer.py:114
nvinfer1::plugin::ProposalPlugin::configurePlugin
void configurePlugin(const Dims *inputDims, int nbInputs, const Dims *outputDims, int nbOutputs, const DataType *inputTypes, const DataType *outputTypes, const bool *inputIsBroadcast, const bool *outputIsBroadcast, PluginFormat floatFormat, int maxBatchSize) override
Definition: proposalPlugin.cpp:294
perf.DeviceBuffer
Definition: perf.py:28
samplesCommon::BufferManager::getHostBuffer
void * getHostBuffer(const std::string &tensorName) const
Returns the host buffer corresponding to tensorName.
Definition: buffers.h:304
ProposalWorkSpace::sortClassLabelDims
nvinfer1::DimsHW sortClassLabelDims
Definition: maskRCNNKernels.h:133
trt_utils.build_engine
def build_engine(model_file, shapes, max_ws=512 *1024 *1024, fp16=False)
Definition: trt_utils.py:92
nvinfer1::RNNOperation::kTANH
@ kTANH
Single gate RNN w/ TANH activation function.
train.evaluating
def evaluating(model)
Definition: train.py:249
polygraphy.tools.precision.precision.STWorstFirst
Definition: precision.py:166
nvinfer1::plugin::RPROIPlugin::copyFromHost
int copyFromHost(char *dstHostBuffer, const void *source, int count) const
Definition: nvFasterRCNNPlugin.cpp:208
nvinfer1::plugin::FlattenConcat::configurePlugin
void configurePlugin(const Dims *inputDims, int nbInputs, const Dims *outputDims, int nbOutputs, const DataType *inputTypes, const DataType *outputTypes, const bool *inputIsBroadcast, const bool *outputIsBroadcast, PluginFormat floatFormat, int maxBatchSize) override
Definition: flattenConcat.cpp:227
model.Attention.get_alignment_energies
def get_alignment_energies(self, query, processed_memory, attention_weights_cat)
Definition: demo/Tacotron2/tacotron2/model.py:63
SampleOnnxMNIST::mParams
samplesCommon::OnnxSampleParams mParams
The parameters for the sample.
Definition: sampleOnnxMNIST.cpp:69
polygraphy.tools.args.tf.loader.TfLoaderArgs.register
def register(self, maker)
Definition: tools/args/tf/loader.py:47
polygraphy.backend.trt.loader.NetworkFromOnnxBytes.__call__
def __call__(self)
Definition: backend/trt/loader.py:123
nvinfer1::plugin::LReluPluginCreator::createPlugin
IPluginV2 * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: lReluPlugin.cpp:155
pytorch_quantization.nn.functional.ClipFunction.backward
def backward(ctx, grad_output)
Definition: functional.py:43
nvinfer1::PluginField::PluginField
PluginField(const char *name_=nullptr, const void *data_=nullptr, const PluginFieldType type_=PluginFieldType::kUNKNOWN, int32_t length_=0)
Definition: NvInferRuntimeCommon.h:867
polygraphy.backend.tf.loader.CreateConfig.__call__
def __call__(self)
Definition: backend/tf/loader.py:374
polygraphy.tools.base.tool
Definition: tool.py:1
builder_varseqlen.build_engine
def build_engine(batch_size, workspace_size, sequence_length, config, weights_dict, squad_json, vocab_file, calibrationCacheFile, calib_num)
Definition: builder_varseqlen.py:546
pytorch_quantization.utils
Definition: tools/pytorch-quantization/pytorch_quantization/utils/__init__.py:1
nvinfer1::plugin::BatchedNMSDynamicPlugin::enqueue
int enqueue(const PluginTensorDesc *inputDesc, const PluginTensorDesc *outputDesc, const void *const *inputs, void *const *outputs, void *workspace, cudaStream_t stream) override
Execute the layer.
Definition: batchedNMSPlugin.cpp:229
bert::QKVToContextInterleavedPlugin::getWorkspaceSize
size_t getWorkspaceSize(const nvinfer1::PluginTensorDesc *inputs, int nbInputs, const nvinfer1::PluginTensorDesc *outputs, int nbOutputs) const override
Definition: qkvToContextInt8InterleavedPlugin.cpp:152
OutputParams::userId
int32_t userId
Definition: sampleMovieLens.cpp:51
nvinfer1::plugin::BatchedNMSDynamicPlugin::mClipBoxes
bool mClipBoxes
Definition: batchedNMSPlugin.h:117
nvinfer1::INetworkDefinition::addElementWise
virtual IElementWiseLayer * addElementWise(ITensor &input1, ITensor &input2, ElementWiseOperation op)=0
Add an elementwise layer to the network.
onnx_graphsurgeon.ir.tensor.Constant.copy
def copy(self)
Definition: tensor.py:210
polygraphy.tools.util.misc
Definition: Polygraphy/polygraphy/tools/util/misc.py:1
nvonnxparser::IOnnxConfig::setFullTextFileName
virtual void setFullTextFileName(const char *fullTextFileName)=0
Set the File Name of the Network Description as a Text File, including the weights.
helpers.calibrator.BertCalibrator.num_inputs
num_inputs
Definition: demo/BERT/helpers/calibrator.py:40
nvinfer1::INetworkDefinition::getNbLayers
virtual int32_t getNbLayers() const =0
Get the number of layers in the network.
nvinfer1::plugin::MultilevelCropAndResizePluginCreator::~MultilevelCropAndResizePluginCreator
~MultilevelCropAndResizePluginCreator()
Definition: multilevelCropAndResizePlugin.h:110
SampleMNIST::teardown
bool teardown()
Used to clean up any state created in the sample class.
Definition: sampleMNIST.cpp:345
sample::TrtCudaEvent::get
cudaEvent_t get() const
Definition: sampleDevice.h:129
UffPoolPluginV2::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: sampleUffPluginV2Ext.cpp:391
model.Tacotron2.mask_padding
mask_padding
Definition: demo/Tacotron2/tacotron2/model.py:590
nvinfer1::plugin::InstanceNormalizationPluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: instanceNormalizationPlugin.cpp:335
TrtDestroyer::operator()
void operator()(T *t)
Definition: demo/BERT/infer_c/common.h:169
nmtSample::BLEUScoreWriter::mVocabulary
Vocabulary::ptr mVocabulary
Definition: bleuScoreWriter.h:53
nmtSample::Vocabulary::mIdToToken
std::vector< std::string > mIdToToken
Definition: vocabulary.h:73
nvinfer1::plugin::PriorBox::PriorBox
PriorBox(PriorBoxParameters param, int H=0, int W=0)
Definition: priorBoxPlugin.cpp:40
polygraphy.logger.logger.Logger.line_info
line_info
Definition: Polygraphy/polygraphy/logger/logger.py:133
nvinfer1::plugin::SpecialSlice::detachFromContext
void detachFromContext() override
Detach the plugin object from its execution context.
Definition: specialSlicePlugin.cpp:212
sample::TrtCudaEvent::operator-
float operator-(const TrtCudaEvent &e) const
Definition: sampleDevice.h:145
SampleConfig::getTF32
bool getTF32() const
Definition: sampleConfig.h:94
layers.TacotronSTFT.mel_spectrogram
def mel_spectrogram(self, y)
Definition: layers.py:78
nvinfer1::plugin::GridAnchorGenerator
Definition: gridAnchorPlugin.h:30
nmtSample::DebugUtil::DumpTensorPlugin::mDims
nvinfer1::Dims mDims
Definition: debugUtil.h:69
pytorch_quantization.nn.modules._utils.QuantMixin._input_quantizers
_input_quantizers
Definition: _utils.py:83
polygraphy.backend.onnx.loader.ModifyOnnx.__call__
def __call__(self)
Definition: backend/onnx/loader.py:129
nvinfer1::utility::TRTOption
TRTOption defines a command line option.
Definition: getOptions.h:34
SampleOnnxMNIST::processInput
bool processInput(const samplesCommon::BufferManager &buffers)
Reads the input and stores the result in a managed buffer.
Definition: sampleOnnxMNIST.cpp:238
test_ir.TensorBaseTests
Definition: test_ir.py:30
util.test_format.FormatTestCase.format
format
Definition: test_format.py:26
nvinfer1::plugin::GroupNormalizationPlugin::bnScale
void * bnScale
Definition: groupNormalizationPlugin.h:105
FCPlugin::mDeviceBias
void * mDeviceBias
Definition: samples/opensource/samplePlugin/fcPlugin.h:313
nvinfer1::LayerType::kELEMENTWISE
@ kELEMENTWISE
Elementwise layer.
data_function.MelAudioLoader.__getitem__
def __getitem__(self, index)
Definition: waveglow/data_function.py:65
sample::gLogVerbose
LogStreamConsumer gLogVerbose
Definition: logger.cpp:23
half_float::detail::floor
half floor(half arg)
Nearest integer not greater than half value.
Definition: ieee_half.h:2558
nmtSample::Vocabulary::getToken
std::string getToken(int id) const
get token by ID
Definition: vocabulary.cpp:50
nvinfer1::IPluginCreator
Plugin creator class for user-implemented layers.
Definition: NvInferRuntimeCommon.h:890
nvinfer1::plugin::ProposalLayer::detachFromContext
void detachFromContext() override
Detach the plugin object from its execution context.
Definition: proposalLayerPlugin.cpp:368
sample::BuildOptions::calibration
std::string calibration
Definition: sampleOptions.h:122
nvinfer1::CUDADriverWrapper::cuGetErrorName
CUresult cuGetErrorName(CUresult error, const char **pStr) const
Definition: cudaDriverWrapper.cpp:71
nmtSample::Projection::~Projection
~Projection() override=default
generate.propagate_dtype
def propagate_dtype(outputs, dtype)
Definition: 07_creating_a_model_with_the_layer_api/generate.py:25
nvinfer1::plugin::DetectionLayerPluginCreator::mNbClasses
int mNbClasses
Definition: detectionLayerPlugin.h:127
sample::InferenceOptions::rerun
bool rerun
Definition: sampleOptions.h:160
BertInference::allocateBindings
void allocateBindings(const int maxBatchSize)
Definition: bert_infer.h:108
test_importers.TestOnnxImporter.test_import_constant_tensor
def test_import_constant_tensor(self)
Definition: test_importers.py:44
loss_function.WaveGlowLoss.sigma
sigma
Definition: waveglow/loss_function.py:22
SampleMaskRCNN::constructNetwork
bool constructNetwork(SampleUniquePtr< nvinfer1::IBuilder > &builder, SampleUniquePtr< nvinfer1::INetworkDefinition > &network, SampleUniquePtr< nvuffparser::IUffParser > &parser)
Definition: sampleUffMaskRCNN.cpp:388
builder_varseqlen.BertConfig.use_qat
use_qat
Definition: builder_varseqlen.py:100
LogStreamConsumer::severityPrefix
static std::string severityPrefix(Severity severity)
Definition: pluginLogging.h:167
pytorch_quantization.optim.helper.match_parameters
def match_parameters(model, patterns)
Definition: helper.py:24
nvinfer1::plugin::ResizeNearest::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: resizeNearestPlugin.cpp:93
nmtSample::DebugUtil::DumpTensorPlugin::mElemsPerRow
int mElemsPerRow
Definition: debugUtil.h:71
polygraphy.tools.surgeon.subtool.base.BaseSurgeonSubtool.export_graph
def export_graph(self, graph, args, do_type_check=True)
Definition: surgeon/subtool/base.py:49
pytorch_quantization.nn.modules.quant_pooling.QuantAvgPool3d
Definition: quant_pooling.py:103
nvinfer1::plugin::SpecialSlice::enqueue
int enqueue(int batch_size, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: specialSlicePlugin.cpp:162
nvinfer1::plugin::MultilevelProposeROI::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: multilevelProposeROIPlugin.cpp:211
builder.BertConfig.is_calib_mode
is_calib_mode
Definition: builder.py:103
polygraphy.backend.trt.loader.ShapeTuple.__init__
def __init__(self, min, opt, max)
Definition: backend/trt/loader.py:236
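The index lists only the constructor signature `(min, opt, max)` for this class. As a rough illustration of how such a triple might be used, here is a minimal sketch. It assumes the three values are the minimum, optimum, and maximum input shapes of a TensorRT optimization profile. `ShapeTupleSketch` and `is_consistent` are hypothetical names introduced for this example; they are not Polygraphy's API.

```python
class ShapeTupleSketch:
    """Hypothetical stand-in (not the Polygraphy class): holds min/opt/max shapes."""

    def __init__(self, min, opt, max):
        # Parameter names mirror the signature listed in the index.
        self.min = tuple(min)
        self.opt = tuple(opt)
        self.max = tuple(max)

    def __iter__(self):
        # Allows unpacking: lo, best, hi = shape_tuple
        return iter((self.min, self.opt, self.max))

    def is_consistent(self):
        # Assumed helper: same rank, and min <= opt <= max in every dimension.
        if not (len(self.min) == len(self.opt) == len(self.max)):
            return False
        return all(lo <= mid <= hi for lo, mid, hi in zip(self.min, self.opt, self.max))


shapes = ShapeTupleSketch(min=(1, 3, 224, 224), opt=(8, 3, 224, 224), max=(16, 3, 224, 224))
lo, best, hi = shapes
print(lo, best, hi, shapes.is_consistent())
```

Making the object iterable keeps call sites short when the three shapes are passed on together.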
polygraphy.backend.trt_legacy.LoadUffFile.shapes
shapes
Definition: backend/trt_legacy.py:38
test_importers.TestOnnxImporter.test_import_graph_with_dim_param
def test_import_graph_with_dim_param(self)
Definition: test_importers.py:119
nvcaffeparser1::WeightType::kMEAN
@ kMEAN
SampleOnnxMnistCoordConvAC::mEngine
std::shared_ptr< nvinfer1::ICudaEngine > mEngine
The TensorRT engine used to run the network.
Definition: sampleOnnxMnistCoordConvAC.cpp:81
polygraphy.tools.args.onnx.loader.OnnxLoaderArgs.parse
def parse(self, args)
Definition: tools/args/onnx/loader.py:61
generate.x0
x0
Definition: 03_isolating_a_subgraph/generate.py:26
nvinfer1::Dims::nbDims
int32_t nbDims
The number of dimensions.
Definition: NvInferRuntimeCommon.h:198
nvinfer1::plugin::RPROIPlugin::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: nvFasterRCNNPlugin.cpp:173
nvcaffeparser1::parseCrop
nvinfer1::ILayer * parseCrop(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &, BlobNameToTensor &tensors)
Definition: parseCrop.cpp:23
nvinfer1::LayerType::kSHUFFLE
@ kSHUFFLE
Shuffle layer.
anonymous_namespace{generateDetectionPlugin.cpp}::GENERATEDETECTION_PLUGIN_VERSION
const char * GENERATEDETECTION_PLUGIN_VERSION
Definition: generateDetectionPlugin.cpp:31
nvcaffeparser1::BlobNameToTensor::operator[]
nvinfer1::ITensor *& operator[](const std::string &name)
Definition: blobNameToTensor.h:46
onnx_graphsurgeon.ir.graph.Graph._generate_name
def _generate_name(self, prefix)
Definition: graph.py:391
nvinfer1::BuilderFlag::kDEBUG
@ kDEBUG
Enable debugging of layers via synchronizing after every layer.
anonymous_namespace{proposalPlugin.cpp}::RPN_STD_SCALING
static const float RPN_STD_SCALING
Definition: proposalPlugin.cpp:34
ConcatTopKWorkSpace::sortedScoreDims
nvinfer1::DimsHW sortedScoreDims
Definition: maskRCNNKernels.h:197
pytorch_quantization.nn._functions.quant_rnn.StackedRNN
def StackedRNN(inners, num_layers, lstm=False, dropout=0, train=True)
Definition: _functions/quant_rnn.py:88
samplesCommon::BufferManager::getBuffer
void * getBuffer(const bool isHost, const std::string &tensorName) const
Definition: buffers.h:427
polygraphy.backend.trt_legacy.ConvertToUff
Definition: backend/trt_legacy.py:48
SharedMemory::open
int open(int flag, mode_t mode)
Definition: sampleMovieLensMPS.cpp:142
nmtSample::Vocabulary::getSize
int getSize() const
Get the number of elements in the vocabulary.
Definition: vocabulary.cpp:56
tests.quant_conv_transposed_test.TestQuantConvTranspose1D
Definition: quant_conv_transposed_test.py:334
bert::TFusedMultiHeadAttentionXMMAKernel::hashID
uint64_t hashID(unsigned int s, unsigned int d) const
Definition: fused_multihead_attention.h:194
samplesCommon::GpuTimer::mStream
cudaStream_t mStream
Definition: samples/common/common.h:836
nvinfer1::IFullyConnectedLayer
A fully connected layer in a network definition. This layer expects an input tensor of three or more ...
Definition: NvInfer.h:1560
MaskRCNNConfig::MODEL_INPUT
static const std::string MODEL_INPUT
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:171
Logger::TestResult::kPASSED
@ kPASSED
The test passed.
onnx_graphsurgeon.ir.graph.Graph.name
name
Definition: graph.py:118
pytorch_quantization.nn.modules._utils.QuantMixin._input_quantizer
_input_quantizer
Definition: _utils.py:80
nvcaffeparser1::IBinaryProtoBlob::getDimensions
virtual nvinfer1::DimsNCHW getDimensions()=0
polygraphy.tools.args.data_loader.DataLoaderArgs.parse
def parse(self, args)
Definition: tools/args/data_loader.py:53
nvinfer1::plugin::LogStream
Definition: checkMacrosPlugin.h:47
SampleConfig::getReferenceFileName
const char * getReferenceFileName() const
Definition: sampleConfig.h:241
SampleBuffer::SampleBuffer
SampleBuffer(nvinfer1::Dims dims, int dataWidth, TensorFormat format)
Definition: sampleReformatFreeIO.cpp:143
sample::Logger::genCmdlineString
static std::string genCmdlineString(int argc, char const *const *argv)
Generate a command-line string from the given (argc, argv) values.
Definition: samples/common/logging.h:427
util.test_misc.test_unpack_args
def test_unpack_args(case)
Definition: test_misc.py:86
sample::defaultStreams
constexpr int defaultStreams
Definition: sampleOptions.h:45
nvinfer1::plugin::PriorBox::deserializeToDevice
Weights deserializeToDevice(const char *&hostBuffer, size_t count)
Definition: priorBoxPlugin.cpp:228
nvinfer1::plugin::MultilevelProposeROIPluginCreator::mPreNMSTopK
int mPreNMSTopK
Definition: multilevelProposeROIPlugin.h:139
polygraphy.backend.tf.loader.GraphFromCkpt
Definition: backend/tf/loader.py:161
builder_varseqlen.BertConfig.num_attention_heads
num_attention_heads
Definition: builder_varseqlen.py:93
nmtSample::BeamSearchPolicy::BeamSearchPolicy
BeamSearchPolicy(int endSequenceId, LikelihoodCombinationOperator::ptr likelihoodCombinationOperator, int beamWidth)
Definition: beamSearchPolicy.cpp:29
xy_t::xy_t
xy_t(int y_, int x_)
Definition: maskRCNNKernels.h:265
polygraphy.backend.trt.util.get_active_profile_bindings
def get_active_profile_bindings(context)
Definition: trt/util.py:432
sample::BuildOptions::avgTiming
int avgTiming
Definition: sampleOptions.h:111
sample::TrtCudaStream::get
cudaStream_t get() const
Definition: sampleDevice.h:78
polygraphy.tools.args.tf.loader.TfLoaderArgs.model_args
model_args
Definition: tools/args/tf/loader.py:54
half_float::half::operator*=
detail::enable< half &, T >::type operator*=(T rhs)
Arithmetic assignment.
Definition: samples/common/half.h:1461
SampleMaskRCNNParams
Definition: sampleUffMaskRCNN.cpp:304
test_ir.TestGraph.test_cleanup_multi_tier
def test_cleanup_multi_tier(self)
Definition: test_ir.py:625
SampleAlgorithmSelector::verifyOutput
bool verifyOutput(const samplesCommon::BufferManager &buffers, const std::string &outputTensorName, int groundTruthDigit) const
Verifies that the output is correct and prints it.
Definition: sampleAlgorithmSelector.cpp:476
polygraphy.tools.run.run.Run.run
def run(self, args)
Definition: run.py:125
tests.quant_conv_test.TestQuantConv1D.test_fake_quant_per_channel_bias
def test_fake_quant_per_channel_bias(self)
Definition: quant_conv_test.py:410
half_float::detail::half2int
T half2int(uint16 value)
Convert half-precision floating point to integer.
Definition: ieee_half.h:995
test_examples.Example.__init__
def __init__(self, path_components, artifact_names=[])
Definition: Polygraphy/tests/test_examples.py:55
SampleUffMNIST::SampleUffMNIST
SampleUffMNIST(const samplesCommon::UffSampleParams &params)
Definition: sampleUffMNIST.cpp:59
bert::anonymous_namespace{qkvToContextInt8InterleavedPlugin.cpp}::QKV_TO_CONTEXT_INTERLEAVED_PLUGIN_NAME
static const char * QKV_TO_CONTEXT_INTERLEAVED_PLUGIN_NAME
Definition: qkvToContextInt8InterleavedPlugin.cpp:40
half_float::detail::functions::minus
static expr minus(float x, float y)
Subtraction implementation.
Definition: samples/common/half.h:1598
nvinfer1::plugin::Region::getOutputDataType
DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: regionPlugin.cpp:425
bert::SkipLayerNormInterleavedPlugin::getWorkspaceSize
size_t getWorkspaceSize(const nvinfer1::PluginTensorDesc *inputs, int nbInputs, const nvinfer1::PluginTensorDesc *outputs, int nbOutputs) const override
Definition: skipLayerNormInt8InterleavedPlugin.cpp:155
nmtSample::ComponentWeights::operator>>
friend std::istream & operator>>(std::istream &input, ComponentWeights &value)
Definition: componentWeights.cpp:23
polygraphy.backend.trt.loader.SaveEngine.__init__
def __init__(self, engine, path=None)
Definition: backend/trt/loader.py:490
half_float::detail::erfc
expr erfc(half arg)
Complementary error function.
Definition: ieee_half.h:2526
sample::LogStreamConsumer::severityPrefix
static std::string severityPrefix(Severity severity)
Definition: samples/common/logging.h:165
nmtSample::BeamSearchPolicy::mSampleCount
int mSampleCount
Definition: beamSearchPolicy.h:72
config.preprocess
def preprocess(dynamic_graph)
Definition: samples/opensource/sampleUffFasterRCNN/config.py:33
bert::FusedMultiHeadAttentionKernelMetaInfoV1::mCubinSize
unsigned int mCubinSize
Definition: fused_multihead_attention.h:122
nvinfer1::plugin::GenerateDetection::mKeepTopK
int mKeepTopK
Definition: generateDetectionPlugin.h:96
nvinfer1::plugin::RPROIParams::anchorsScaleCount
int32_t anchorsScaleCount
Definition: NvInferPluginUtils.h:124
nvinfer1::plugin::CodeTypeSSD::CENTER_SIZE
@ CENTER_SIZE
Use box centers and size.
main.main
def main()
Definition: main.py:30
infer.DeviceBuffer.free
def free(self)
Definition: infer.py:39
polygraphy.logger.logger.LoggerVerbosity
Definition: Polygraphy/polygraphy/logger/logger.py:58
pytorch_quantization.nn.modules.quant_instancenorm.QuantInstanceNorm1d.forward
def forward(self, input)
Definition: quant_instancenorm.py:44
multiproc.main
def main()
Definition: multiproc.py:23
nvinfer1::plugin::TRTException::message
const char * message
Definition: checkMacrosPlugin.h:97
nvinfer1::plugin::BatchedNMSPluginCreator::BatchedNMSPluginCreator
BatchedNMSPluginCreator()
Definition: batchedNMSPlugin.cpp:486
nvinfer1::plugin::Reorg::clone
IPluginV2Ext * clone() const override
Clone the plugin object.
Definition: reorgPlugin.cpp:175
nvinfer1::IFillLayer
Generate an output tensor with specified mode.
Definition: NvInfer.h:4762
onnx_graphsurgeon.util.misc.SynchronizedList.__delitem__
def __delitem__(self, index)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/util/misc.py:86
nmtSample::DebugUtil::DumpTensorPlugin::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: debugUtil.cpp:33
half_float::detail::fmod
expr fmod(half x, half y)
Remainder of division.
Definition: ieee_half.h:2243
polygraphy.backend.tf.loader.GraphFromCkpt.__init__
def __init__(self, dir, name=None)
Definition: backend/tf/loader.py:162
model.Decoder.n_frames_per_step
n_frames_per_step
Definition: demo/Tacotron2/tacotron2/model.py:243
nvinfer1::plugin::InstanceNormalizationPlugin::mPluginNamespace
std::string mPluginNamespace
Definition: instanceNormalizationPlugin.h:99
nvcaffeparser1::BlobNameToTensor
Definition: blobNameToTensor.h:28
pytorch_quantization.nn.modules.clip.Clip.clip_value_max
clip_value_max
Definition: clip.py:52
helpers.calibrator.BertCalibrator.current_index
current_index
Definition: demo/BERT/helpers/calibrator.py:39
sample::ReportingOptions
Definition: sampleOptions.h:169
MaskRCNNUtils
Definition: sampleUffMaskRCNN.cpp:50
nvinfer1::plugin::GridAnchorParameters::variance
float variance[4]
Definition: NvInferPluginUtils.h:148
test_ir.TestVariable.test_equals_name_mismatch
def test_equals_name_mismatch(self)
Definition: test_ir.py:116
nvinfer1::plugin::ProposalPlugin::~ProposalPlugin
~ProposalPlugin() override
Definition: proposalPlugin.cpp:148
polygraphy.util.misc.is_compressed
def is_compressed(obj)
Definition: Polygraphy/polygraphy/util/misc.py:183
nvinfer1::plugin::ProposalPluginCreator::mNamespace
std::string mNamespace
Definition: proposalPlugin.h:145
nvonnxparser::ErrorCode::kUNSUPPORTED_GRAPH
@ kUNSUPPORTED_GRAPH
nvinfer1::plugin::NMSPluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: nmsPlugin.cpp:281
samplesCommon::topKMagnitudes
std::vector< size_t > topKMagnitudes(const std::vector< T > &v, const size_t k)
Definition: samples/common/common.h:443
half_float::detail::pow
expr pow(half base, half exp)
Power function.
Definition: ieee_half.h:2404
polygraphy.tools.surgeon.subtool.sanitize
Definition: sanitize.py:1
tools.test_run.TestOther.test_save_load_inputs
def test_save_load_inputs(self)
Definition: test_run.py:265
helpers.tokenization._is_whitespace
def _is_whitespace(char)
Definition: tokenization.py:392
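The index gives only the signature of this helper. The sketch below shows one common way a BERT-style tokenizer might classify whitespace, assuming the usual convention: space, tab, newline, and carriage return count as whitespace, as does any character in Unicode category `Zs`. `is_whitespace_sketch` is a hypothetical name; the actual body at the listed location may differ.

```python
import unicodedata


def is_whitespace_sketch(char):
    """Return True if `char` should be treated as whitespace (assumed convention)."""
    # Control characters that are conventionally treated as whitespace.
    if char in (" ", "\t", "\n", "\r"):
        return True
    # "Zs" is the Unicode "Separator, space" category (e.g. the no-break space).
    return unicodedata.category(char) == "Zs"


print([is_whitespace_sketch(c) for c in (" ", "\t", "\u00a0", "a")])
```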
tests.tensor_quant_test.TestFakeTensorQuant.test_per_tensor_scale
def test_per_tensor_scale(self)
Definition: tensor_quant_test.py:141
polygraphy.util.cuda.Stream.address
def address(self)
Definition: cuda.py:103
nvinfer1::ILayer::setName
virtual void setName(const char *name)=0
Set the name of a layer.
sample::AllOptions::inference
InferenceOptions inference
Definition: sampleOptions.h:191
nvinfer1::IPluginFactory::~IPluginFactory
virtual ~IPluginFactory()
Definition: NvInferRuntime.h:985
polygraphy.backend.tf.loader.SaveGraph.tensorboard_dir
tensorboard_dir
Definition: backend/tf/loader.py:324
samplesCommon::SampleParams::dlaCore
int32_t dlaCore
Specify the DLA core to run network on.
Definition: argsParser.h:39
nvonnxparser::IOnnxConfig
Configuration Manager Class.
Definition: NvOnnxConfig.h:45
nvinfer1::plugin::MultilevelCropAndResize::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: multilevelCropAndResizePlugin.cpp:212
MaskRCNNConfig::RPN_NMS_THRESHOLD
static const float RPN_NMS_THRESHOLD
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:80
polygraphy.backend.tf.loader.UseTfTrt.max_workspace_size
max_workspace_size
Definition: backend/tf/loader.py:226
MaskRCNNUtils::BBoxInfo::mask
Mask * mask
Definition: sampleUffMaskRCNN.cpp:68
SampleCharRNNBase::SampleUniquePtr
std::unique_ptr< T, samplesCommon::InferDeleter > SampleUniquePtr
Definition: sampleCharRNN.cpp:146
inference.args
def args
Definition: BERT/inference.py:83
SampleCharRNNParams::useILoop
bool useILoop
Definition: sampleCharRNN.cpp:133
nvinfer1::plugin::PyramidROIAlignPluginCreator::createPlugin
IPluginV2Ext * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: pyramidROIAlignPlugin.cpp:58
nvinfer1::plugin::Region::isOutputBroadcastAcrossBatch
bool isOutputBroadcastAcrossBatch(int outputIndex, const bool *inputIsBroadcasted, int nbInputs) const override
Definition: regionPlugin.cpp:432
polygraphy.backend.trt_legacy.TrtLegacyRunner.get_input_metadata
def get_input_metadata(self)
Definition: backend/trt_legacy.py:315
bert::QKVToContextInterleavedPlugin::createMHARunner
void createMHARunner()
polygraphy.logger.logger.Logger.extra_verbose
def extra_verbose(self, message, mode=LogMode.EACH)
Definition: Polygraphy/polygraphy/logger/logger.py:258
polygraphy.backend.tf.loader.UseTfTrt.fp16
fp16
Definition: backend/tf/loader.py:227
nmtSample::Encoder::~Encoder
~Encoder() override=default
samplesCommon::getC
int getC(const Dims &d)
Definition: samples/common/common.h:881
backend.trt.test_runner.TestTrtRunner.test_multiple_profiles
def test_multiple_profiles(self)
Definition: trt/test_runner.py:80
SampleMLP::weightsMemory
std::vector< SampleUniquePtr< nvinfer1::IHostMemory > > weightsMemory
Host weights memory holder.
Definition: sampleMLP.cpp:94
tools.test_run.TestTrtLegacy
Definition: test_run.py:50
polygraphy.comparator.data_loader.DataLoaderCache.set_input_metadata
def set_input_metadata(self, input_metadata)
Definition: comparator/data_loader.py:265
backend.onnxrt.test_runner.TestOnnxrtRunner.test_shape_output
def test_shape_output(self)
Definition: onnxrt/test_runner.py:45
test_ir.TestGraph.test_fold_constants_one_hop
def test_fold_constants_one_hop(self)
Definition: test_ir.py:726
pytorch_quantization.nn.modules.quant_bert.QuantBertSelfAttention.output_attentions
output_attentions
Definition: quant_bert.py:57
MNISTBatchStream::getBatch
float * getBatch() override
Definition: BatchStream.h:73
IBatchStream::reset
virtual void reset(int firstBatch)=0
nvinfer1::plugin::MultilevelCropAndResizePluginCreator::createPlugin
IPluginV2Ext * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: multilevelCropAndResizePlugin.cpp:62
inference.MeasureTime.t0
t0
Definition: Tacotron2/inference.py:173
model.WN.n_layers
n_layers
Definition: demo/Tacotron2/waveglow/model.py:93
SampleConfig::getFullTextFileName
const char * getFullTextFileName() const
Get the File Name of the Network Description as a Text File, including the weights.
Definition: sampleConfig.h:145
polygraphy.backend.trt.runner.TrtRunner._engine_or_context
_engine_or_context
Definition: backend/trt/runner.py:50
tests.integration_test.TestNetwork.test_forward
def test_forward(self)
Definition: integration_test.py:62
MNISTBatchStream::reset
void reset(int firstBatch) override
Definition: BatchStream.h:53
nvinfer1::plugin::ProposalPlugin
Definition: proposalPlugin.h:36
half_float::detail::functions::divides
static expr divides(float x, float y)
Division implementation.
Definition: samples/common/half.h:1616
polygraphy.util.format.FormatManager.convert
def convert(shape, new_format)
Definition: format.py:101
nvinfer1::plugin::CropAndResizePluginCreator::CropAndResizePluginCreator
CropAndResizePluginCreator()
Definition: cropAndResizePlugin.cpp:243
nvinfer1::DimensionType::kSEQUENCE
@ kSEQUENCE
Elements correspond to different sequence values.
nvinfer1::plugin::Region::setSoftmaxTree
void setSoftmaxTree(const std::shared_ptr< softmaxTree > &softmaxTree)
Definition: regionPlugin.h:87
polygraphy.logger.logger.Logger.register_callback
def register_callback(self, callback)
Definition: Polygraphy/polygraphy/logger/logger.py:150
sample::TrtCudaEvent::operator=
TrtCudaEvent & operator=(const TrtCudaEvent &)=delete
nmtSample::Attention::ptr
std::shared_ptr< Attention > ptr
Definition: attention.h:35
text.cleaners.basic_cleaners
def basic_cleaners(text)
Definition: cleaners.py:83
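Only the signature `basic_cleaners(text)` appears here. In Tacotron-style text front ends, a "basic" cleaner typically lowercases the input and collapses runs of whitespace. The sketch below illustrates that assumed behaviour; `basic_cleaners_sketch` is a hypothetical name and is not taken from the listed file.

```python
import re

# Matches one or more consecutive whitespace characters.
_WHITESPACE_RE = re.compile(r"\s+")


def basic_cleaners_sketch(text):
    """Lowercase `text` and collapse each whitespace run into a single space."""
    return _WHITESPACE_RE.sub(" ", text.lower())


print(basic_cleaners_sketch("Hello   World"))
```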
pytorch_quantization.utils.amp_wrapper.float_function
def float_function(fn)
Definition: amp_wrapper.py:26
polygraphy.backend.tf.loader.SessionFromGraph
Definition: backend/tf/loader.py:392
tests.calibrator_test.TestMaxCalibrator.test_track_amax
def test_track_amax(self)
Definition: calibrator_test.py:78
onnx_graphsurgeon.ir.graph.Graph.__str__
def __str__(self)
Definition: graph.py:466
onnx_models.Model.__init__
def __init__(self, str path, List[Tensor] inputs, List[Tensor] outputs, List[Node] nodes, int opset)
Definition: onnx_models.py:35
nmtSample::DataReader
Reader of sequences of data.
Definition: dataReader.h:31
samplesCommon::getFileType
std::string getFileType(const std::string &filepath)
Definition: samples/common/common.h:489
SimpleProfiler::mName
std::string mName
Definition: samples/common/common.h:220
sample::AllOptions::help
static void help(std::ostream &out)
Definition: sampleOptions.cpp:1308
polygraphy.comparator.compare.OutputCompareResult.passed
passed
Definition: compare.py:38
SampleMovieLensParams::ratingInputFile
std::string ratingInputFile
Definition: sampleMovieLens.cpp:68
polygraphy.backend.trt_legacy.ParseNetworkFromOnnxLegacy
Definition: backend/trt_legacy.py:117
pytorch_quantization.nn.modules.quant_rnn.GRUCell.bias_ih
bias_ih
Definition: modules/quant_rnn.py:434
nvinfer1::plugin::GridAnchorGenerator::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: gridAnchorPlugin.cpp:279
pytorch_quantization.nn.modules.quant_conv.QuantConv3d.forward
def forward(self, input)
Definition: quant_conv.py:160
polygraphy.tools.args.onnxrt.runner.OnnxrtRunnerArgs.add_to_script
def add_to_script(self, script)
Definition: tools/args/onnxrt/runner.py:32
nvinfer1::ISelectLayer::~ISelectLayer
virtual ~ISelectLayer()
Definition: NvInfer.h:4721
nvinfer1::plugin::SplitPlugin::_d_output_ptrs
thrust::device_vector< float * > _d_output_ptrs
Definition: split.h:45
sample::TrtCudaBuffer::operator=
TrtCudaBuffer & operator=(TrtCudaBuffer &&rhs)
Definition: sampleDevice.h:229
samplesCommon::GpuTimer::GpuTimer
GpuTimer(cudaStream_t stream)
Definition: samples/common/common.h:810
backend.trt.test_loader.identity_network
def identity_network()
Definition: trt/test_loader.py:48
nvcaffeparser1::CaffeWeightFactory::getBlobProtoData
static std::pair< const void *, size_t > getBlobProtoData(const trtcaffe::BlobProto &blobMsg, trtcaffe::Type type, std::vector< void * > &tmpAllocs)
Definition: caffeWeightFactory.cpp:279
sample::anonymous_namespace{sampleInference.cpp}::Iteration::record
void record(EventType e, StreamType s)
Definition: sampleInference.cpp:397
export_tacotron2_onnx.DecoderIter.__init__
def __init__(self, tacotron2)
Definition: export_tacotron2_onnx.py:104
polygraphy.backend.tf.runner.TfRunner.num_inferences
num_inferences
Definition: backend/tf/runner.py:54
half_float::detail::result
Return type for specialized generic 2-argument half-precision functions.
Definition: ieee_half.h:393
samplesCommon::parseDLA
int parseDLA(int argc, char **argv)
Definition: samples/common/common.h:602
samplesCommon::Args::useILoop
bool useILoop
Definition: argsParser.h:89
nvinfer1::plugin::DetectionLayerPluginCreator::deserializePlugin
IPluginV2Ext * deserializePlugin(const char *name, const void *data, size_t length) override
Called during deserialization of plugin layer.
Definition: detectionLayerPlugin.cpp:92
pytorch_quantization.nn.modules.quant_rnn.QuantRNNCell.input_size
input_size
Definition: modules/quant_rnn.py:342
nmtSample::Vocabulary::mNumTokens
int mNumTokens
Definition: vocabulary.h:74
nvinfer1::plugin::ReorgPluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: reorgPlugin.cpp:195
polygraphy.backend.trt_legacy.ParseNetworkFromOnnxLegacy.__init__
def __init__(self, onnx_loader)
Definition: backend/trt_legacy.py:118
IBatchStream::getDims
virtual nvinfer1::Dims getDims() const =0
generate.c
c
Definition: 05_folding_constants/generate.py:31
half_float::detail::functions::asin
static expr asin(float arg)
Arc sine implementation.
Definition: samples/common/half.h:1910
SampleOnnxMNIST
The SampleOnnxMNIST class implements the ONNX MNIST sample.
Definition: sampleOnnxMNIST.cpp:46
SampleConfig::getTopK
uint64_t getTopK() const
Definition: sampleConfig.h:277
nvinfer1::plugin::PluginCreatorRegistry::mRegistryLock
std::mutex mRegistryLock
Definition: InferPlugin.cpp:142
parserutils::printMem
void printMem(const char *where)
Definition: parserUtils.h:56
builder_varseqlen.set_tensor_name
def set_tensor_name(tensor, prefix, name)
Definition: builder_varseqlen.py:111
tests.quant_rnn_test.TestQuantLSTM.test_against_unquantized
def test_against_unquantized(self, verbose)
Definition: quant_rnn_test.py:380
backend.trt.test_loader.TestNetworkFromOnnxPath.test_loader_explicit_precision
def test_loader_explicit_precision(self)
Definition: trt/test_loader.py:113
polygraphy.backend.trt_legacy.ConvertToUff.tf_loader
tf_loader
Definition: backend/trt_legacy.py:50
model.Attention
Definition: demo/Tacotron2/tacotron2/model.py:48
half_float::detail::isnormal
bool isnormal(half arg)
Check if normal number.
Definition: ieee_half.h:2752
half_float::detail::functions::copysign
static half copysign(half x, half y)
Sign implementation.
Definition: samples/common/half.h:2321
SampleOnnxMNIST::verifyOutput
bool verifyOutput(const samplesCommon::BufferManager &buffers)
Classifies digits and verifies the result.
Definition: sampleOnnxMNIST.cpp:271
ProposalWorkSpace::preRefineSortedScoreDims
nvinfer1::DimsHW preRefineSortedScoreDims
Definition: maskRCNNKernels.h:127
Args::engine
std::string engine
Definition: demo/BERT/infer_c/common.h:29
nvinfer1::IPluginV2Ext::getOutputDataType
virtual nvinfer1::DataType getOutputDataType(int32_t index, const nvinfer1::DataType *inputTypes, int32_t nbInputs) const =0
Return the DataType of the plugin output at the requested index.
nvcaffeparser1::BinaryProtoBlob::mMemory
void * mMemory
Definition: binaryProtoBlob.h:61
nvinfer1::plugin::MultilevelCropAndResize::~MultilevelCropAndResize
~MultilevelCropAndResize() override=default
nmtSample::ComponentWeights::ComponentWeights
ComponentWeights()=default
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer
Definition: tensor_quantizer.py:34
nmtSample::LSTMDecoder::mGateBiasWeights
std::vector< nvinfer1::Weights > mGateBiasWeights
Definition: lstmDecoder.h:48
tests.fixtures.models.LeNet.__init__
def __init__(self, **kwargs)
Definition: tools/pytorch-quantization/tests/fixtures/models.py:27
SimpleProfiler::reportLayerTime
virtual void reportLayerTime(const char *layerName, float ms)
Layer time reporting callback.
Definition: samples/common/common.h:147
model.Invertible1x1Conv.__init__
def __init__(self, c)
Definition: demo/Tacotron2/waveglow/model.py:39
SampleCharRNNLoop::LstmParams::inputBias
nvinfer1::ITensor * inputBias
Definition: sampleCharRNN.cpp:252
mrcnn_to_trt_single.main
def main(args=None)
Definition: mrcnn_to_trt_single.py:99
nvinfer1::plugin::BatchTilePlugin::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: batchTilePlugin.cpp:120
helpers.tokenization.BertTokenizer.convert_ids_to_tokens
def convert_ids_to_tokens(self, ids)
Definition: tokenization.py:208
polygraphy.tools.precision.precision.STCheckerBase.args
args
Definition: precision.py:73
SampleConfig::setFullTextFileName
void setFullTextFileName(const char *fullTextFilename)
Set the File Name of the Network Description as a Text File, including the weights.
Definition: sampleConfig.h:149
nvinfer1::plugin::DetectionOutputParameters::backgroundLabelId
int32_t backgroundLabelId
Definition: NvInferPluginUtils.h:184
nvinfer1::Dims4
Descriptor for four-dimensional data.
Definition: NvInfer.h:290
nvinfer1::plugin::logError
void logError(const char *msg, const char *file, const char *fn, int line)
Definition: checkMacrosPlugin.cpp:94
polygraphy.backend.trt_legacy
Definition: backend/trt_legacy.py:1
bert::QKVToContextInterleavedPlugin::enqueue
int enqueue(const nvinfer1::PluginTensorDesc *inputDesc, const nvinfer1::PluginTensorDesc *outputDesc, const void *const *inputs, void *const *outputs, void *workspace, cudaStream_t stream) override
Execute the layer.
Definition: qkvToContextInt8InterleavedPlugin.cpp:221
nvonnxparser::IOnnxConfig::setTextFileName
virtual void setTextFileName(const char *textFileName)=0
Set the File Name of the Network Description as a Text File.
nvinfer1::plugin::BasePlugin::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: plugin.h:52
polygraphy.logger.logger.Logger.log
def log(self, message, severity, mode=LogMode.EACH, stack_depth=2)
Definition: Polygraphy/polygraphy/logger/logger.py:182
nvinfer1::plugin::CropAndResizePlugin::getWorkspaceSize
size_t getWorkspaceSize(int) const override
Definition: cropAndResizePlugin.h:58
test_ir.TestNode.test_equals_inputs_mismatch
def test_equals_inputs_mismatch(self)
Definition: test_ir.py:158
nvinfer1::plugin::ProposalLayer::~ProposalLayer
~ProposalLayer() override=default
SampleMNISTAPI::infer
bool infer()
Runs the TensorRT inference engine for this sample.
Definition: sampleMNISTAPI.cpp:262
nvinfer1::plugin::initializePlugin
void initializePlugin(void *logger, const char *libNamespace)
Definition: InferPlugin.cpp:152
polygraphy.tools.args.base
Definition: args/base.py:1
model.WaveGlow.n_remaining_channels
n_remaining_channels
Definition: demo/Tacotron2/waveglow/model.py:183
test_examples.Artifact.name
name
Definition: onnx-graphsurgeon/tests/test_examples.py:35
samplesCommon::splitString
std::vector< std::string > splitString(std::string str, char delimiter=',')
Definition: samples/common/common.h:861
nvinfer1::IPlugin
Plugin class for user-implemented layers.
Definition: NvInferRuntime.h:113
nvinfer1::plugin::BatchedNMSDynamicPluginCreator::BatchedNMSDynamicPluginCreator
BatchedNMSDynamicPluginCreator()
Definition: batchedNMSPlugin.cpp:491
onnx_graphsurgeon.logger.logger.Logger.logging_indent
logging_indent
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:101
polygraphy.backend.onnx.util.unmark_outputs
def unmark_outputs(model, outputs)
Definition: onnx/util.py:97
SampleMLP::processInput
bool processInput(const samplesCommon::BufferManager &buffers)
Reads the input and stores the result in a managed buffer.
Definition: sampleMLP.cpp:297
sample::LogStreamConsumer::mShouldLog
bool mShouldLog
Definition: samples/common/logging.h:178
models.meta.check_dynamic_identity
def check_dynamic_identity(runner, shapes)
Definition: meta.py:61
models.meta.check_tf_identity
def check_tf_identity(runner)
Definition: meta.py:38
SampleINT8APIParams::networkTensorsFileName
std::string networkTensorsFileName
Definition: sampleINT8API.cpp:67
polygraphy.tools.args.tf.loader.TfLoaderArgs.get_tf_loader
def get_tf_loader(self)
Definition: tools/args/tf/loader.py:137
polygraphy.tools.base.tool.Tool.__call__
def __call__(self, args)
Definition: tool.py:72
nvinfer1::TensorFormat::kDLA_LINEAR
@ kDLA_LINEAR
DLA planar format.
bert::EmbLayerNormVarSeqlenPluginCreator::mFC
static nvinfer1::PluginFieldCollection mFC
Definition: embLayerNormVarSeqlenPlugin.h:135
half_float::detail::binary_specialized< half, half >::fmin
static half fmin(half x, half y)
Definition: samples/common/half.h:2585
polygraphy.backend.tf.loader.OptimizeGraph.__init__
def __init__(self, graph)
Definition: backend/tf/loader.py:28
nvinfer1::plugin::GenerateDetection::enqueue
int enqueue(int batch_size, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: generateDetectionPlugin.cpp:272
samplesCommon::TimerBase
Definition: samples/common/common.h:781
nmtSample::Projection
Calculates raw logits.
Definition: projection.h:32
nvinfer1::CUDADriverWrapper::_cuLinkDestroy
CUresult(* _cuLinkDestroy)(CUlinkState)
Definition: cudaDriverWrapper.h:74
nvinfer1::plugin::BatchTilePlugin::canBroadcastInputAcrossBatch
bool canBroadcastInputAcrossBatch(int inputIndex) const override
Definition: batchTilePlugin.cpp:132
util.test_misc.test_find_in_dict
def test_find_in_dict(case)
Definition: test_misc.py:47
polygraphy.comparator.comparator.Comparator
Definition: comparator/comparator.py:31
classification_flow.finetune_model
def finetune_model(model, data_loader)
Definition: classification_flow.py:412
onnx_graphsurgeon.ir.graph.Graph.__setattr__
def __setattr__(self, name, value)
Definition: graph.py:144
backend.onnx.test_loader.TestModifyOnnx.test_exclude_outputs_with_layerwise
def test_exclude_outputs_with_layerwise(self)
Definition: onnx/test_loader.py:67
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer.disable_clip
def disable_clip(self)
Definition: tensor_quantizer.py:157
SampleCharRNNLoop::LstmParams::inputWeights
nvinfer1::ITensor * inputWeights
Definition: sampleCharRNN.cpp:250
nvinfer1::utility::TRTOption::shortName
char shortName
Option name in short (single hyphen) form (e.g. -a, -b)
Definition: getOptions.h:36
nvinfer1::plugin::PriorBox::getOutputDataType
DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: priorBoxPlugin.cpp:294
Logger::TestAtom::mCmdline
std::string mCmdline
Definition: pluginLogging.h:304
bert::QKVToContextInterleavedPlugin::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: qkvToContextInt8InterleavedPlugin.cpp:177
UffPoolPluginV2Creator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: sampleUffPluginV2Ext.cpp:653
bert::QKVToContextInterleavedPluginCreator::deserializePlugin
nvinfer1::IPluginV2 * deserializePlugin(const char *name, const void *serialData, size_t serialLength) override
Called during deserialization of plugin layer.
Definition: qkvToContextInt8InterleavedPlugin.cpp:345
nmtSample::MultiplicativeAlignment::getInfo
std::string getInfo() override
get the textual description of the component
Definition: multiplicativeAlignment.cpp:76
polygraphy.backend.trt.runner.TrtRunner.engine
engine
Definition: backend/trt/runner.py:84
bert::IIDX
constexpr uint32_t IIDX
Definition: qkvToContextInt8InterleavedPlugin.cpp:49
sample::LogStreamConsumer::LogStreamConsumer
LogStreamConsumer(Severity reportableSeverity, Severity severity)
Creates a LogStreamConsumer which logs messages with level severity.
Definition: samples/common/logging.h:137
SampleConfig::setImageFileName
void setImageFileName(const char *imageFilename)
Definition: sampleConfig.h:237
nvinfer1::DimensionType::kSPATIAL
@ kSPATIAL
Elements correspond to different spatial data.
nvinfer1::plugin::RPROIPlugin::copyToHost
float * copyToHost(const void *srcHostData, int count)
Definition: nvFasterRCNNPlugin.cpp:200
nvinfer1::plugin::GroupNormalizationPlugin::attachToContext
void attachToContext(cudnnContext *cudnn, cublasContext *cublas, nvinfer1::IGpuAllocator *allocator) override
Attach the plugin object to an execution context and grant the plugin the access to some context reso...
Definition: groupNormalizationPlugin.cpp:92
nvinfer1::DimsHW::DimsHW
DimsHW()
Construct an empty DimsHW object.
Definition: NvInfer.h:101
nvinfer1::plugin::reportAssertion
void reportAssertion(const char *msg, const char *file, int line)
Definition: checkMacrosPlugin.cpp:101
sample::Logger
Class which manages logging of TensorRT tools and samples.
Definition: samples/common/logging.h:206
comparator.test_data_loader.TestDataLoaderCache.test_will_not_give_up_on_first_cache_miss
def test_will_not_give_up_on_first_cache_miss(self)
Definition: test_data_loader.py:118
pytorch_quantization.nn.modules.quant_pooling.QuantAdaptiveAvgPool3d.__init__
def __init__(self, output_size, **kwargs)
Definition: quant_pooling.py:143
nvinfer1::ErrorCode::kINTERNAL_ERROR
@ kINTERNAL_ERROR
A non-recoverable TensorRT error occurred.
polygraphy.backend.tf.loader.CreateConfig.gpu_memory_fraction
gpu_memory_fraction
Definition: backend/tf/loader.py:369
SampleUffFasterRcnn::build
bool build()
Function builds the network engine.
Definition: sampleUffFasterRCNN.cpp:163
SampleCharRNNBindingNames::CELL_IN_BLOB_NAME
const char * CELL_IN_BLOB_NAME
Definition: sampleCharRNN.cpp:89
nvinfer1::plugin::ProposalPlugin::initialize
int initialize() override
Initialize the layer for execution.
Definition: proposalPlugin.cpp:178
polygraphy.backend.base.runner
Definition: backend/base/runner.py:1
bert::DATA_TYPE_INT4
@ DATA_TYPE_INT4
Definition: fused_multihead_attention_common.h:29
half_float::detail::enable< T, half, half, void >::type
T type
Definition: ieee_half.h:375
sample::anonymous_namespace{sampleInference.cpp}::EnqueueFunction
std::function< void(TrtCudaStream &)> EnqueueFunction
Definition: sampleInference.cpp:250
polygraphy.backend.trt.loader.ShapeTuple.opt
opt
Definition: backend/trt/loader.py:246
nvinfer1::plugin::PriorBoxParameters
The PriorBox plugin layer generates the prior boxes of designated sizes and aspect ratios across all ...
Definition: NvInferPluginUtils.h:91
nmtSample::TextWriter::mOutput
std::shared_ptr< std::ostream > mOutput
Definition: textWriter.h:49
tools.test_surgeon.TestSurgeonExtract
Definition: test_surgeon.py:24
polygraphy.comparator.struct.RunResults.update
def update(self, other)
Definition: comparator/struct.py:85
text.cmudict._parse_cmudict
def _parse_cmudict(file)
Definition: cmudict.py:60
pytorch_quantization.nn.functional.ClipFunction.forward
def forward(ctx, input, clip_value_min, clip_value_max)
Definition: functional.py:36
sample::defaultIterations
constexpr int defaultIterations
Definition: sampleOptions.h:46
SampleConfig::mTolerance
float mTolerance
Definition: sampleConfig.h:59
bert::kSM_AMPERE
static constexpr int32_t kSM_AMPERE
Definition: qkvToContextInt8InterleavedPlugin.h:31
layers.ConvNorm
Definition: layers.py:36
sample::anonymous_namespace{sampleInference.cpp}::SyncStruct::mainStream
TrtCudaStream mainStream
Definition: sampleInference.cpp:169
example.create_network
def create_network(builder, network)
Definition: Polygraphy/examples/api/05_using_tensorrt_network_api/example.py:34
sample::printProlog
void printProlog(int warmups, int timings, float warmupMs, float benchTimeMs, std::ostream &os)
Print benchmarking time and number of traces collected.
Definition: sampleReporting.cpp:79
half_float::detail::operator*
enable< expr, T, U >::type operator*(T x, U y)
Multiply halfs.
Definition: ieee_half.h:2184
pytorch_quantization.nn.modules.quant_bert.QuantBertSelfAttention.__init__
def __init__(self, config)
Definition: quant_bert.py:50
batchPrepare.width
int width
Definition: batchPrepare.py:35
nvinfer1::LayerType::kCONCATENATION
@ kCONCATENATION
Concatenation layer.
data_function.TextMelLoader.get_mel_text_pair
def get_mel_text_pair(self, audiopath_and_text)
Definition: tacotron2/data_function.py:45
nvinfer1::IErrorRecorder::~IErrorRecorder
virtual ~IErrorRecorder() noexcept
Definition: NvInferRuntimeCommon.h:1246
polygraphy.tools.args.onnx.loader.OnnxLoaderArgs._get_modify_onnx_str
def _get_modify_onnx_str(self, script, loader_name, disable_outputs=None)
Definition: tools/args/onnx/loader.py:71
helpers.calibrator.BertCalibrator.get_batch
def get_batch(self, names)
Definition: demo/BERT/helpers/calibrator.py:58
nmtSample::DebugUtil::DumpTensorPlugin::ptr
std::shared_ptr< DumpTensorPlugin > ptr
Definition: debugUtil.h:41
helpers.calibrator.BertCalibrator.max_seq_length
max_seq_length
Definition: demo/BERT/helpers/calibrator.py:37
onnx_graphsurgeon.logger.logger.LoggerSuppress.__exit__
def __exit__(self, exc_type, exc_value, traceback)
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:51
polygraphy.backend.trt_legacy.LoadUffFile
Definition: backend/trt_legacy.py:35
nvinfer1::plugin::PyramidROIAlign::enqueue
int enqueue(int batch_size, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: pyramidROIAlignPlugin.cpp:185
SampleCharRNNLoop
Definition: sampleCharRNN.cpp:238
nvinfer1::IPluginExt::configure
void configure(const Dims *, int32_t, const Dims *, int32_t, int32_t)
Derived classes should not implement this.
Definition: NvInferRuntime.h:279
nvinfer1::plugin::GenerateDetection::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: generateDetectionPlugin.cpp:130
nmtSample::Decoder::getStateSizes
virtual std::vector< nvinfer1::Dims > getStateSizes()=0
get the sizes (vector of them) of the hidden state vectors
polygraphy.logger.logger.LoggerIndent.__exit__
def __exit__(self, exc_type, exc_value, traceback)
Definition: Polygraphy/polygraphy/logger/logger.py:53
nvinfer1::plugin::FlattenConcatPluginCreator
Definition: flattenConcat.h:108
backend.trt.test_calibrator.TestCalibrator.test_calibrator_caches_without_explicit_cache
def test_calibrator_caches_without_explicit_cache(self, identity_builder_network)
Definition: test_calibrator.py:114
builder.BertConfig.use_int8_skipln
use_int8_skipln
Definition: builder.py:101
pytorch_quantization.tensor_quant.ScaledQuantDescriptor._amax
_amax
Definition: tensor_quant.py:98
sample::anonymous_namespace{sampleEngines.cpp}::setTensorScales
bool setTensorScales(const INetworkDefinition &network, float inScales=2.0f, float outScales=4.0f)
Definition: sampleEngines.cpp:254
sample::anonymous_namespace{sampleInference.cpp}::Iteration::mEvents
std::vector< MultiEvent > mEvents
Definition: sampleInference.cpp:468
model.WN.res_skip_layers
res_skip_layers
Definition: demo/Tacotron2/waveglow/model.py:96
nvinfer1::plugin::CoordConvACPluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: coordConvACPlugin.cpp:185
nvinfer1::plugin::CodeTypeSSD::CORNER_SIZE
@ CORNER_SIZE
Use box corners and size.
bert::fused_multihead_attention_v2_fp16_256_64_kernel_sm80_cubin
unsigned char fused_multihead_attention_v2_fp16_256_64_kernel_sm80_cubin[]
Definition: fused_multihead_attention_v2_fp16_256_64_kernel.sm80.cpp:19
nvinfer1::anonymous_namespace{NvInfer.h}::createInferBuilder
IBuilder * createInferBuilder(ILogger &logger)
Create an instance of an IBuilder class.
Definition: NvInfer.h:7440
pytorch_quantization.tensor_quant.ScaledQuantDescriptor.learn_amax
def learn_amax(self)
Definition: tensor_quant.py:128
BufferDesc::channelPivot
bool channelPivot
Definition: sampleReformatFreeIO.cpp:121
nvinfer1::plugin::DetectionLayer::isOutputBroadcastAcrossBatch
bool isOutputBroadcastAcrossBatch(int outputIndex, const bool *inputIsBroadcasted, int nbInputs) const override
Definition: detectionLayerPlugin.cpp:282
sample::InferenceTrace::inStart
float inStart
Definition: sampleReporting.h:94
nvinfer1::plugin::RPROIPlugin::A
int A
Definition: nvFasterRCNNPlugin.h:99
models.meta.check_identity
def check_identity(runner)
Definition: meta.py:49
half_float::detail::functions::plus
static expr plus(float x, float y)
Addition implementation.
Definition: samples/common/half.h:1589
nvinfer1::plugin::GenerateDetection::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: generateDetectionPlugin.cpp:165
bert
Definition: fused_multihead_attention.h:28
nvinfer1::plugin::CoordConvACPlugin::enqueue
int enqueue(int batchSize, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
polygraphy.tools.args.trt.runner.TrtRunnerArgs.model_args
model_args
Definition: tools/args/trt/runner.py:37
pytorch_quantization.nn.modules.quant_pooling.QuantMaxPool1d.__init__
def __init__(self, kernel_size, stride=None, padding=0, dilation=1, return_indices=False, ceil_mode=False, **kwargs)
Definition: quant_pooling.py:38
Logger::operator<<
Logger & operator<<(T const &obj)
Definition: demo/BERT/infer_c/logging.h:38
polygraphy.common.struct.TensorMetadata.__str__
def __str__(self)
Definition: common/struct.py:51
pytorch_quantization.nn.modules.quant_instancenorm.QuantInstanceNorm3d.forward
def forward(self, input)
Definition: quant_instancenorm.py:76
backend.onnx.test_loader.TestSaveOnnx.test_save_onnx
def test_save_onnx(self)
Definition: onnx/test_loader.py:75
nvinfer1::plugin::CoordConvACPlugin::canBroadcastInputAcrossBatch
bool canBroadcastInputAcrossBatch(int inputIndex) const override
Definition: coordConvACPlugin.cpp:172
nvinfer1::plugin::BatchedNMSDynamicPlugin::mNamespace
std::string mNamespace
Definition: batchedNMSPlugin.h:116
polygraphy.backend.trt.loader.ShapeTuple.min
min
Definition: backend/trt/loader.py:245
pytorch_quantization.nn.modules.quant_linear.QuantLinear.forward
def forward(self, input)
Definition: quant_linear.py:68
UffPoolPluginV2Creator::createPlugin
IPluginV2 * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: sampleUffPluginV2Ext.cpp:663
SampleConfig::setCalibrationFileName
void setCalibrationFileName(const char *calibrationFilename)
Definition: sampleConfig.h:272
polygraphy.backend.trt.loader.CreateConfig.calibrator
calibrator
Definition: backend/trt/loader.py:326
half_float::detail::fdim
expr fdim(half x, half y)
Positive difference.
Definition: ieee_half.h:2309
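The "positive difference" named above follows the usual fdim convention: the result is x - y when x is greater than y, and positive zero otherwise. A minimal sketch in plain Python floats, assuming that convention (the helper below is illustrative and is not the half-precision implementation in ieee_half.h):

```python
import math


def fdim(x: float, y: float) -> float:
    """Positive difference: x - y if x > y, otherwise +0.0."""
    # NaN in either operand propagates, as with the C library fdim.
    if math.isnan(x) or math.isnan(y):
        return math.nan
    return x - y if x > y else 0.0
```

For example, fdim(5.0, 3.0) gives 2.0, while fdim(3.0, 5.0) gives 0.0 rather than a negative value.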
polygraphy.tools.args.onnx.loader.OnnxLoaderArgs.add_to_parser
def add_to_parser(self, parser)
Definition: tools/args/onnx/loader.py:30
polygraphy.backend.base.runner.BaseRunner.get_input_metadata
def get_input_metadata(self)
Definition: backend/base/runner.py:123
pytorch_quantization.nn.modules.quant_rnn.QuantRNNCell.nonlinearity
nonlinearity
Definition: modules/quant_rnn.py:345
SampleMLPParams::outputSize
int outputSize
The output size.
Definition: sampleMLP.cpp:49
pytorch_quantization.nn._functions.quant_rnn.LSTMCell
def LSTMCell(input, hidden, w_ih, w_hh, b_ih=None, b_hh=None, input_quantizer=None, weight_quantizer=None)
Definition: _functions/quant_rnn.py:46
backend.trt.test_loader.TestConfigLoader.test_tf32
def test_tf32(self, identity_builder_network, flag)
Definition: trt/test_loader.py:201
sample::ModelFormat
ModelFormat
Definition: sampleOptions.h:55
bert::EmbLayerNormVarSeqlenPlugin::mTokVocabSize
size_t mTokVocabSize
Definition: embLayerNormVarSeqlenPlugin.h:96
samplesCommon::Args::dataDirs
std::vector< std::string > dataDirs
Definition: argsParser.h:86
nvinfer1::IAlgorithm
Describes a variation of execution of a layer. An algorithm is represented by IAlgorithmVariant and t...
Definition: NvInfer.h:6267
polygraphy.util.cuda.DeviceBuffer._ptr
_ptr
Definition: cuda.py:125
nmtSample::TextReader::reset
void reset() override
Reset the reader position, the data reader is ready to read the data from the beginning again after t...
Definition: textReader.cpp:60
nvinfer1::IExprBuilder
Definition: NvInferRuntime.h:356
nvonnxparser::IParserError::file
virtual const char * file() const =0
source file in which the error occurred
sample::BuildOptions::shapes
std::unordered_map< std::string, ShapeRange > shapes
Definition: sampleOptions.h:123
pytorch_quantization.optim.helper.group_parameters
def group_parameters(model, patterns_list, lrs=None, momentums=None, weight_decays=None)
Definition: helper.py:43
helpers.tokenization.BasicTokenizer._clean_text
def _clean_text(self, text)
Definition: tokenization.py:316
OutputArgs::itemProbPairVec
std::vector< std::pair< int32_t, float > > itemProbPairVec
Definition: sampleMovieLensMPS.cpp:161
polygraphy.tools.inspect.subtool.model.Model.inspect_onnx
def inspect_onnx(self, args)
Definition: tools/Polygraphy/polygraphy/tools/inspect/subtool/model.py:81
polygraphy.backend.tf.util.str_from_graph
def str_from_graph(graph, mode)
Definition: tf/util.py:161
tests.quant_pooling_test.TestQuantAdaptiveAvgPool1d.test_input_fake_quant
def test_input_fake_quant(self)
Definition: quant_pooling_test.py:228
half_float::detail::trunc
half trunc(half arg)
Nearest integer not greater in magnitude than half value.
Definition: ieee_half.h:2565
polygraphy.tools.args.trt.loader.TrtLoaderArgs.__init__
def __init__(self, config=True, outputs=True, network_api=False)
Definition: tools/args/trt/loader.py:25
sample::Parser::caffeParser
TrtUniquePtr< nvcaffeparser1::ICaffeParser > caffeParser
Definition: sampleEngines.h:35
nmtSample::LSTMDecoder::mNumUnits
int mNumUnits
Definition: lstmDecoder.h:51
nmtSample::Alignment::Alignment
Alignment()=default
polygraphy.tools.util.script.Script.preimport
preimport
Definition: script.py:140
nvinfer1::IExecutionContext::destroy
virtual void destroy() noexcept=0
Destroy this object.
nvinfer1::LayerType::kACTIVATION
@ kACTIVATION
Activation layer.
nvuffparser::FieldType::kDATATYPE
@ kDATATYPE
nvinfer1::DataType field type.
half_float::detail::functions::isgreaterequal
static bool isgreaterequal(half x, half y)
Comparison implementation.
Definition: samples/common/half.h:2420
nvinfer1::plugin::NMSParameters::numClasses
int32_t numClasses
Definition: NvInferPluginUtils.h:250
profiler.hasNames
def hasNames(features)
Definition: profiler.py:46
FCPlugin::mNbOutputChannels
int mNbOutputChannels
Definition: samples/opensource/samplePlugin/fcPlugin.h:308
polygraphy.util.misc.Compressed.__init__
def __init__(self, cobj)
Definition: Polygraphy/polygraphy/util/misc.py:179
SampleSSD::mInputDims
nvinfer1::Dims mInputDims
The dimensions of the input to the network.
Definition: sampleSSD.cpp:90
polygraphy.tools.surgeon.subtool.insert.Insert
Definition: insert.py:21
nvinfer1::plugin::DetectionLayer::getWorkspaceSize
size_t getWorkspaceSize(int maxBatchSize) const override
Definition: detectionLayerPlugin.cpp:231
sample::InferenceTrace::InferenceTrace
InferenceTrace(int s, float es, float ee, float is, float ie, float cs, float ce, float os, float oe)
Definition: sampleReporting.h:71
SampleFasterRCNN::processInput
bool processInput(const samplesCommon::BufferManager &buffers)
Reads the input and mean data, preprocesses, and stores the result in a managed buffer.
Definition: sampleFasterRCNN.cpp:259
sample::BaseModelOptions::model
std::string model
Definition: sampleOptions.h:77
SampleUffPluginV2Ext::SampleUffPluginV2Ext
SampleUffPluginV2Ext(const UffSampleParams &params)
Definition: sampleUffPluginV2Ext.cpp:194
MaskRCNNConfig::MASK_POOL_SIZE
static const int MASK_POOL_SIZE
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:30
polygraphy.tools.args.trt.loader.TrtLoaderArgs.network_api
network_api
Definition: tools/args/trt/loader.py:87
test_ir.TestNodeIO.test_add
def test_add(self, field_names)
Definition: test_ir.py:294
SampleSSD::mEngine
std::shared_ptr< nvinfer1::ICudaEngine > mEngine
The TensorRT engine used to run the network.
Definition: sampleSSD.cpp:94
nvinfer1::plugin::FlattenConcat::deserializeToDevice
Weights deserializeToDevice(const char *&hostBuffer, size_t count)
test_api.TestApi.test_import
def test_import(self)
Definition: test_api.py:30
SimpleProfiler::SimpleProfiler
SimpleProfiler(const char *name, const std::vector< SimpleProfiler > &srcProfilers=std::vector< SimpleProfiler >())
Definition: samples/common/common.h:157
BatchStream::mFileBatch
std::vector< float > mFileBatch
List of image files.
Definition: BatchStream.h:381
polygraphy.backend.onnx.loader.OnnxFromPath.__call__
def __call__(self)
Definition: backend/onnx/loader.py:35
half_float::half::operator+=
half & operator+=(float rhs)
Arithmetic assignment.
Definition: samples/common/half.h:1488
PluginFactory::mPlugin
std::unique_ptr< FCPlugin > mPlugin
Definition: samples/opensource/samplePlugin/fcPlugin.h:381
nvinfer1::CUDADriverWrapper::cuLinkCreate
CUresult cuLinkCreate(unsigned int numOptions, CUjit_option *options, void **optionValues, CUlinkState *stateOut) const
Definition: cudaDriverWrapper.cpp:101
sample::anonymous_namespace{sampleInference.cpp}::EnqueueGraph::EnqueueGraph
EnqueueGraph(TrtCudaGraph &graph)
Definition: sampleInference.cpp:237
polygraphy.tools.args.trt_legacy.TrtLegacyArgs.preprocessor
preprocessor
Definition: tools/args/trt_legacy.py:64
sample::Binding
Definition: sampleUtils.h:215
pytorch_quantization.nn.modules.quant_instancenorm.QuantInstanceNorm2d
Definition: quant_instancenorm.py:49
sample::Bindings::mDevicePointers
std::vector< void * > mDevicePointers
Definition: sampleUtils.h:453
tests.calibrator_test.TestMaxCalibrator
Definition: calibrator_test.py:34
TLTMaskRCNNConfig::TOP_DOWN_PYRAMID_SIZE
static const int TOP_DOWN_PYRAMID_SIZE
Definition: tlt_mrcnn_config.h:53
nvinfer1::ErrorCode::kFAILED_EXECUTION
@ kFAILED_EXECUTION
An error occurred during execution that caused TensorRT to end prematurely, either an asynchronous er...
polygraphy.backend.onnx.loader.SaveOnnx
Definition: backend/onnx/loader.py:153
nvinfer1::plugin::Region::getWorkspaceSize
size_t getWorkspaceSize(int maxBatchSize) const override
Definition: regionPlugin.cpp:393
tools.test_inspect.TestInspectModel
Definition: test_inspect.py:249
FCPlugin::mCudnn
cudnnHandle_t mCudnn
Definition: samples/opensource/samplePlugin/fcPlugin.h:315
SampleINT8API::writeNetworkTensorNames
void writeNetworkTensorNames(const SampleUniquePtr< nvinfer1::INetworkDefinition > &network)
Write network tensor names to a file.
Definition: sampleINT8API.cpp:252
nvinfer1::plugin::ProposalPlugin::mMaxBoxNum
size_t mMaxBoxNum
Definition: proposalPlugin.h:118
nvinfer1::plugin::MultilevelProposeROIPluginCreator::createPlugin
IPluginV2Ext * createPlugin(const char *name, const PluginFieldCollection *fc) override
Return a plugin object.
Definition: multilevelProposeROIPlugin.cpp:69
half_float::detail::functions::isnan
static bool isnan(half arg)
Classification implementation.
Definition: ieee_half.h:1874
nvinfer1::plugin::BatchedNMSPlugin::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: batchedNMSPlugin.cpp:99
fp16
Definition: fp16.h:22
polygraphy.backend.onnx.loader.BytesFromOnnx.__call__
def __call__(self)
Definition: backend/onnx/loader.py:189
nvinfer1::plugin::ProposalPlugin::canBroadcastInputAcrossBatch
bool canBroadcastInputAcrossBatch(int inputIndex) const override
Definition: proposalPlugin.cpp:289
bert::FusedMultiHeadAttentionKernelMetaInfoV2::mDataType
Data_type mDataType
Definition: fused_multihead_attention_v2.h:160
sample::Logger::reportTestStart
static void reportTestStart(TestAtom &testAtom)
Report that a test has started.
Definition: samples/common/logging.h:324
samplesCommon::getElementSize
unsigned int getElementSize(nvinfer1::DataType t)
Definition: samples/common/common.h:613
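A helper of this kind typically maps each data type to its width in bytes so that buffer sizes can be computed from a tensor's element count. The sketch below is an assumption about that mapping, not a copy of common.h: the string keys stand in for nvinfer1::DataType enumerators, and buffer_bytes is a hypothetical helper added for illustration.

```python
# Assumed byte widths for the common TensorRT data types.
ELEMENT_SIZE_BYTES = {
    "kFLOAT": 4,  # 32-bit floating point
    "kHALF": 2,   # 16-bit floating point
    "kINT8": 1,   # 8-bit quantized integer
    "kINT32": 4,  # 32-bit integer
    "kBOOL": 1,   # boolean stored in one byte
}


def element_size(data_type: str) -> int:
    """Return the size in bytes of one element of the given type."""
    try:
        return ELEMENT_SIZE_BYTES[data_type]
    except KeyError:
        raise ValueError(f"Invalid data type: {data_type}") from None


def buffer_bytes(dims, data_type: str) -> int:
    """Hypothetical helper: total bytes for a tensor with the given dims."""
    volume = 1
    for d in dims:
        volume *= d
    return volume * element_size(data_type)
```

Under these assumptions, a 1x3x224x224 kHALF tensor needs 1 * 3 * 224 * 224 * 2 bytes.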
nvinfer1::plugin::LReLU::configureWithFormat
void configureWithFormat(const Dims *inputDims, int nbInputs, const Dims *outputDims, int nbOutputs, DataType type, PluginFormat format, int maxBatchSize) override
Definition: lReluPlugin.cpp:81
nvinfer1::IPluginV2Ext::canBroadcastInputAcrossBatch
virtual bool canBroadcastInputAcrossBatch(int32_t inputIndex) const =0
Return true if plugin can use input that is broadcast across batch without replication.
pytorch_quantization.nn.modules.quant_pooling.QuantAvgPool2d.forward
def forward(self, input)
Definition: quant_pooling.py:98
onnx_models.lstm_model
def lstm_model()
Definition: onnx_models.py:114
nvinfer1::EnumMax< PoolingType >
constexpr int32_t EnumMax< PoolingType >()
Maximum number of elements in PoolingType enum.
Definition: NvInfer.h:1715
nvinfer1::NetworkDefinitionCreationFlag::kEXPLICIT_BATCH
@ kEXPLICIT_BATCH
Dynamic shape support requires that the kEXPLICIT_BATCH flag is set.
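Creation flags of this kind are combined into a bitmask in which each enumerator selects one bit position. The sketch below shows only that arithmetic; the IntEnum is a stand-in for nvinfer1::NetworkDefinitionCreationFlag (its value of 0 is an assumption here), and creation_flags is a hypothetical helper. With the real library, the resulting mask would be handed to the builder when the network is created.

```python
from enum import IntEnum


class NetworkDefinitionCreationFlag(IntEnum):
    # Stand-in for the TensorRT enum; the value 0 is assumed for illustration.
    kEXPLICIT_BATCH = 0


def creation_flags(*flags) -> int:
    """Fold enumerators into a bitmask: bit i is set for enumerator value i."""
    mask = 0
    for flag in flags:
        mask |= 1 << int(flag)
    return mask


explicit_batch_mask = creation_flags(NetworkDefinitionCreationFlag.kEXPLICIT_BATCH)
```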
nvinfer1::plugin::SplitPlugin::SplitPlugin
SplitPlugin(int axis, std::vector< int > output_lengths)
Definition: split.h:75
LogStreamConsumerBase::LogStreamConsumerBase
LogStreamConsumerBase(std::ostream &stream, const std::string &prefix, bool shouldLog)
Definition: pluginLogging.h:110
helpers.tokenization._is_control
def _is_control(char)
Definition: tokenization.py:404
onnx_graphsurgeon.ir.tensor.Constant.inputs
inputs
Definition: tensor.py:198
comparator.test_comparator.TestComparator.test_postprocess
def test_postprocess(self)
Definition: test_comparator.py:86
nvinfer1::DimensionOperation::kPROD
@ kPROD
Product of the two operands.
inference.stream
stream
Definition: BERT/inference.py:151
polygraphy.util.misc.str_from_module_info
def str_from_module_info(module, name=None)
Definition: Polygraphy/polygraphy/util/misc.py:395
nmtSample::BeamSearchPolicy::mEndSequenceId
int mEndSequenceId
Definition: beamSearchPolicy.h:66
onnx_graphsurgeon.ir.tensor.Tensor.name
name
Definition: tensor.py:56
polygraphy.backend.cntk.runner.CNTKRunner.activate_impl
def activate_impl(self)
Definition: backend/cntk/runner.py:31
polygraphy.tools.run.run.Run.add_parser_args
def add_parser_args(self, parser)
Definition: run.py:117
nvinfer1::plugin::Reorg::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: reorgPlugin.cpp:126
bert::FusedMultiHeadAttentionKernelMetaInfoV2::mFuncName
const char * mFuncName
Definition: fused_multihead_attention_v2.h:166
polygraphy.backend.onnx.runner.OnnxTfRunner.activate_impl
def activate_impl(self)
Definition: backend/onnx/runner.py:38
polygraphy.tools.util.script.Script.runners
runners
Definition: script.py:139
polygraphy.backend.trt.util.add_binding_to_metadata
def add_binding_to_metadata(engine, binding, metadata)
Definition: trt/util.py:365
SampleUffMNIST::kDIGITS
const int kDIGITS
Definition: sampleUffMNIST.cpp:104
polygraphy.logger.logger.LoggerIndent.indent
indent
Definition: Polygraphy/polygraphy/logger/logger.py:47
SimpleProfiler::operator<<
friend std::ostream & operator<<(std::ostream &out, const SimpleProfiler &value)
Definition: samples/common/common.h:178
polygraphy.backend.trt_legacy.LoadNetworkFromCaffe.__call__
def __call__(self)
Definition: backend/trt_legacy.py:159
nvinfer1::plugin::MultilevelCropAndResizePluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: multilevelCropAndResizePlugin.cpp:47
polygraphy.tools.precision.precision.STCheckerBase.golden
golden
Definition: precision.py:75
onnx_graphsurgeon.logger.logger.Logger.letter
letter
Definition: onnx-graphsurgeon/onnx_graphsurgeon/logger/logger.py:105
nvuffparser::IPluginFactory::~IPluginFactory
virtual ~IPluginFactory()
Definition: NvUffParser.h:121
test_examples.Artifact.infer
infer
Definition: onnx-graphsurgeon/tests/test_examples.py:36
polygraphy.tools.args.tf.config.TfConfigArgs
Definition: tools/Polygraphy/polygraphy/tools/args/tf/config.py:21
SampleDynamicReshape::buildPredictionEngine
bool buildPredictionEngine(const SampleUniquePtr< nvinfer1::IBuilder > &builder)
Builds an engine for prediction (mPredictionEngine).
Definition: sampleDynamicReshape.cpp:208
SimpleProfiler
Definition: samples/common/common.h:139
backend.trt.test_calibrator.TestCalibrator.test_calibrator_generator_data
def test_calibrator_generator_data(self, identity_builder_network)
Definition: test_calibrator.py:52
bert::SkipLayerNormInterleavedPlugin::supportsFormatCombination
bool supportsFormatCombination(int pos, const nvinfer1::PluginTensorDesc *inOut, int nbInputs, int nbOutputs) override
Definition: skipLayerNormInt8InterleavedPlugin.cpp:119
test_ir.TestGraph.test_tensors_check_duplicates
def test_tensors_check_duplicates(self)
Definition: test_ir.py:587
polygraphy.util.format
Definition: format.py:1
SampleINT8APIArgs::imageFileName
std::string imageFileName
Definition: sampleINT8API.cpp:652
nmtSample::BenchmarkWriter::mOutputTokenCount
int mOutputTokenCount
Definition: benchmarkWriter.h:50
nvinfer1::ILoop::~ILoop
virtual ~ILoop()
Definition: NvInfer.h:4712
polygraphy.tools.args.comparator.ComparatorRunArgs._write
_write
Definition: tools/args/comparator.py:26
example.load_config
def load_config(config)
Definition: Polygraphy/examples/api/03_interoperating_with_tensorrt/example.py:48
pytorch_quantization.nn.modules.quant_rnn.QuantRNNCell.bias_ih
bias_ih
Definition: modules/quant_rnn.py:349
Logger::genCmdlineString
static std::string genCmdlineString(int argc, char const *const *argv)
generate a command line string from the given (argc, argv) values
Definition: pluginLogging.h:448
nvinfer1::IElementWiseLayer::~IElementWiseLayer
virtual ~IElementWiseLayer()
Definition: NvInfer.h:2668
samplesCommon::CpuTimer
Definition: samples/common/common.h:840
SampleCharRNNParams
Definition: sampleCharRNN.cpp:114
nmtSample::BeamSearchPolicy::~BeamSearchPolicy
~BeamSearchPolicy() override=default
nvinfer1::plugin::DetectionLayer::mMaxBatchSize
int mMaxBatchSize
Definition: detectionLayerPlugin.h:99
ProposalWorkSpace::argMaxBboxDims
nvinfer1::DimsHW argMaxBboxDims
Definition: maskRCNNKernels.h:130
tools.test_precision.test_polygraphy_precision_linear_sanity
def test_polygraphy_precision_linear_sanity()
Definition: test_precision.py:41
test_ir.TensorBaseTests.test_i
def test_i(self)
Definition: test_ir.py:80
polygraphy.backend.trt.loader.CreateNetwork.__call__
def __call__(self)
Definition: backend/trt/loader.py:75
nmtSample::BeamSearchPolicy::mBeamSearchTable
std::vector< Ray > mBeamSearchTable
Definition: beamSearchPolicy.h:71
SampleConfig::reduceVerbosity
void reduceVerbosity()
Decrease verbosity level.
Definition: sampleConfig.h:121
backend.onnx.test_loader.TestExportOnnxFromTf.test_no_optimize
def test_no_optimize(self)
Definition: onnx/test_loader.py:42
nvinfer1::plugin::ProposalPlugin::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: proposalPlugin.cpp:200
nmtSample::DataWriter::DataWriter
DataWriter()=default
nvinfer1::plugin::RPROIPlugin::clone
IPluginV2Ext * clone() const override
Clone the plugin object.
Definition: nvFasterRCNNPlugin.cpp:236
sample::LogStreamConsumerBuffer::putOutput
void putOutput()
Definition: samples/common/logging.h:72
UffPoolPluginV2::mDataType
DataType mDataType
Definition: sampleUffPluginV2Ext.cpp:636
test_ir.TestNodeIO.get_lists
def get_lists(self, field_names)
Definition: test_ir.py:226
nvinfer1::plugin::FlattenConcat::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: flattenConcat.cpp:167
SampleSSDParams::nbCalBatches
int nbCalBatches
The number of batches for calibration.
Definition: sampleSSD.cpp:51
nvinfer1::plugin::ResizeNearestPluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: resizeNearestPlugin.cpp:47
nvinfer1::IPluginV2Ext::attachToContext
virtual void attachToContext(cudnnContext *, cublasContext *, IGpuAllocator *)
Attach the plugin object to an execution context and grant the plugin the access to some context reso...
Definition: NvInferRuntimeCommon.h:667
nvinfer1::DimensionType::kINDEX
@ kINDEX
Elements correspond to different batch index.
polygraphy.tools.args.data_loader.DataLoaderArgs
Definition: tools/args/data_loader.py:22
model.Postnet.__init__
def __init__(self, n_mel_channels, postnet_embedding_dim, postnet_kernel_size, postnet_n_convolutions)
Definition: demo/Tacotron2/tacotron2/model.py:127
sample::anonymous_namespace{sampleInference.cpp}::Iteration
Inference iteration and streams management.
Definition: sampleInference.cpp:281
SampleCharRNNBase::addLSTMLayers
virtual nvinfer1::ILayer * addLSTMLayers(SampleUniquePtr< nvinfer1::INetworkDefinition > &network)=0
Add inputs to the TensorRT network and configure LSTM layers using network definition API.
model.Tacotron2.n_frames_per_step
n_frames_per_step
Definition: demo/Tacotron2/tacotron2/model.py:592
polygraphy.backend.tf.loader
Definition: backend/tf/loader.py:1
sample::anonymous_namespace{sampleDevice.h}::cudaSleep
void cudaSleep(cudaStream_t stream, cudaError_t status, void *sleep)
Definition: sampleDevice.h:43
tools.test_inspect.TestInspectData.test_inputs
def test_inputs(self, opts)
Definition: test_inspect.py:300
nvinfer1::plugin::DetectionOutput::destroy
void destroy() override
Destroy the plugin object.
Definition: nmsPlugin.cpp:160
samplesCommon::loadLibrary
void loadLibrary(const std::string &path)
Definition: samples/common/common.h:896
polygraphy.backend.onnxrt.runner.OnnxrtRunner._sess
_sess
Definition: backend/onnxrt/runner.py:36
nvinfer1::plugin::GenerateDetection::mNbClasses
int mNbClasses
Definition: generateDetectionPlugin.h:95
nvinfer1::ISliceLayer
Slices an input tensor into an output tensor based on the offset and strides.
Definition: NvInfer.h:3922
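The slicing described above can be illustrated without TensorRT: each output coordinate maps to the input coordinate offset + index * stride along every axis. This is a minimal sketch on nested Python lists; the function and the parameter names start, size, and stride are illustrative choices, not the layer's API.

```python
def slice_tensor(data, start, size, stride):
    """Slice a nested list: out[idx] = data[start + idx * stride] per axis."""

    def at(nested, index):
        # Walk the nested lists down to a single element.
        for i in index:
            nested = nested[i]
        return nested

    def build(prefix, axis):
        if axis == len(size):
            src = [s + i * st for s, i, st in zip(start, prefix, stride)]
            return at(data, src)
        return [build(prefix + [i], axis + 1) for i in range(size[axis])]

    return build([], 0)


grid = [[0, 1, 2, 3],
        [4, 5, 6, 7],
        [8, 9, 10, 11]]
# Take rows 0 and 2, columns 1 and 3.
corners = slice_tensor(grid, start=[0, 1], size=[2, 2], stride=[2, 2])
```

Here corners is [[1, 3], [9, 11]].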
nvinfer1::LayerType::kPARAMETRIC_RELU
@ kPARAMETRIC_RELU
Parametric ReLU layer.
polygraphy.backend.trt_legacy.TrtLegacyRunner
Definition: backend/trt_legacy.py:174
pytorch_quantization.nn.modules.quant_rnn.QuantRNNBase.flatten_parameters
def flatten_parameters(self)
Definition: modules/quant_rnn.py:103
nvinfer1::plugin::SplitPlugin::detachFromContext
void detachFromContext() override
Detach the plugin object from its execution context.
Definition: split.h:132
nvinfer1::plugin::GenerateDetection::initialize
int initialize() override
Initialize the layer for execution.
Definition: generateDetectionPlugin.cpp:135
nmtSample::BLEUScoreWriter::mPossibleMatchesByOrder
std::vector< size_t > mPossibleMatchesByOrder
Definition: bleuScoreWriter.h:59
nvinfer1::TopKOperation
TopKOperation
Enumerates the operations that may be performed by a TopK layer.
Definition: NvInfer.h:4055
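A TopK operation selects either the k largest or the k smallest elements and reports both their values and their positions. The sketch below shows that selection on a flat Python list; the strings "kMAX" and "kMIN" stand in for the enumerators, and top_k is a hypothetical helper rather than the layer itself.

```python
import heapq


def top_k(values, k, op="kMAX"):
    """Return (values, indices) of the k largest or k smallest elements."""
    pairs = list(enumerate(values))  # (index, value) pairs
    if op == "kMAX":
        chosen = heapq.nlargest(k, pairs, key=lambda p: p[1])
    elif op == "kMIN":
        chosen = heapq.nsmallest(k, pairs, key=lambda p: p[1])
    else:
        raise ValueError(f"Unknown TopK operation: {op}")
    return [v for _, v in chosen], [i for i, _ in chosen]
```

For the input [3, 1, 4, 1, 5] with k = 2, "kMAX" selects the values 5 and 4 at indices 4 and 2.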
nmtSample::LSTMEncoder
encodes input sentences into output states using LSTM
Definition: lstmEncoder.h:31
nmtSample::Likelihood::Likelihood
Likelihood()=default
half_float::detail::expr::value_
float value_
Internal expression value stored in single-precision.
Definition: ieee_half.h:365
nvinfer1::plugin::GroupNormalizationPluginCreator::setPluginNamespace
void setPluginNamespace(const char *pluginNamespace) override
Set the namespace of the plugin creator based on the plugin library it belongs to.
Definition: groupNormalizationPlugin.cpp:280
FCPlugin::mNbInputChannels
int mNbInputChannels
Definition: samples/opensource/samplePlugin/fcPlugin.h:308
nvinfer1::plugin::SpecialSlice::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: specialSlicePlugin.h:51
nvinfer1::IPluginCreator::createPlugin
virtual IPluginV2 * createPlugin(const char *name, const PluginFieldCollection *fc)=0
Return a plugin object.
polygraphy.backend.tf.loader.SaveGraph.path
path
Definition: backend/tf/loader.py:323
nmtSample::SoftmaxLikelihood::SoftmaxLikelihoodCombinationOperator
Definition: softmaxLikelihood.h:33
nvinfer1::plugin::BatchedNMSPlugin::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: batchedNMSPlugin.cpp:75
polygraphy.tools.args.tf.runner.TfRunnerArgs.check_registered
def check_registered(self)
Definition: tools/args/tf/runner.py:37
sample::LogStreamConsumerBase
Convenience object used to initialize LogStreamConsumerBuffer before std::ostream in LogStreamConsume...
Definition: samples/common/logging.h:111
inference
Definition: BERT/inference.py:1
nvinfer1::plugin::DetectionLayer::detachFromContext
void detachFromContext() override
Detach the plugin object from its execution context.
Definition: detectionLayerPlugin.cpp:313
nvinfer1::plugin::BatchedNMSBasePluginCreator::mFC
static PluginFieldCollection mFC
Definition: batchedNMSPlugin.h:132
bert::TFusedMHAKernelFactory::getXMMAKernels
const TFusedMHAKernelList * getXMMAKernels(const typename TFusedMHAKernelList::KernelMeta *pKernelList, unsigned int nbKernels, Data_type type, unsigned int sm)
Definition: fused_multihead_attention.h:295
nvinfer1::plugin::MultilevelCropAndResize::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: multilevelCropAndResizePlugin.cpp:135
polygraphy.tools.inspect.subtool.model.Model.run
def run(self, args)
Definition: tools/Polygraphy/polygraphy/tools/inspect/subtool/model.py:47
nvinfer1::plugin::RegionPluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: regionPlugin.cpp:479
SampleErrorRecorder::mErrorStack
errorStack mErrorStack
Definition: ErrorRecorder.h:132
nvinfer1::plugin::InstanceNormalizationPluginCreator::deserializePlugin
IPluginV2DynamicExt * deserializePlugin(const char *name, const void *serialData, size_t serialLength) override
Called during deserialization of plugin layer.
Definition: instanceNormalizationPlugin.cpp:399
polygraphy.logger.logger.Logger.ultra_verbose
def ultra_verbose(self, message, mode=LogMode.EACH)
Definition: Polygraphy/polygraphy/logger/logger.py:250
ProposalWorkSpace::preRefineScoreDims
nvinfer1::DimsHW preRefineScoreDims
Definition: maskRCNNKernels.h:126
nmtSample::BenchmarkWriter::mStartTS
std::chrono::high_resolution_clock::time_point mStartTS
Definition: benchmarkWriter.h:51
nvinfer1::plugin::Normalize::mPluginNamespace
std::string mPluginNamespace
Definition: normalizePlugin.h:104
tools.test_run.TestOnnxTf.test_onnx_tf
def test_onnx_tf(self)
Definition: test_run.py:177
nvinfer1::plugin::GridAnchorGenerator::destroy
void destroy() override
Destroy the plugin object.
Definition: gridAnchorPlugin.cpp:339
nvinfer1::plugin::DetectionOutput::initialize
int initialize() override
Initialize the layer for execution.
Definition: nmsPlugin.cpp:73
sample::InferenceEnvironment::bindings
std::vector< std::unique_ptr< Bindings > > bindings
Definition: sampleInference.h:38
copyright-scan.copyright_scan
def copyright_scan(directory, depth, args, exclude_dirs=[])
Definition: copyright-scan.py:152
test_exporters.TestOnnxExporter.test_export_node
def test_export_node(self)
Definition: test_exporters.py:80
half_float::detail::operator/
enable< expr, T, U >::type operator/(T x, U y)
Divide halfs.
Definition: ieee_half.h:2190
nvinfer1::plugin::PyramidROIAlign::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: pyramidROIAlignPlugin.cpp:88
nvinfer1::plugin::CropAndResizePlugin::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: cropAndResizePlugin.cpp:142
nvinfer1::plugin::TRTException::function
const char * function
Definition: checkMacrosPlugin.h:94
pytorch_quantization.nn.modules.quant_rnn.QuantRNNCell.hidden_size
hidden_size
Definition: modules/quant_rnn.py:343
nvinfer1::TensorLocation
TensorLocation
The location for tensor data storage, device or host.
Definition: NvInferRuntimeCommon.h:1015
tests.quant_rnn_test.TestQuantLSTM
Definition: quant_rnn_test.py:255
nvinfer1::plugin::RPROIPlugin::params
RPROIParams params
Definition: nvFasterRCNNPlugin.h:98
sample::TrtCudaBuffer::allocate
void allocate(size_t size)
Definition: sampleDevice.h:249
nvinfer1::plugin::RPROIPlugin::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: nvFasterRCNNPlugin.cpp:224
nvinfer1::plugin::Normalize::attachToContext
void attachToContext(cudnnContext *cudnnContext, cublasContext *cublasContext, IGpuAllocator *gpuAllocator) override
Attach the plugin object to an execution context and grant the plugin the access to some context reso...
Definition: normalizePlugin.cpp:221
FCPlugin::FCPlugin
FCPlugin(const void *data, size_t length)
Definition: samples/opensource/samplePlugin/fcPlugin.h:62
SampleMovieLensParams::outParamsVec
std::vector< OutputParams > outParamsVec
Definition: sampleMovieLens.cpp:75
half_float::detail::functions::modf
static half modf(half arg, half *iptr)
Decompression implementation.
Definition: samples/common/half.h:2167
PluginFactory::isPlugin
bool isPlugin(const char *name) override
A user implemented function that determines if a layer configuration is provided by an IPlugin.
Definition: samples/opensource/samplePlugin/fcPlugin.h:325
nvinfer1::LayerType::kFULLY_CONNECTED
@ kFULLY_CONNECTED
Fully connected layer.
test_ir.TensorBaseTests.test_can_copy_inputs_from_other_node
def test_can_copy_inputs_from_other_node(self)
Definition: test_ir.py:67
nvinfer1::plugin::MultilevelCropAndResize::getOutputDataType
DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: multilevelCropAndResizePlugin.cpp:254
SampleFasterRCNN::SampleUniquePtr
std::unique_ptr< T, samplesCommon::InferDeleter > SampleUniquePtr
Definition: sampleFasterRCNN.cpp:58
nvinfer1::plugin::InstanceNormalizationPlugin::attachToContext
void attachToContext(cudnnContext *cudnn, cublasContext *cublas, nvinfer1::IGpuAllocator *allocator) override
Attach the plugin object to an execution context and grant the plugin the access to some context reso...
Definition: instanceNormalizationPlugin.cpp:280
BatchStream::mListFile
std::string mListFile
File name of the list of image names.
Definition: BatchStream.h:386
anonymous_namespace{normalizePlugin.cpp}::NORMALIZE_PLUGIN_VERSION
const char * NORMALIZE_PLUGIN_VERSION
Definition: normalizePlugin.cpp:31
polygraphy.tools.args.trt.loader.TrtLoaderArgs.onnx_loader_args
onnx_loader_args
Definition: tools/args/trt/loader.py:73
nmtSample::Vocabulary::mUnkStr
static const std::string mUnkStr
Definition: vocabulary.h:69
MultilevelProposeROIWorkSpace::argMaxScoreDims
nvinfer1::DimsHW argMaxScoreDims
Definition: maskRCNNKernels.h:164
chptToBin.chpt_to_dict_arrays
def chpt_to_dict_arrays()
Definition: chptToBin.py:54
nvinfer1::PluginTensorDesc
Fields that a plugin might see for an input or output.
Definition: NvInferRuntimeCommon.h:367
nvinfer1::plugin::MultilevelCropAndResize::isOutputBroadcastAcrossBatch
bool isOutputBroadcastAcrossBatch(int outputIndex, const bool *inputIsBroadcasted, int nbInputs) const override
Definition: multilevelCropAndResizePlugin.cpp:261
tests.tensor_quant_test.TestTensorQuant.test_simple_run
def test_simple_run(self)
Definition: tensor_quant_test.py:40
OutputParams::expectedPredictedMaxRatingItemProb
float expectedPredictedMaxRatingItemProb
Definition: sampleMovieLens.cpp:53
std::numeric_limits< half_float::half >::lowest
static half_float::half lowest()
Smallest finite value.
Definition: samples/common/half.h:4222
BatchStream::getBatchesRead
int getBatchesRead() const override
Definition: BatchStream.h:269
nvinfer1::IPlugin::terminate
virtual void terminate()=0
Release resources acquired during plugin layer initialization.
nvcaffeparser1::CaffeParser::parseRPROIParam
std::vector< nvinfer1::PluginField > parseRPROIParam(const trtcaffe::LayerParameter &msg, CaffeWeightFactory &weightFactory, BlobNameToTensor &tensors)
Definition: caffeParser.cpp:245
polygraphy.tools.inspect.subtool.data.Data.add_parser_args
def add_parser_args(self, parser)
Definition: data.py:32
SampleSSD::build
bool build()
Function builds the network engine.
Definition: sampleSSD.cpp:122
nvinfer1::plugin::Reorg::enqueue
int enqueue(int batchSize, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: reorgPlugin.cpp:63
nvcaffeparser1::CaffeWeightFactory::checkForNans
bool checkForNans(const void *values, int count, const std::string &layerName)
Definition: caffeWeightFactory.cpp:381
nvinfer1::RNNGateType::kFORGET
@ kFORGET
Forget gate (f).
MaskRCNNUtils::Mask
Definition: sampleUffMaskRCNN.cpp:57
utils.get_mask_from_lengths
def get_mask_from_lengths(lengths)
Definition: demo/Tacotron2/common/utils.py:23
half_float::detail::float2half_impl
uint16 float2half_impl(float value, true_type)
Convert IEEE single-precision to half-precision.
Definition: ieee_half.h:454
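The single-to-half conversion named in the float2half_impl entry can be illustrated with a minimal Python sketch. It does not reproduce the bit manipulation in ieee_half.h; it assumes the standard library's struct format "e" (IEEE 754 binary16) as a stand-in, and the helper names float_to_half_bits and half_bits_to_float are hypothetical.

```python
import struct


def float_to_half_bits(value: float) -> int:
    """Return the 16-bit IEEE 754 binary16 pattern for value (hypothetical helper)."""
    # "<e" packs a Python float as a little-endian binary16;
    # "<H" reads the same two bytes back as an unsigned 16-bit integer.
    (bits,) = struct.unpack("<H", struct.pack("<e", value))
    return bits


def half_bits_to_float(bits: int) -> float:
    """Inverse of float_to_half_bits: decode a binary16 bit pattern (hypothetical helper)."""
    (value,) = struct.unpack("<e", struct.pack("<H", bits))
    return value


if __name__ == "__main__":
    print(hex(float_to_half_bits(1.0)))
    print(half_bits_to_float(0x7BFF))
```

Values that are too large for binary16 make struct.pack raise OverflowError in this sketch; the C++ implementation may handle overflow differently.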
DimsNCHW
Descriptor for data with one index dimension, one channel dimension and two spatial dimensions.
polygraphy.tools.args.tf.config.TfConfigArgs.add_to_parser
def add_to_parser(self, parser)
Definition: tools/Polygraphy/polygraphy/tools/args/tf/config.py:22
builder_varseqlen.transformer_layer_opt
def transformer_layer_opt(prefix, config, init_dict, network, input_tensor, mask_idx, cu_seqlens, max_seqlen)
Definition: builder_varseqlen.py:208
nvinfer1::plugin::RPROIPlugin::getOutputDimensions
Dims getOutputDimensions(int index, const Dims *inputs, int nbInputDims) override
Definition: nvFasterRCNNPlugin.cpp:127
sample::Logger::getTRTLogger
nvinfer1::ILogger & getTRTLogger()
Forward-compatible method for retrieving the nvinfer1::ILogger associated with this Logger.
Definition: samples/common/logging.h:233
Int8EntropyCalibrator2::writeCalibrationCache
void writeCalibrationCache(const void *cache, size_t length) override
Definition: EntropyCalibrator.h:126
SampleMNISTAPI::teardown
bool teardown()
Cleans up any state created in the sample class.
Definition: sampleMNISTAPI.cpp:378
bert::fused_multihead_attention_v2_fp16_384_64_kernel_sm75_cubin
unsigned char fused_multihead_attention_v2_fp16_384_64_kernel_sm75_cubin[]
Definition: fused_multihead_attention_v2_fp16_384_64_kernel.sm75.cpp:19
chptToBin.main
def main(_)
Definition: chptToBin.py:358
IBatchStream
Definition: BatchStream.h:27
nvinfer1::ICudaEngine::createExecutionContext
virtual IExecutionContext * createExecutionContext() noexcept=0
Create an execution context.
anonymous_namespace{specialSlicePlugin.cpp}::SPECIALSLICE_PLUGIN_NAME
const char * SPECIALSLICE_PLUGIN_NAME
Definition: specialSlicePlugin.cpp:29
polygraphy.backend.trt.loader.EngineFromNetwork._config
_config
Definition: backend/trt/loader.py:407
data_function.TextMelCollate.__init__
def __init__(self, n_frames_per_step)
Definition: tacotron2/data_function.py:86
nvinfer1::CUDADriverWrapper::cuLaunchKernel
CUresult cuLaunchKernel(CUfunction f, unsigned int gridDimX, unsigned int gridDimY, unsigned int gridDimZ, unsigned int blockDimX, unsigned int blockDimY, unsigned int blockDimZ, unsigned int sharedMemBytes, CUstream hStream, void **kernelParams, void **extra) const
Definition: cudaDriverWrapper.cpp:132
chptToBin.save_layer_weights
def save_layer_weights(data, list_keys, dims, footer_string, file_name)
Definition: chptToBin.py:331
MaskRCNNUtils::resizeMask
PPM< uint8_t > resizeMask(const BBoxInfo &box, const float mask_threshold)
Definition: sampleUffMaskRCNN.cpp:190
polygraphy.backend.onnx.runner.OnnxTfRunner._model
_model
Definition: backend/onnx/runner.py:35
AlgorithmCacheWriter::reportAlgorithms
void reportAlgorithms(const nvinfer1::IAlgorithmContext *const *algoContexts, const nvinfer1::IAlgorithm *const *algoChoices, int32_t nbAlgorithms) noexcept override
Called by TensorRT to report the choices it made.
Definition: sampleAlgorithmSelector.cpp:76
pytorch_quantization.nn.modules.quant_rnn.QuantRNNBase.reset_parameters
def reset_parameters(self)
Definition: modules/quant_rnn.py:147
data_functions.get_collate_function
def get_collate_function(model_name, n_frames_per_step)
Definition: data_functions.py:25
arg_parser.parse_waveglow_args
def parse_waveglow_args(parent, add_help=False)
Definition: waveglow/arg_parser.py:19
pytorch_quantization.calib.max.MaxCalibrator.amaxs
def amaxs(self)
Definition: max.py:46
nvinfer1::plugin::MultilevelProposeROI::getOutputDataType
DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: multilevelProposeROIPlugin.cpp:429
SampleUffFasterRcnnParams::profile
bool profile
Definition: sampleUffFasterRCNN.cpp:83
half_float::half::data_
detail::uint16 data_
Internal binary representation.
Definition: ieee_half.h:1197
nvinfer1::plugin::SplitPlugin::_output_lengths
std::vector< int > _output_lengths
Definition: split.h:41
pytorch_quantization.nn.modules.quant_rnn.QuantRNNBase._data_ptrs
_data_ptrs
Definition: modules/quant_rnn.py:111
SampleUffFasterRcnnParams::outputClassSize
int outputClassSize
Definition: sampleUffFasterRCNN.cpp:71
nvinfer1::plugin::LReLU::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: lReluPlugin.cpp:115
pytorch_quantization.nn.modules.quant_conv._QuantConvNd
Definition: quant_conv.py:41
pytorch_quantization.tensor_quant.ScaledQuantDescriptor._narrow_range
_narrow_range
Definition: tensor_quant.py:105
bert::FusedMultiHeadAttentionKernelMetaInfoV2::mUnrollStep
unsigned int mUnrollStep
Definition: fused_multihead_attention_v2.h:169
tools.test_run.TestOther.test_0_iterations
def test_0_iterations(self)
Definition: test_run.py:222
bert::QKVToContextInterleavedPlugin::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: qkvToContextInt8InterleavedPlugin.cpp:187
test_ir.toposort_multi_tier_input_graph
def toposort_multi_tier_input_graph()
Definition: test_ir.py:430
polygraphy.backend.onnx.util.str_from_onnx_graph
def str_from_onnx_graph(graph, mode, tensors, indent_level=0)
Definition: onnx/util.py:194
half_float::detail::functions::pow
static expr pow(float base, float exp)
Power implementation.
Definition: samples/common/half.h:1878
nmtSample::LSTMDecoder::getInfo
std::string getInfo() override
get the textual description of the component
Definition: lstmDecoder.cpp:142
polygraphy.backend.onnx.util.get_tensor_metadata
def get_tensor_metadata(tensors)
Definition: onnx/util.py:150
tests.tensor_quant_test.TestQuantDescriptor
Definition: tensor_quant_test.py:269
sample::anonymous_namespace{sampleEngines.cpp}::RndInt8Calibrator::getBatchSize
int getBatchSize() const override
Get the batch size used for calibration batches.
Definition: sampleEngines.cpp:173
nvinfer1::plugin::Region::enqueue
int enqueue(int batchSize, const void *const *inputs, void **outputs, void *workspace, cudaStream_t stream) override
Definition: regionPlugin.cpp:246
bert::fused_multihead_attention_v2_fp16_64_64_kernel_sm75_cubin_len
unsigned int fused_multihead_attention_v2_fp16_64_64_kernel_sm75_cubin_len
Definition: fused_multihead_attention_v2_fp16_64_64_kernel.sm75.cpp:1620
onnx_graphsurgeon.ir.tensor.Constant
Definition: tensor.py:186
UffPoolPluginV2::getNbOutputs
int getNbOutputs() const override
Get the number of outputs from the layer.
Definition: sampleUffPluginV2Ext.cpp:386
SampleINT8APIArgs
The SampleINT8APIArgs structure groups the additional arguments required by the INT8 API sample.
Definition: sampleINT8API.cpp:647
nvcaffeparser1::parseReLU
nvinfer1::ILayer * parseReLU(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &, BlobNameToTensor &tensors)
Definition: parseReLU.cpp:23
SampleINT8APIParams::writeNetworkTensors
bool writeNetworkTensors
Definition: sampleINT8API.cpp:58
TLTMaskRCNNConfig::MODEL_DETECTION_SHAPE
static const Dims2 MODEL_DETECTION_SHAPE
Definition: tlt_mrcnn_config.h:174
polygraphy.tools.args.logger.LoggerArgs.verbosity_count
verbosity_count
Definition: Polygraphy/polygraphy/tools/args/logger.py:32
nvinfer1::INetworkDefinition::getInput
virtual ITensor * getInput(int32_t index) const =0
Get the input tensor specified by the given index.
nvinfer1::plugin::ProposalPluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: proposalPlugin.cpp:339
polygraphy.tools.args.trt_legacy.TrtLegacyArgs.register
def register(self, maker)
Definition: tools/args/trt_legacy.py:34
nvinfer1::plugin::Region::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: regionPlugin.cpp:264
helpers.data_processing.get_final_text
def get_final_text(pred_text, orig_text, do_lower_case)
Definition: data_processing.py:224
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer.axis
def axis(self)
Definition: tensor_quantizer.py:139
OutputParams::allItems
std::vector< int32_t > allItems
Definition: sampleMovieLens.cpp:54
helpers.calibrator.BertCalibrator.max_query_length
max_query_length
Definition: demo/BERT/helpers/calibrator.py:43
parserutils::elementSize
unsigned int elementSize(nvinfer1::DataType t)
Definition: parserUtils.h:74
IInt8EntropyCalibrator2
generate.reshape
def reshape(self, data, shape)
Definition: 09_shape_operations_with_the_layer_api/generate.py:36
bert::Fused_multihead_attention_params_v2::scale_bmm1
uint32_t scale_bmm1
Definition: fused_multihead_attention_v2.h:58
comparator.test_struct.TestRunResults.test_contains
def test_contains(self, run_results)
Definition: comparator/test_struct.py:87
nvinfer1::plugin::detail::umulhi
__host__ __device__ __forceinline__ unsigned int umulhi(unsigned int x, unsigned int y)
Definition: reducedMathPlugin.h:35
sample::InferenceTrace::computeStart
float computeStart
Definition: sampleReporting.h:96
half_float::half::operator-=
half & operator-=(float rhs)
Arithmetic assignment.
Definition: samples/common/half.h:1497
polygraphy.tools.precision.precision.STWorstFirst.find
def find(self)
Definition: precision.py:180
nvinfer1::plugin::DetectionLayer::canBroadcastInputAcrossBatch
bool canBroadcastInputAcrossBatch(int inputIndex) const override
Definition: detectionLayerPlugin.cpp:288
onnx_models.Model
Definition: onnx_models.py:34
nvinfer1::plugin::SplitPlugin::_z_stride
int _z_stride
Definition: split.h:43
nvinfer1::IInt8MinMaxCalibrator::getAlgorithm
CalibrationAlgoType getAlgorithm() override
Signal that this is the MinMax Calibrator.
Definition: NvInfer.h:6099
nvcaffeparser1::IPluginFactory::~IPluginFactory
virtual ~IPluginFactory()
Definition: NvCaffeParser.h:108
nvinfer1::IRefitter::refitCudaEngine
virtual bool refitCudaEngine()=0
Updates associated engine.
UffPoolPluginV2::getOutputDataType
DataType getOutputDataType(int index, const DataType *inputTypes, int nbInputs) const override
Definition: sampleUffPluginV2Ext.cpp:530
nmtSample::LSTMDecoder::mRNNKind
bool mRNNKind
Definition: lstmDecoder.h:49
nvinfer1::plugin::CudnnError::CudnnError
CudnnError(const char *fl, const char *fn, int ln, int stat, const char *msg=nullptr)
Definition: checkMacrosPlugin.h:113
nvinfer1::plugin::CudaError
Definition: checkMacrosPlugin.h:101
nvcaffeparser1::parseReduction
nvinfer1::ILayer * parseReduction(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &weightFactory, BlobNameToTensor &tensors)
Definition: parseReduction.cpp:23
polygraphy.logger.logger.Logger.critical
def critical(self, message)
Definition: Polygraphy/polygraphy/logger/logger.py:287
polygraphy.backend.onnx.loader.SaveOnnx._model
_model
Definition: backend/onnx/loader.py:162
models.parse_model_args
def parse_model_args(model_name, parser, add_help=False)
Definition: demo/Tacotron2/models.py:26
anonymous_namespace{pluginLogging.h}::LOG_WARN
LogStreamConsumer LOG_WARN(const Logger &logger)
produces a LogStreamConsumer object that can be used to log messages of severity kWARNING
Definition: pluginLogging.h:497
tests.quant_linear_test.TestQuantLinear
Definition: quant_linear_test.py:44
MultilevelProposeROIWorkSpace::argMaxLabelDims
nvinfer1::DimsHW argMaxLabelDims
Definition: maskRCNNKernels.h:166
nvinfer1::plugin::MultilevelProposeROIPluginCreator::deserializePlugin
IPluginV2Ext * deserializePlugin(const char *name, const void *data, size_t length) override
Called during deserialization of plugin layer.
Definition: multilevelProposeROIPlugin.cpp:106
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer.load_calib_amax
def load_calib_amax(self, *args, **kwargs)
Definition: tensor_quantizer.py:218
nvinfer1::IPluginV2Ext::clone
virtual IPluginV2Ext * clone() const =0
Clone the plugin object.
nvinfer1::plugin::LogStream::Buf::sync
int sync() override
Definition: checkMacrosPlugin.cpp:31
tests.utils.quantize_by_range
def quantize_by_range(x, num_bits)
Definition: tools/pytorch-quantization/tests/utils.py:26
sample::InferenceOptions::sleep
int sleep
Definition: sampleOptions.h:152
nvinfer1::plugin::PluginCreatorRegistry::~PluginCreatorRegistry
~PluginCreatorRegistry()
Definition: InferPlugin.cpp:127
nmtSample::SLPProjection::addToModel
void addToModel(nvinfer1::INetworkDefinition *network, nvinfer1::ITensor *input, nvinfer1::ITensor **outputLogits) override
add raw logits to the network
Definition: slpProjection.cpp:41
sample::anonymous_namespace{sampleInference.cpp}::EnqueueGraph::mGraph
TrtCudaGraph & mGraph
Definition: sampleInference.cpp:247
SampleUffFasterRcnn::mParams
SampleUffFasterRcnnParams mParams
The parameters for the sample.
Definition: sampleUffFasterRCNN.cpp:121
nvinfer1::plugin::Reorg::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: reorgPlugin.cpp:72
model.WaveGlow.infer
def infer(self, spect, sigma=1.0)
Definition: demo/Tacotron2/waveglow/model.py:231
polygraphy.tools.args.tf.runner.TfRunnerArgs.register
def register(self, maker)
Definition: tools/args/tf/runner.py:27
SampleCharRNNParams::outputSentences
std::vector< std::string > outputSentences
Definition: sampleCharRNN.cpp:132
anonymous_namespace{regionPlugin.cpp}::SoftmaxTreeDeleter
Definition: regionPlugin.cpp:46
polygraphy.logger.logger.LoggerVerbosity.old_severity
old_severity
Definition: Polygraphy/polygraphy/logger/logger.py:61
sample::anonymous_namespace{sampleInference.cpp}::Iteration::getEvent
TrtCudaEvent & getEvent(EventType t)
Definition: sampleInference.cpp:392
sample::anonymous_namespace{sampleEngines.cpp}::RndInt8Calibrator::writeCalibrationCache
virtual void writeCalibrationCache(const void *, size_t) override
Definition: sampleEngines.cpp:180
nmtSample::SLPProjection
Linear logits calculation.
Definition: slpProjection.h:36
trt_utils.load_engine
def load_engine(engine_filepath, trt_logger)
Definition: trt_utils.py:46
nvinfer1::plugin::Normalize::Normalize
Normalize(const Weights *weights, int nbWeights, bool acrossSpatial, bool channelShared, float eps)
Definition: normalizePlugin.cpp:38
SamplePlugin::~SamplePlugin
~SamplePlugin()
Definition: samplePlugin.cpp:66
inference_varseqlen.question_features
def question_features(tokens, question)
Definition: inference_varseqlen.py:113
nvinfer1::plugin::InstanceNormalizationPlugin::getOutputDataType
DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: instanceNormalizationPlugin.cpp:272
setup.no_publish
def no_publish()
Definition: onnx-graphsurgeon/setup.py:21
samplesCommon::PPM::w
int w
Definition: samples/common/common.h:655
nvinfer1::plugin::InstanceNormalizationPlugin::serialize
void serialize(void *buffer) const override
Serialize the layer.
Definition: instanceNormalizationPlugin.cpp:223
half_float::detail::round_half_impl
uint16 round_half_impl(uint16 value)
Round half-precision number to nearest integer value.
Definition: ieee_half.h:1008
nvinfer1::plugin::GridAnchorGenerator::getPluginNamespace
const char * getPluginNamespace() const override
Return the namespace of the plugin object.
Definition: gridAnchorPlugin.cpp:295
polygraphy.backend.onnx.util.get_values
def get_values(tensor)
Definition: onnx/util.py:144
bert::Fused_multihead_attention_params::o_ptr
void * o_ptr
Definition: fused_multihead_attention.h:55
nvinfer1::plugin::BatchedNMSPlugin::setPluginNamespace
void setPluginNamespace(const char *libNamespace) override
Set the namespace that this plugin object belongs to.
Definition: batchedNMSPlugin.cpp:409
nvinfer1::LayerType::kMATRIX_MULTIPLY
@ kMATRIX_MULTIPLY
Matrix multiply layer.
nvinfer1::plugin::CropAndResizePlugin::mNumboxes
size_t mNumboxes
Definition: cropAndResizePlugin.h:101
bert::launch_small
void launch_small(cudaStream_t stream, const int ld, const int total, const int8_t *input, const int8_t *skip, const half *beta, const half *gamma, int8_t *output, const float dqScaleIn, const float dqScaleSkip, const float qScale)
SampleUffFasterRcnn
Definition: sampleUffFasterRCNN.cpp:93
half_float::detail::islessgreater
bool islessgreater(half x, half y)
Comparison for less or greater.
Definition: ieee_half.h:2817
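A "less or greater" comparison differs from a plain inequality test only for unordered operands (NaN). The sketch below is a Python analogue that assumes the semantics of the C standard library's islessgreater; it operates on Python floats rather than half values, and it is not the code in ieee_half.h.

```python
import math


def islessgreater(x: float, y: float) -> bool:
    """True if x < y or x > y; False if they are equal or either one is NaN."""
    # Both ordered comparisons are False when an operand is NaN,
    # so unordered inputs fall through to False.
    return x < y or x > y


if __name__ == "__main__":
    print(islessgreater(1.0, 2.0))
    print(islessgreater(2.0, 2.0))
    print(islessgreater(math.nan, 1.0))
```

Note that `x != y` would report True for a NaN operand, which is why the two ordered comparisons are spelled out.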
backend.trt.test_runner.TestTrtRunner.test_can_name_runner
def test_can_name_runner(self)
Definition: trt/test_runner.py:36
nvinfer1::IPluginV2::getSerializationSize
virtual size_t getSerializationSize() const =0
Find the size of the serialization buffer required.
sample::TrtCudaEvent::TrtCudaEvent
TrtCudaEvent(bool blocking=true)
Definition: sampleDevice.h:110
nvinfer1::plugin::RPROIPlugin::mPluginNamespace
std::string mPluginNamespace
Definition: nvFasterRCNNPlugin.h:95
sample::DeviceAllocator
Definition: sampleDevice.h:273
nmtSample::BenchmarkWriter::BenchmarkWriter
BenchmarkWriter()
Definition: benchmarkWriter.cpp:24
Logger::defineTest
static TestAtom defineTest(const std::string &name, const std::string &cmdline)
Define a test for logging.
Definition: pluginLogging.h:318
nvinfer1::plugin::MultilevelCropAndResize::mROICount
int mROICount
Definition: multilevelCropAndResizePlugin.h:97
polygraphy.util.misc.default_value
def default_value(value, default)
Definition: Polygraphy/polygraphy/util/misc.py:101
SharedMemory::open_rw
int open_rw()
Definition: sampleMovieLensMPS.cpp:136
Args::sequenceLength
int sequenceLength
Definition: demo/BERT/infer_c/common.h:31
SampleSSDParams
The SampleSSDParams structure groups the additional parameters required by the SSD sample.
Definition: sampleSSD.cpp:47
nmtSample::PinnedHostBuffer::PinnedHostBuffer
PinnedHostBuffer(size_t elementCount)
Definition: pinnedHostBuffer.h:37
nvinfer1::plugin::BatchedNMSBasePluginCreator::params
NMSParameters params
Definition: batchedNMSPlugin.h:133
polygraphy.backend.onnx.util.all_tensor_names
def all_tensor_names(model)
Definition: onnx/util.py:56
test_ir.build_two_layer_graph_multiple_io
def build_two_layer_graph_multiple_io()
Definition: test_ir.py:358
pytorch_quantization.nn.modules.tensor_quantizer
Definition: tensor_quantizer.py:1
SampleConfig::mVerbosity
Verbosity mVerbosity
Definition: sampleConfig.h:53
sample::anonymous_namespace{sampleInference.cpp}::EventType::kOUTPUT_E
@ kOUTPUT_E
builder.BertConfig.num_attention_heads
num_attention_heads
Definition: builder.py:92
samplesCommon::SampleParams::int8
bool int8
Allow running the network in Int8 mode.
Definition: argsParser.h:40
SampleUffMNIST
The SampleUffMNIST class implements the UffMNIST sample.
Definition: sampleUffMNIST.cpp:53
polygraphy.tools.args.onnx.loader.OnnxLoaderArgs.add_serialized_onnx_loader
def add_serialized_onnx_loader(self, script, disable_outputs=None)
Definition: tools/args/onnx/loader.py:111
tests.quant_linear_test.TestQuantLinear.test_fake_quant_per_channel
def test_fake_quant_per_channel(self)
Definition: quant_linear_test.py:136
nvinfer1::plugin::BatchedNMSDynamicPlugin::mPrecision
DataType mPrecision
Definition: batchedNMSPlugin.h:118
nvuffparser::IPluginFactoryExt
Plugin factory used to configure plugins with added support for TRT versioning.
Definition: NvUffParser.h:129
polygraphy.backend.onnx.util.str_from_onnx
def str_from_onnx(model, mode="full")
Definition: onnx/util.py:168
nvcaffeparser1::WeightType::kMOVING_AVERAGE
@ kMOVING_AVERAGE
nvinfer1::plugin::MultilevelProposeROI
Definition: multilevelProposeROIPlugin.h:35
tests.quant_rnn_test.TestQuantLSTMCell.test_no_quant_input_hidden
def test_no_quant_input_hidden(self, verbose)
Definition: quant_rnn_test.py:85
AlgorithmCacheWriter::AlgorithmCacheWriter
AlgorithmCacheWriter(const std::string &cacheFileName)
Definition: sampleAlgorithmSelector.cpp:108
nvcaffeparser1::parseScale
nvinfer1::ILayer * parseScale(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &weightFactory, BlobNameToTensor &tensors)
Definition: parseScale.cpp:23
export_tacotron2_onnx.DecoderIter.decoder_rnn
decoder_rnn
Definition: export_tacotron2_onnx.py:123
SampleUffSSDParams::keepTopK
int32_t keepTopK
The maximum number of detections post-NMS.
Definition: sampleUffSSD.cpp:55
tests.quant_conv_transposed_test.TestQuantConvTranspose3D
Definition: quant_conv_transposed_test.py:238
bert::QKVToContextInterleavedPluginCreator::QKVToContextInterleavedPluginCreator
QKVToContextInterleavedPluginCreator()
Definition: qkvToContextInt8InterleavedPlugin.cpp:276
nvinfer1::plugin::ResizeNearestPluginCreator
Definition: resizeNearestPlugin.h:96
half_float::detail::functions::sinh
static expr sinh(float arg)
Hyperbolic sine implementation.
Definition: samples/common/half.h:1943
polygraphy.comparator.struct.IterationResult.runner_name
runner_name
Definition: comparator/struct.py:46
pytorch_quantization.nn.modules.quant_conv.QuantConvTranspose3d.forward
def forward(self, input, output_size=None)
Definition: quant_conv.py:378
nvinfer1::EnumMax< ReduceOperation >
constexpr int32_t EnumMax< ReduceOperation >()
Maximum number of elements in ReduceOperation enum.
Definition: NvInfer.h:3567
SampleCharRNNWeightNames::FCB_NAME
const std::string FCB_NAME
Definition: sampleCharRNN.cpp:78
half_float::detail::ilogb
int ilogb(half arg)
Extract exponent.
Definition: ieee_half.h:2669
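"Extract exponent" here means the unbiased base-2 exponent of the argument. As a hedged illustration, the sketch below computes the same quantity for Python floats with math.frexp. The function name mirrors the entry, but the error handling is an assumption: C's ilogb returns special constants for zero, infinity, and NaN, whereas this sketch raises ValueError.

```python
import math


def ilogb(arg: float) -> int:
    """Return floor(log2(|arg|)) for a finite, nonzero float."""
    if arg == 0.0 or math.isinf(arg) or math.isnan(arg):
        # Assumption: reject special values instead of returning FP_ILOGB0 / FP_ILOGBNAN.
        raise ValueError("ilogb sketch is only defined for finite, nonzero input")
    # frexp returns (m, e) with arg == m * 2**e and 0.5 <= |m| < 1,
    # so the exponent of the normalized form 1.x * 2**k is k = e - 1.
    _, exponent = math.frexp(arg)
    return exponent - 1


if __name__ == "__main__":
    print(ilogb(8.0), ilogb(1.0), ilogb(0.75))
```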
nmtSample::Embedder::Embedder
Embedder()=default
test_ir.TestNode.test_equals_num_inputs_mismatch
def test_equals_num_inputs_mismatch(self)
Definition: test_ir.py:151
half_float::detail::half2int_impl
T half2int_impl(uint16 value)
Convert half-precision floating point to integer.
Definition: ieee_half.h:957
nvinfer1::plugin::ProposalLayer::supportsFormat
bool supportsFormat(DataType type, PluginFormat format) const override
Check format support.
Definition: proposalLayerPlugin.cpp:160
nvinfer1::plugin::Reorg::W
int W
Definition: reorgPlugin.h:87
polygraphy.tools.args.trt.loader.TrtLoaderArgs.fp16
fp16
Definition: tools/args/trt/loader.py:100
half_float::detail::functions::rint
static half rint(half arg)
Nearest integer implementation.
Definition: samples/common/half.h:2118
nmtSample::BeamSearchPolicy::mCandidates
std::vector< std::vector< int > > mCandidates
Definition: beamSearchPolicy.h:76
MaskRCNNUtils::BBoxInfo::box
samplesCommon::BBox box
Definition: sampleUffMaskRCNN.cpp:64
nvinfer1::Dims::d
int32_t d[MAX_DIMS]
The extent of each dimension.
Definition: NvInferRuntimeCommon.h:199
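Since d holds the extent of each dimension, the number of elements in a tensor is the product of those extents. A minimal sketch, assuming the extents are available as a plain Python sequence; the helper name volume is hypothetical and the code does not call the TensorRT API.

```python
import math
from typing import Sequence


def volume(extents: Sequence[int]) -> int:
    """Return the element count of a tensor with the given per-dimension extents."""
    # math.prod returns 1 for an empty sequence, which matches a scalar (0-D) shape.
    return math.prod(extents)


if __name__ == "__main__":
    print(volume([1, 3, 224, 224]))
```

Multiplying the result by the element size in bytes gives the buffer size needed for a tensor of that shape.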
pytorch_quantization.nn._functions.quant_rnn.RNN
def RNN(*args, **kwargs)
Definition: _functions/quant_rnn.py:279
MaskRCNNUtils::readPPMFile
void readPPMFile(const std::string &filename, PPM< uint8_t > &ppm)
Definition: sampleUffMaskRCNN.cpp:79
polygraphy.tools.args.data_loader.DataLoaderArgs.seed
seed
Definition: tools/args/data_loader.py:59
helpers.tokenization.FullTokenizer.vocab
vocab
Definition: tokenization.py:164
polygraphy.tools.inspect.subtool.model.Model.inspect_tf
def inspect_tf(self, args)
Definition: tools/Polygraphy/polygraphy/tools/inspect/subtool/model.py:89
sample::Bindings::fill
void fill(int binding, const std::string &fileName)
Definition: sampleUtils.h:361
polygraphy.backend.cntk.runner.CNTKRunner
Definition: backend/cntk/runner.py:25
SampleGoogleNet::SampleGoogleNet
SampleGoogleNet(const samplesCommon::CaffeSampleParams &params)
Definition: sampleGoogleNet.cpp:52
sample::TrtCudaBuffer::~TrtCudaBuffer
~TrtCudaBuffer()
Definition: sampleDevice.h:239
pytorch_quantization.nn.modules._utils.pop_quant_desc_in_kwargs
def pop_quant_desc_in_kwargs(quant_cls, input_only=False, **kwargs)
Definition: _utils.py:139
nvuffparser::IUffParser
Class used for parsing models described using the UFF format.
Definition: NvUffParser.h:152
models.meta.Model.loader
loader
Definition: meta.py:33
RefineDetectionWorkSpace::sortClassLabelDims
nvinfer1::DimsHW sortClassLabelDims
Definition: maskRCNNKernels.h:101
test_infer_trt.load_and_setup_model
def load_and_setup_model(model_name, parser, checkpoint, amp_run, to_cuda=True)
Definition: test_infer_trt.py:72
nvinfer1::plugin::RPROIPlugin::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: nvFasterRCNNPlugin.cpp:219
polygraphy.backend.trt.loader.CreateConfig.int8
int8
Definition: backend/trt/loader.py:324
pytorch_quantization.nn.modules.tensor_quantizer.TensorQuantizer._if_calib
_if_calib
Definition: tensor_quantizer.py:86
BufferDesc::getBufferSize
int getBufferSize()
Definition: sampleReformatFreeIO.cpp:127
polygraphy.backend.trt.loader.LoadPlugins.plugins
plugins
Definition: backend/trt/loader.py:49
pytorch_quantization.calib.histogram.HistogramCalibrator
Definition: histogram.py:30
onnx_graphsurgeon.importers.onnx_importer.import_onnx
Graph import_onnx("onnx.ModelProto" onnx_model)
Definition: onnx_importer.py:225
backend.trt.test_loader.TestProfile.test_can_add
def test_can_add(self)
Definition: trt/test_loader.py:161
SampleOnnxMNIST::constructNetwork
bool constructNetwork(SampleUniquePtr< nvinfer1::IBuilder > &builder, SampleUniquePtr< nvinfer1::INetworkDefinition > &network, SampleUniquePtr< nvinfer1::IBuilderConfig > &config, SampleUniquePtr< nvonnxparser::IParser > &parser)
Parses an ONNX model for MNIST and creates a TensorRT network.
Definition: sampleOnnxMNIST.cpp:163
sample::BuildOptions::disabledTactics
nvinfer1::TacticSources disabledTactics
Definition: sampleOptions.h:128
LogStreamConsumer::mSeverity
Severity mSeverity
Definition: pluginLogging.h:181
nvinfer1::plugin::BatchTilePlugin::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: batchTilePlugin.cpp:122
tests.calibrator_test.TestPercentileCalibrator.test_one_tensor
def test_one_tensor(self, verbose)
Definition: calibrator_test.py:251
audio_processing.dynamic_range_compression
def dynamic_range_compression(x, C=1, clip_val=1e-5)
Definition: audio_processing.py:94
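The index records only the signature of this function. A common form of dynamic range compression in Tacotron2-style audio pipelines is the logarithm of the clamped input scaled by C; the sketch below assumes that form and works on plain Python lists rather than tensors. The name `dynamic_range_compression_sketch` is hypothetical and the body is an assumption, not the code at audio_processing.py:94.

```python
import math


def dynamic_range_compression_sketch(values, C=1, clip_val=1e-5):
    """Log-compress each value after clamping it from below to clip_val.

    Assumed form: log(max(x, clip_val) * C). Clamping keeps the
    logarithm finite for zero or negative inputs.
    """
    return [math.log(max(v, clip_val) * C) for v in values]
```

With the defaults, an input of 1.0 maps to 0.0, and any input at or below 1e-5 maps to log(1e-5).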
bert::QKVToContextInterleavedPluginCreator::getPluginName
const char * getPluginName() const override
Return the plugin name.
Definition: qkvToContextInt8InterleavedPlugin.cpp:282
nvinfer1::plugin::ProposalLayer::mAnchorBoxesHost
std::vector< float > mAnchorBoxesHost
Definition: proposalLayerPlugin.h:107
nvcaffeparser1::parseBatchNormalization
nvinfer1::ILayer * parseBatchNormalization(nvinfer1::INetworkDefinition &network, const trtcaffe::LayerParameter &msg, CaffeWeightFactory &weightFactory, BlobNameToTensor &tensors)
Definition: parseBatchNorm.cpp:80
half_float::detail::enable< T, expr, half, expr >::type
T type
Definition: ieee_half.h:384
nmtSample::Alignment::addAttentionKeys
virtual void addAttentionKeys(nvinfer1::INetworkDefinition *network, nvinfer1::ITensor *memoryStates, nvinfer1::ITensor **attentionKeys)=0
Add attention keys calculation (from source memory states) to the network.
backend.trt.test_loader.TestModifyNetwork.test_mark_custom_outputs
def test_mark_custom_outputs(self, identity_identity_network)
Definition: trt/test_loader.py:130
helpers.data_processing.convert_example_to_features
def convert_example_to_features(doc_tokens, question_text, tokenizer, max_seq_length, doc_stride, max_query_length)
Definition: data_processing.py:86
polygraphy.tools.util.script.Script.append_suffix
def append_suffix(self, line)
Definition: script.py:221
polygraphy.backend.tf.loader.GraphFromFrozen.path
path
Definition: backend/tf/loader.py:147
half_float::detail::functions::ilogb
static int ilogb(half arg)
Exponent implementation.
Definition: samples/common/half.h:2236
nvinfer1::ITopKLayer
Layer that represents a TopK reduction.
Definition: NvInfer.h:4075
polygraphy.tools.args.tf2onnx.loader.Tf2OnnxLoaderArgs.add_to_parser
def add_to_parser(self, parser)
Definition: tools/args/tf2onnx/loader.py:23
polygraphy.util.cuda.Cuda.dtoh
def dtoh(self, dst, src, nbytes, stream=None)
Definition: cuda.py:76
anonymous_namespace{multilevelProposeROIPlugin.cpp}::MULTILEVELPROPOSEROI_PLUGIN_NAME
const char * MULTILEVELPROPOSEROI_PLUGIN_NAME
Definition: multilevelProposeROIPlugin.cpp:35
polygraphy.util.cuda.Cuda.check
def check(self, status)
Definition: cuda.py:39
samplesCommon
Definition: argsParser.h:29
tests.utils.quant_np
def quant_np(x, amax, num_bits=8, fake=False, narrow_range=True)
Definition: tools/pytorch-quantization/tests/utils.py:118
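The parameters in this signature suggest a symmetric linear quantizer used as a test reference. The sketch below shows one plausible reading in pure Python: scale by `int_max / amax`, clip to the integer range, round, and optionally rescale ("fake" quantization). The name `quant_sketch` is hypothetical and the behavior is an assumption, not a copy of the code in tests/utils.py.

```python
def quant_sketch(values, amax, num_bits=8, fake=False, narrow_range=True):
    """Symmetric linear quantization of a list of floats (illustrative only)."""
    int_max = 2.0 ** (num_bits - 1) - 1.0
    # narrow_range drops the most negative code so the range is symmetric.
    int_min = -int_max if narrow_range else -int_max - 1.0
    scale = int_max / amax
    out = []
    for v in values:
        # Clip first, then round (Python rounds halves to the nearest even integer).
        q = float(round(min(max(v * scale, int_min), int_max)))
        # Fake quantization maps the integer code back to the float domain.
        out.append(q / scale if fake else q)
    return out
```

With `amax=1.0` and 8 bits, an input of 1.0 maps to 127.0, and an input of -2.0 saturates at -127.0 (or -128.0 when `narrow_range=False`).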
test_examples.test_cli_inspect_examples
def test_cli_inspect_examples(example)
Definition: Polygraphy/tests/test_examples.py:135
bert::embSkipLayerNormVarSeqlen
int embSkipLayerNormVarSeqlen(cudaStream_t stream, int ld, int B, int S, const uint32_t *cuSeqlens, const int *inputIds, const int *token_ids, const T *beta, const T *gamma, const T *wordEmb, const T *posEmb, const T *tokEmb, T *output)
test_ir.TestNode.test_can_copy_outputs_from_other_node
def test_can_copy_outputs_from_other_node(self)
Definition: test_ir.py:184
MaskRCNNConfig::MODEL_DETECTION_SHAPE
static const Dims2 MODEL_DETECTION_SHAPE
Definition: plugin/proposalLayerPlugin/mrcnn_config.h:174
half_float::detail::frexp
half frexp(half arg, int *exp)
Decompose floating-point number into significand and exponent.
Definition: ieee_half.h:2627
samplesCommon::CaffeSampleParams::weightsFileName
std::string weightsFileName
Filename of trained weights file of a network.
Definition: argsParser.h:54
bert::DATA_TYPE_FP32
@ DATA_TYPE_FP32
Definition: fused_multihead_attention_common.h:28
UffPoolPluginV2::getSerializationSize
size_t getSerializationSize() const override
Find the size of the serialization buffer required.
Definition: sampleUffPluginV2Ext.cpp:459
nvinfer1::CalibrationAlgoType
CalibrationAlgoType
enum CalibrationAlgoType
Definition: NvInfer.h:5972
ParserOnnxConfig::destroy
virtual void destroy()
Destroy IOnnxConfig object.
Definition: parserOnnxConfig.h:146
half_float::detail::conditional< false, T, F >::type
F type
Definition: samples/common/half.h:329
bert::FusedMultiHeadAttentionKernelMetaInfoV2
Definition: fused_multihead_attention_v2.h:158
nvcaffeparser1::CaffeParser::parseBinaryProto
IBinaryProtoBlob * parseBinaryProto(const char *fileName) override
Parse and extract data stored in binaryproto file.
Definition: caffeParser.cpp:654
Bbox::ymin
T ymin
Definition: bboxUtils.h:28
nvinfer1::IPluginV2::initialize
virtual int32_t initialize()=0
Initialize the layer for execution.
polygraphy.backend.tf.util.get_tensor_metadata
def get_tensor_metadata(tensors)
Definition: tf/util.py:71
polygraphy.tools.inspect.subtool.model.Model.inspect_trt
def inspect_trt(self, args)
Definition: tools/Polygraphy/polygraphy/tools/inspect/subtool/model.py:64
sample::anonymous_namespace{sampleInference.cpp}::StreamType::kNUM
@ kNUM
nvinfer1::EngineCapability::kSAFE_GPU
@ kSAFE_GPU
Safety restricted capability, TensorRT flow that can only run on GPU devices via TensorRT nvinfer1::s...
CudaBind::mSize
size_t mSize
Definition: maskRCNNKernels.h:65
pytorch_quantization.nn.modules.quant_pooling.QuantMaxPool1d.forward
def forward(self, input)
Definition: quant_pooling.py:45
sample::Binding::buffer
MirroredBuffer buffer
Definition: sampleUtils.h:218
pytorch_quantization.nn.modules.quant_rnn.QuantRNNCell.bias
bias
Definition: modules/quant_rnn.py:344
polygraphy.backend.base.runner.BaseRunner.activate
def activate(self)
Definition: backend/base/runner.py:83
polygraphy.backend.trt.util.get_output_metadata_from_engine
def get_output_metadata_from_engine(engine, start_binding, end_binding)
Definition: trt/util.py:381
SampleMovieLens::SampleUniquePtr
std::unique_ptr< T, samplesCommon::InferDeleter > SampleUniquePtr
Definition: sampleMovieLens.cpp:86
nvinfer1::plugin::RPROIPlugin
Definition: nvFasterRCNNPlugin.h:30
nvinfer1::plugin::InstanceNormalizationPlugin::setPluginNamespace
void setPluginNamespace(const char *pluginNamespace) override
Set the namespace that this plugin object belongs to.
Definition: instanceNormalizationPlugin.cpp:262
backend.trt.test_loader.identity_engine
def identity_engine()
Definition: trt/test_loader.py:33
nmtSample::Likelihood::ptr
std::shared_ptr< Likelihood > ptr
Definition: likelihood.h:36
nvinfer1::plugin::LReLU::destroy
void destroy() override
Destroy the plugin object.
Definition: lReluPlugin.cpp:120
SampleUffSSD::verifyOutput
bool verifyOutput(const samplesCommon::BufferManager &buffers)
Filters output detections and verifies results.
Definition: sampleUffSSD.cpp:323
nvinfer1::plugin::SplitPlugin::getPluginType
const char * getPluginType() const override
Return the plugin type.
Definition: split.h:110
nvinfer1::IRefitter
Updates weights in an engine.
Definition: NvInferRuntime.h:828
RefineNMSParameters
Definition: maskRCNNKernels.h:84
SampleMNISTAPIParams::inputH
int inputH
The input height.
Definition: sampleMNISTAPI.cpp:48
inference_trt.infer_tacotron2_trt
def infer_tacotron2_trt(encoder, decoder_iter, postnet, encoder_context, decoder_context, postnet_context, sequences, sequence_lengths, measurements, fp16)
Definition: inference_trt.py:175
nvinfer1::plugin::RegionPluginCreator::getFieldNames
const PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: regionPlugin.cpp:489
tests.functional_test.TestClip.test_raise
def test_raise(self)
Definition: functional_test.py:44
bert::SkipLayerNormInterleavedPlugin::mParamWordsize
size_t mParamWordsize
Definition: skipLayerNormInt8InterleavedPlugin.h:80
samplesCommon::HostFree
Definition: buffers.h:207
fp16::bitwise_cast
T bitwise_cast(U u)
Definition: fp16.h:27
nvinfer1::plugin::Reorg::attachToContext
void attachToContext(cudnnContext *cudnnContext, cublasContext *cublasContext, IGpuAllocator *gpuAllocator) override
Attach the plugin object to an execution context and grant the plugin the access to some context reso...
Definition: reorgPlugin.cpp:170
polygraphy.tools.precision.precision.STCheckerBase.parser
parser
Definition: precision.py:78
half_float::detail::bits
Type traits for floating point bits.
Definition: ieee_half.h:315
Batch
Definition: sampleMovieLensMPS.cpp:189
BertInference::~BertInference
~BertInference()
Definition: bert_infer.h:324
polygraphy.tools.args.trt.loader.TrtLoaderArgs._network_api
_network_api
Definition: tools/args/trt/loader.py:28
test_ir.TestVariable.input_node
input_node
Definition: test_ir.py:113
generate.gather
def gather(self, data, indices)
Definition: 09_shape_operations_with_the_layer_api/generate.py:41
nvinfer1::ICudaEngine::destroy
virtual void destroy() noexcept=0
Destroy this object.
tests.quant_conv_test.TestQuantConv1D.test_fake_quant_per_channel
def test_fake_quant_per_channel(self)
Definition: quant_conv_test.py:360
nvinfer1::LayerType::kPOOLING
@ kPOOLING
Pooling layer.
nvinfer1::plugin::SpecialSlice::mBboxesCnt
int mBboxesCnt
Definition: specialSlicePlugin.h:95
BatchStream::BatchStream
BatchStream(int batchSize, int maxBatches, const std::string &prefix, const std::vector< std::string > &directories)
Definition: frcnnUtils.h:222
nvinfer1::plugin::ProposalPluginCreator
Definition: proposalPlugin.h:125
pytorch_quantization.calib.calibrator._Calibrator._axis
_axis
Definition: tools/pytorch-quantization/pytorch_quantization/calib/calibrator.py:33
sample::cudaCheck
void cudaCheck(cudaError_t ret, std::ostream &err=std::cerr)
Definition: sampleDevice.h:28
nvinfer1::plugin::PriorBoxParameters::clip
bool clip
Definition: NvInferPluginUtils.h:96
inference.MeasureTime.__init__
def __init__(self, measurements, key, cpu_run=False)
Definition: Tacotron2/inference.py:165
nvinfer1::plugin::TRTException
Definition: checkMacrosPlugin.h:74
sample::anonymous_namespace{sampleInference.cpp}::Iteration::syncAll
void syncAll(const TimePoint &cpuStart, const TrtCudaEvent &gpuStart, std::vector< InferenceTrace > &trace, bool skipTransfers)
Definition: sampleInference.cpp:356
BatchStream::mBatchCount
int mBatchCount
Definition: BatchStream.h:375
onnx_graphsurgeon.ir.graph.Graph.layer
def layer(self, inputs=[], outputs=[], *args, **kwargs)
Definition: graph.py:397
nmtSample::Vocabulary::mEosId
int mEosId
Definition: vocabulary.h:77
onnx_graphsurgeon.util.misc.SynchronizedList.field_name
field_name
Definition: onnx-graphsurgeon/onnx_graphsurgeon/util/misc.py:72
nvinfer1::plugin::DetectionLayer::getOutputDataType
DataType getOutputDataType(int index, const nvinfer1::DataType *inputTypes, int nbInputs) const override
Definition: detectionLayerPlugin.cpp:275
SampleMovieLens::processInput
bool processInput(const samplesCommon::BufferManager &buffers)
Copies a batch of input data from SampleMovieLensParams into managed input buffers.
Definition: sampleMovieLens.cpp:333
SampleINT8API::verifyOutput
bool verifyOutput(const samplesCommon::BufferManager &buffers) const
Verifies that the output is correct and prints it.
Definition: sampleINT8API.cpp:455
nvinfer1::INetworkDefinition::addUnary
virtual IUnaryLayer * addUnary(ITensor &input, UnaryOperation operation)=0
Add a unary layer to the network.
polygraphy.util.cuda.Stream.handle
handle
Definition: cuda.py:92
pytorch_quantization.nn.modules._utils.QuantMixin.default_quant_desc_weight
default_quant_desc_weight
Definition: _utils.py:34
nmtSample::DataReader::reset
virtual void reset()=0
Reset the reader position, the data reader is ready to read the data from the beginning again after t...
nvinfer1::CUDADriverWrapper::_cuModuleUnload
CUresult(* _cuModuleUnload)(CUmodule)
Definition: cudaDriverWrapper.h:73
nvinfer1::IFullyConnectedLayer::~IFullyConnectedLayer
virtual ~IFullyConnectedLayer()
Definition: NvInfer.h:1610
half_float::detail::unary_specialized< expr >::negate
static expr negate(float arg)
Definition: samples/common/half.h:2532
polygraphy.backend.onnxrt
Definition: tools/Polygraphy/polygraphy/backend/onnxrt/__init__.py:1
nvinfer1::DimensionOperation::kSUM
@ kSUM
Sum of the two operands.
bert::fused_multihead_attention_int8_384_64_kernel_sm75_cu_o_len
unsigned int fused_multihead_attention_int8_384_64_kernel_sm75_cu_o_len
Definition: fused_multihead_attention_int8_384_64_kernel.sm75.cpp:2688
MaskRCNNUtils::PPM::fileName
std::string fileName
Definition: sampleUffMaskRCNN.cpp:74
nvcaffeparser1::CaffeWeightFactory::mMsg
const trtcaffe::NetParameter & mMsg
Definition: caffeWeightFactory.h:58
EntropyCalibratorImpl::mInputBlobName
const char * mInputBlobName
Definition: EntropyCalibrator.h:90
samplesCommon::print_version
void print_version()
Definition: samples/common/common.h:483
nvinfer1::plugin::ProposalPlugin::attachToContext
void attachToContext(cudnnContext *cudnnContext, cublasContext *cublasContext, IGpuAllocator *gpuAllocator) override
Attach the plugin object to an execution context and grant the plugin the access to some context reso...
Definition: proposalPlugin.cpp:309
polygraphy.backend.trt.util
Definition: trt/util.py:1
SampleCharRNNBase::copyRNNOutputsToInputs
void copyRNNOutputsToInputs(samplesCommon::BufferManager &buffers)
Copies Ct/Ht output from the RNN to the Ct-1/Ht-1 input buffers for next time step.
Definition: sampleCharRNN.cpp:930
polygraphy.tools.args.onnx.loader.OnnxLoaderArgs.tf2onnx_loader_args
tf2onnx_loader_args
Definition: tools/args/onnx/loader.py:24
nvinfer1::INetworkDefinition::addRaggedSoftMax
virtual IRaggedSoftMaxLayer * addRaggedSoftMax(ITensor &input, ITensor &bounds)=0
Add a RaggedSoftMax layer to the network.
nvinfer1::plugin::GenerateDetection::canBroadcastInputAcrossBatch
bool canBroadcastInputAcrossBatch(int inputIndex) const override
Definition: generateDetectionPlugin.cpp:310
nvinfer1::plugin::BatchedNMSPluginCreator
Definition: batchedNMSPlugin.h:139
nvinfer1::plugin::ProposalPlugin::mAnchorSizes
std::vector< float > mAnchorSizes
Definition: proposalPlugin.h:119
evaluate-v1.exact_match_score
def exact_match_score(prediction, ground_truth)
Definition: evaluate-v1.1.py:60
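The file name suggests the SQuAD v1.1 evaluation script, where exact match compares answers after normalization. The sketch below assumes the usual normalization steps (lowercasing, stripping punctuation, dropping the articles a/an/the, collapsing whitespace). The helper `normalize_answer` and both function bodies are assumptions for illustration, not the code at evaluate-v1.1.py:60.

```python
import re
import string


def normalize_answer(text):
    """Lowercase, strip punctuation and articles, and collapse whitespace."""
    text = text.lower()
    punctuation = set(string.punctuation)
    text = "".join(ch for ch in text if ch not in punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match_score(prediction, ground_truth):
    """True when both strings are identical after normalization."""
    return normalize_answer(prediction) == normalize_answer(ground_truth)
```

Under these assumptions, "The Eiffel Tower!" and "eiffel tower" count as an exact match, while "Paris" and "London" do not.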
nvinfer1::plugin::DetectionLayer::terminate
void terminate() override
Release resources acquired during plugin layer initialization.
Definition: detectionLayerPlugin.cpp:136
bert::SkipLayerNormInterleavedPluginCreator::getFieldNames
const nvinfer1::PluginFieldCollection * getFieldNames() override
Return a list of fields that need to be passed to createPlugin.
Definition: skipLayerNormInt8InterleavedPlugin.cpp:285
model.WN.in_layers
in_layers
Definition: demo/Tacotron2/waveglow/model.py:95
sample::anonymous_namespace{sampleInference.cpp}::EnqueueImplicit::EnqueueImplicit
EnqueueImplicit(nvinfer1::IExecutionContext &context, void **buffers, int batch)
Definition: sampleInference.cpp:195
sample::Binding::volume
int volume
Definition: sampleUtils.h:219
nvinfer1::IPluginExt::getTensorRTVersion
virtual int32_t getTensorRTVersion() const
Return the API version with which this plugin was built.
Definition: NvInferRuntime.h:232
polygraphy.logger.logger.Logger.logger_callbacks
logger_callbacks
Definition: Polygraphy/polygraphy/logger/logger.py:135
bert::FusedMultiHeadAttentionKernelMetaInfoV2::mThreadsPerCTA
unsigned int mThreadsPerCTA
Definition: fused_multihead_attention_v2.h:168
tests.quant_pooling_test.TestQuantMaxPool2d.test_input_variable_bits
def test_input_variable_bits(self)
Definition: quant_pooling_test.py:80
tests.quant_conv_test.TestQuantConv3D.test_no_quant
def test_no_quant(self)
Definition: quant_conv_test.py:463
polygraphy.tools.args.tf.runner.TfRunnerArgs.timeline_path
timeline_path
Definition: tools/args/tf/runner.py:43
SampleAlgorithmSelector::mEngine
std::shared_ptr< nvinfer1::ICudaEngine > mEngine
The TensorRT engine used to run the network.
Definition: sampleAlgorithmSelector.cpp:357
polygraphy.tools.surgeon.subtool.extract.Extract
Definition: extract.py:24
tests.quant_pooling_test.TestQuantAvgPool3d.test_raise
def test_raise(self)
Definition: quant_pooling_test.py:199
nvinfer1::plugin::GenerateDetectionPluginCreator::getPluginVersion
const char * getPluginVersion() const override
Return the plugin version.
Definition: generateDetectionPlugin.cpp:56
nmtSample::SLPAttention::SLPAttention
SLPAttention(ComponentWeights::ptr weights)
Definition: slpAttention.cpp:24
sample::anonymous_namespace{sampleEngines.cpp}::CaffeBufferShutter
Definition: sampleEngines.cpp:43
nvinfer1::IAlgorithmContext::getName
virtual const char * getName() const =0
Return name of the algorithm node.
RefineDetectionWorkSpace
Definition: maskRCNNKernels.h:90
tools.test_run.TestTrt.test_trt_dynamic_input_shape
def test_trt_dynamic_input_shape(self)
Definition: test_run.py:107
nvinfer1::ITensor::isShapeTensor
virtual bool isShapeTensor() const =0
Whether the tensor is a shape tensor.
test_ir.TestGraph.test_layer_with_strings
def test_layer_with_strings(self)
Definition: test_ir.py:518
polygraphy.backend.pyt.runner.PytRunner.get_input_metadata
def get_input_metadata(self)
Definition: backend/pyt/runner.py:61
test_ir.TestVariable
Definition: test_ir.py:110
profiler.alignData
def alignData(reference, profile, threshold)
Definition: profiler.py:136
half_float::half::half
half()
Default constructor.
Definition: ieee_half.h:1102
nvinfer1::EnumMax< WeightsRole >
constexpr int32_t EnumMax< WeightsRole >()
Maximum number of elements in WeightsRole enum.
Definition: NvInferRuntime.h:696
TLTMaskRCNNConfig::RPN_ANCHOR_SCALE
static const float RPN_ANCHOR_SCALE
Definition: tlt_mrcnn_config.h:64
backend.tf.test_loader.TestSaveGraph
Definition: tf/test_loader.py:60
SampleMNIST::mInputDims
nvinfer1::Dims mInputDims
The dimensions of the input to the network.
Definition: sampleMNIST.cpp:98
nvinfer1::plugin::RPROIPluginCreator
Definition: nvFasterRCNNPlugin.h:103
test_ir.TensorBaseTests.test_o
def test_o(self)
Definition: test_ir.py:94
LogStreamConsumerBuffer::mPrefix
std::string mPrefix
Definition: pluginLogging.h:99
polygraphy.util.cuda.Cuda.malloc
def malloc(self, nbytes)
Definition: cuda.py:59
polygraphy.util.cuda.DeviceBuffer._check_dtype_matches
def _check_dtype_matches(self, host_buffer)
Definition: cuda.py:154
anonymous_namespace{cropAndResizePlugin.cpp}::CROP_AND_RESIZE_PLUGIN_NAME
static const char * CROP_AND_RESIZE_PLUGIN_NAME
Definition: cropAndResizePlugin.cpp:32
nvinfer1::ProfilingVerbosity::kDEFAULT
@ kDEFAULT
Register layer names in NVTX message field.
polygraphy.backend.trt.runner.TrtRunner.deactivate_impl
def deactivate_impl(self)
Definition: backend/trt/runner.py:176
tests.quant_conv_transposed_test.TestQuantConvTranspose1D.test_fake_quant_per_channel_other_prec
def test_fake_quant_per_channel_other_prec(self)
Definition: quant_conv_transposed_test.py:449
onnx_graphsurgeon.ir.graph.Graph.__deepcopy__
def __deepcopy__(self, memo)
Definition: graph.py:447
FCPlugin::~FCPlugin
~FCPlugin()
Definition: samples/opensource/samplePlugin/fcPlugin.h:81
nvinfer1::plugin::FlattenConcat::mInputConcatAxis
std::vector< int > mInputConcatAxis
Definition: flattenConcat.h:100