64 questions
0
votes
1
answer
124
views
Apply Quantization on a CNN
I want to apply a quantization function to a deep CNN. The CNN is used for an image-classification task (4 classes), and my data consists of 224×224 images. When I run this code, I get an error. ...
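For context, quantizing a CNN means mapping its floating-point weights and activations to 8-bit integers. A minimal, library-free sketch of the affine mapping most frameworks use (the scale and zero-point values below are illustrative, not from the question):

```python
def quantize(x, scale, zero_point, qmin=0, qmax=255):
    """Affine quantization: real value -> clamped uint8 code."""
    q = round(x / scale) + zero_point
    return max(qmin, min(qmax, q))

def dequantize(q, scale, zero_point):
    """Inverse mapping; recovers the real value up to rounding error."""
    return (q - zero_point) * scale

# Example: quantize the range [0, 1] with scale 1/255 and zero_point 0.
q = quantize(0.5, 1 / 255, 0)
x = dequantize(q, 1 / 255, 0)
```

Values outside the representable range are clamped to `qmax`, which is where quantization error concentrates if the scale is chosen poorly.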
0
votes
0
answers
58
views
Error while converting quantized Torch model to ONNX
I’m applying QAT to a YOLOv8n model with the following configuration:
QConfig(
    activation=FakeQuantize.with_args(
        observer=MovingAverageMinMaxObserver,
        quant_min=0,
        quant_max=...
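The configuration above is cut off; for context, a complete per-tensor QConfig of this shape typically looks like the following sketch (the `quant_max`, `dtype`, and weight settings here are assumptions, not the asker's actual values):

```python
import torch
from torch.ao.quantization import (
    QConfig,
    FakeQuantize,
    MovingAverageMinMaxObserver,
)

# Assumed completion: unsigned 8-bit activations, signed symmetric 8-bit weights.
qconfig = QConfig(
    activation=FakeQuantize.with_args(
        observer=MovingAverageMinMaxObserver,
        quant_min=0,
        quant_max=255,
        dtype=torch.quint8,
        qscheme=torch.per_tensor_affine,
    ),
    weight=FakeQuantize.with_args(
        observer=MovingAverageMinMaxObserver,
        quant_min=-128,
        quant_max=127,
        dtype=torch.qint8,
        qscheme=torch.per_tensor_symmetric,
    ),
)

# The factories are callables; instantiating one yields a FakeQuantize module.
fq = qconfig.activation()
out = fq(torch.randn(4))
```

ONNX export of such models usually requires converting the fake-quantized graph first, since FakeQuantize nodes themselves may not map cleanly onto ONNX operators.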
1
vote
0
answers
42
views
Quantization In Tensorflow2, Instance error
I am trying to quantize a model in TensorFlow using tfmot.
This is a sample model:
inputs = keras.layers.Input(shape=(512, 512, 1))
x = keras.layers.Conv2D(3, kernel_size=1, padding='same')(inputs)
x =...
1
vote
0
answers
109
views
How to convert a QAT (quantization aware trained) TensorFlow graph into a TFLite model?
I am quantizing a neural network using QAT and I want to convert it to TFLite.
Quantization nodes get added to the skeleton graph and we get a new graph.
I am able to load the trained QAT ...
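In the TF2/Keras QAT workflow, conversion goes through the TFLite converter. A sketch of that step, using a tiny stand-in model since the asker's trained QAT graph is not shown:

```python
import tensorflow as tf
from tensorflow import keras

# Stand-in for the trained QAT model; replace with your own loaded model.
inputs = keras.Input(shape=(8,))
outputs = keras.layers.Dense(4, activation="relu")(inputs)
qat_model = keras.Model(inputs, outputs)

# Convert to TFLite with the default optimizations enabled.
converter = tf.lite.TFLiteConverter.from_keras_model(qat_model)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
tflite_model = converter.convert()
```

The resulting `tflite_model` is a flatbuffer (bytes) that can be written to disk and loaded with `tf.lite.Interpreter`. Note that TF1-style frozen-graph QAT requires `TFLiteConverter.from_frozen_graph` instead, which is a different API path.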
0
votes
1
answer
654
views
"NotImplementedError: Could not run 'aten::add.out' with arguments from the 'QuantizedCPU' backend" while implementing QAT on resnet18 using pytorch
I am trying to implement Quantization Aware Training (QAT) on a resnet18 model. While inferring, I get this error:
NotImplementedError: Could not run 'aten::add.out' with arguments from the 'QuantizedCPU' ...
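This error typically comes from the `out += identity` skip connections in resnet18's BasicBlock: the `+` operator has no kernel on the QuantizedCPU backend. The usual fix (a sketch, not the asker's exact code) is to route the addition through `FloatFunctional`, which `convert()` swaps for a quantized add:

```python
import torch
import torch.nn as nn

class SkipAdd(nn.Module):
    """Quantization-friendly residual add for a BasicBlock-style module."""
    def __init__(self):
        super().__init__()
        # FloatFunctional is replaced by a quantized add during convert().
        self.add_op = nn.quantized.FloatFunctional()

    def forward(self, x, identity):
        # Instead of `x += identity`, which fails on the QuantizedCPU backend:
        return self.add_op.add(x, identity)

block = SkipAdd()
out = block(torch.ones(3), torch.ones(3))
```

In float (pre-convert) mode `FloatFunctional.add` behaves exactly like `torch.add`, so training code is unchanged.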
0
votes
1
answer
86
views
What does QuantizeWrapperV2 actually do?
So I am training this small CNN model which has few Conv2D layers and some MaxPool2D, Activations, Dense, basically the basic layers that Tensorflow provides.
I want it to run on an embedded system ...
0
votes
1
answer
261
views
Quantization Aware Training: ValueError: `to_quantize` can only either be a keras Sequential or Functional model
I'm trying to test Quantization Aware Training from TensorFlow Lite. The following source code creates an AI model (variable: model) trained with the MNIST dataset (just 1 epoch for testing purposes). ...
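`quantize_model` only accepts Sequential or Functional Keras models, so a subclassed model (or any other object) triggers this ValueError. A sketch of defining the model functionally so that it qualifies (the MNIST-style architecture here is assumed, not the asker's exact code):

```python
from tensorflow import keras

# A Functional model: tfmot.quantization.keras.quantize_model accepts this,
# whereas a subclassed keras.Model raises the `to_quantize` ValueError.
inputs = keras.Input(shape=(28, 28, 1))
x = keras.layers.Conv2D(8, 3, activation="relu")(inputs)
x = keras.layers.Flatten()(x)
outputs = keras.layers.Dense(10, activation="softmax")(x)
model = keras.Model(inputs, outputs)

# quantized = tfmot.quantization.keras.quantize_model(model)  # now valid
```

A `keras.Sequential` built from a plain list of layers works equally well; only subclassed models need to be rebuilt this way.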
0
votes
1
answer
941
views
ValueError: `to_quantize` can only either be a keras Sequential or Functional model
I'm currently learning TinyML with Tensorflow Lite and Tensorflow Lite for Micro.
I'm working with the book "Hands-on TinyML" from R. Banerjee.
I'm trying to quantize a model but it ...
1
vote
0
answers
70
views
ValueError: ('Expected `model` argument to be a `Model` instance, got ', <keras.engine.sequential.Sequential object at 0x7f234263dfd0>)
I want to do Quantization Aware Training. Here's my model architecture.
Model: "sequential_4"
_________________________________________________________________
Layer (type) ...
0
votes
0
answers
185
views
How to quantize a pretrained model (e.g. MobileNet)
I am using the TensorFlow Lite framework to create a quantized model for an experiment. I want to deploy this model on my Raspberry Pi, but it seems that using a pretrained model for quantizing ...
0
votes
0
answers
116
views
How to quantize a pretrained model using tensorflow lite?
I have been trying to use a pretrained model from the tensorflow.keras library, namely MobileNet.
If I try to quantize it using
tfmot.quantization.keras.quantize_model(base_model)
It gives me an error ...
1
vote
0
answers
279
views
error: 'tf.TensorListSetItem' op is neither a custom op nor a flex op while trying to quantize a model
I am trying to learn about quantization, so I was playing with a GitHub repo, trying to quantize its model into int8 format. I used the following code to quantize the model.
modelClass = DTLN_model()
...
0
votes
0
answers
205
views
Adapters after QLoRA fine-tuning on a llama architecture model reach about 2 GB, which is very far from the general trend seen online
I was fine-tuning a Llama-architecture model that supports multiple languages: English, Hindi, as well as Roman Hindi.
So I loaded the model in quantized form using bitsandbytes in nf4 form along with ...
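A quick sanity check on adapter size helps here: LoRA adapters store only two low-rank factors per adapted matrix, so for typical settings they should come to tens of MB, not 2 GB. A back-of-envelope sketch (the rank and shapes below are assumed 7B-class Llama defaults, not the asker's actual configuration):

```python
def lora_params(d_in, d_out, r):
    # Each adapted weight W (d_out x d_in) gains factors A (r x d_in)
    # and B (d_out x r), i.e. r * (d_in + d_out) trainable parameters.
    return r * (d_in + d_out)

# Assumed: rank 16 on the q/k/v/o projections (4096 x 4096) of 32 layers.
per_layer = 4 * lora_params(4096, 4096, 16)
total_params = 32 * per_layer
size_mb = total_params * 2 / 1024 ** 2  # fp16: 2 bytes per parameter
# ~32 MB under these assumptions; a 2 GB file therefore suggests that
# full-precision weights (e.g. embeddings, lm_head, or merged layers)
# were saved alongside the low-rank adapters.
```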
1
vote
0
answers
377
views
Is it possible to convert the Google MediaPipe FaceMeshV2 TFLite model with post-training quantization to a fully integer-quantized model version?
I am seeking assistance regarding the conversion of the MediaPipe FaceMeshV2 model for use with the Coral EdgeTPU Accelerator. As per the Coral documentation, a model must undergo full integer ...
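Full integer quantization for the EdgeTPU requires supplying a representative dataset at conversion time. A sketch of the standard recipe on a tiny stand-in Keras model (FaceMeshV2's real input shape, loading path, and calibration data are not reproduced here):

```python
import numpy as np
import tensorflow as tf
from tensorflow import keras

# Stand-in model; replace with the FaceMeshV2 graph being converted.
inputs = keras.Input(shape=(16, 16, 3))
outputs = keras.layers.Conv2D(4, 3, activation="relu")(inputs)
model = keras.Model(inputs, outputs)

def representative_dataset():
    # Calibration samples; ideally drawn from real input data.
    for _ in range(10):
        yield [np.random.rand(1, 16, 16, 3).astype(np.float32)]

converter = tf.lite.TFLiteConverter.from_keras_model(model)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.representative_dataset = representative_dataset
converter.target_spec.supported_ops = [tf.lite.OpsSet.TFLITE_BUILTINS_INT8]
converter.inference_input_type = tf.uint8   # EdgeTPU expects integer I/O
converter.inference_output_type = tf.uint8
tflite_model = converter.convert()
```

Whether this succeeds on FaceMeshV2 itself depends on every op in the graph having an int8 TFLite kernel; ops without one are exactly what blocks full-integer conversion.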
1
vote
1
answer
392
views
Quantization aware training Conv1D is not supported
I want to do a 1D CNN with quantization aware training, but it gives the error: keras.src.layers.convolutional.conv1d.Conv1D'> is not supported. You can quantize this layer by passing a `tfmot.quantization....
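tfmot's default quantization scheme has no built-in config for Conv1D. Besides the `quantize_annotate_layer` route the error message suggests, a common workaround (a sketch, not from the question) is to express the 1-D convolution as an equivalent Conv2D over an input with a dummy height axis, since Conv2D is supported:

```python
from tensorflow import keras

# Conv1D(filters=8, kernel_size=3) over length-64, 4-channel sequences,
# rewritten as a Conv2D with a dummy height-1 axis (quantizable by tfmot).
inputs = keras.Input(shape=(64, 4))
x = keras.layers.Reshape((1, 64, 4))(inputs)           # (batch, 1, steps, ch)
x = keras.layers.Conv2D(8, kernel_size=(1, 3), padding="same")(x)
outputs = keras.layers.Reshape((64, 8))(x)
model = keras.Model(inputs, outputs)
```

The `(1, k)` kernel slides only along the time axis, so this computes exactly what the original Conv1D would, at the cost of two Reshape layers.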