Newest 'deep-learning' Questions

-3 votes

0 answers

28 views

How to generate and verify an INT8 calibration cache for trtexec on Jetson Nano (TensorRT 8.2.1.8) [closed]

I have a custom trained model converted to .engine file using trtexec for both FP32, FP16 on Jetson Nano and have also used them for inference(example code https://github.com/NVIDIA-AI-IOT/...

mehrul13

1

asked yesterday

2 votes

2 answers

69 views

Decoder only model AI making repetitive responses

I am making a Decoder only transformer using Pytorch and my dataset of choice is the fullEnglish dataset from kaggle Plaintext Wikipedia (full English). The problem is that my model output is ...

Kirito

13

asked Oct 29 at 14:32

2 votes

1 answer

25 views

AttributeError: 'NoneType' object has no attribute 'blocks' when running Cache-DiT example with Wan2.2 model

I’m trying to use Cache-DiT to accelerate inference for the Wan2.2 model. However, when I run the example script, python run_wan_2.2_i2v.py --steps 28 --cache I get the following error. Namespace(...

傅靖茹

51

asked Oct 27 at 9:21

0 votes

1 answer

33 views

Pretrained ESRGAN (.pb) gives reddish or purple image — is this a preprocessing issue or model issue?

I'm trying to use a pretrained ESRGAN model that I downloaded in .pb format. The model runs without errors, but the output image has a noticeable reddish/purple tint instead of the correct colors. ...

Ahmed Almakki

1

asked Oct 20 at 15:54

0 votes

0 answers

38 views

Utilizing GPU with RNN models which takes it's output as input [torch]

I have a machine-translation model. In this model, I calculate a vector for a given sentence and I take this vector, aggregate with each generated output of RNN and put it into RNN again for ...

cuneyttyler

1,395

asked Oct 15 at 14:20

1 vote

0 answers

18 views

Why does the same YOLOv8n-pose model with different weights have significantly different inference speeds?

I’m testing YOLOv8n-pose models that share the exact same architecture, input size, hardware (GPU), framework, batch size, and precision settings. The only difference between them is the trained ...

Hạnh Nhi Đỗ

11

asked Oct 15 at 10:15

1 vote

1 answer

118 views

Torch Conv2d results in both dimensions convolved

I have input shape to a convolution (50, 1, 7617, 10). Here, 7617 is word vectors as rows, and 10 is the number of words in columns. I want to convolve column-wise and obtain (2631, 1, 7617, 1), 1 ...

cuneyttyler

1,395

asked Oct 12 at 5:34

0 votes

0 answers

55 views

Avoid overlap of bipartite network nodes in ggraph

I'm plotting a bipartite (two-mode) network using igraph and ggraph. But the nodes are overlapping a lot, even though there is still space in the graphic window. I would like to plot this using ggraph,...

mmmap

67

asked Oct 7 at 12:51

0 votes

0 answers

78 views

Kohya-SS SDXL LoRA Training Resets Steps Despite Successful State Loading

I am running SDXL LoRA training using Kohya's sd-scripts and accelerate. I have enabled --save_state and am trying to resume training, but the training steps always reset to 0, even though the log ...

Akash Chaudhari

21

asked Oct 5 at 14:01

0 votes

0 answers

80 views

Trouble configuring R-group substitution in REINVENT 4 (AstraZeneca) — validation errors for RLConfig and ScorerConfig

I’m using AstraZeneca’s REINVENT 4 (v4.6.27) to generate SMILES from a scaffold via R-group substitution, optimizing for 5-HT2A / D2 / 5-HT1A (maximize) and minimizing H1 / M1 / α1A, with DockStream ...

Reuben Udohaya

1

asked Sep 30 at 15:39

0 votes

1 answer

97 views

ValueError: Only instances of keras.Layer can be added to a Sequential model when using TensorFlow Hub KerasLayer

I’m trying to build a Keras Sequential model using a feature extractor from TensorFlow Hub, but I’m running into this error: ValueError: Only instances of `keras.Layer` can be added to a Sequential ...

user31600948

1

asked Sep 30 at 9:02

0 votes

1 answer

177 views

Getting “Sizes of tensors must match” error when using ComfyUI WanVideoWrapper (wan2.2) to generate video

I am trying to generate a video using Wan 2.2. My goal is to take a motion sequence from an input video and a single reference image, and then generate a new video where the character in the reference ...

hongxigoo

11

asked Sep 30 at 5:04

2 votes

1 answer

113 views

Keras Model throwing Error while integrating with frontend

I trained a model on Colab for my final year project EfficientNetB0. After all the layer training, I tested it and its result was excellent, but now I want to integrate the model to the frontend web ...

Narendra Patne

21

asked Sep 30 at 2:41

0 votes

1 answer

117 views

Preventing GPU memory leak due to a custom neural network layer

I am using the MixStyle methodology for domain adaptation, and it involves using a custom layer that is inserted after every encoder stage. However, it is causing VRAM to grow linearly, which causes ...

Vedant Dalimkar

3

asked Sep 28 at 15:00

3 votes

0 answers

73 views

Multimodel for image captioning with CNN and LSTM over flickr30k does not learn. How to fuse image features and word embeddings?

I'm working on an image captioning project using a simple CNN + LSTM architecture, as required by the course I'm studying. The full code is available here on GitHub (note: some parts are memory-...

Malihe Mahdavi sefat

473

asked Sep 27 at 15:34

Collectives™ on Stack Overflow

How to generate and verify an INT8 calibration cache for trtexec on Jetson Nano (TensorRT 8.2.1.8) [closed]

Decoder only model AI making repetitive responses

AttributeError: 'NoneType' object has no attribute 'blocks' when running Cache-DiT example with Wan2.2 model

Pretrained ESRGAN (.pb) gives reddish or purple image — is this a preprocessing issue or model issue?

Utilizing GPU with RNN models which takes it's output as input [torch]

Why does the same YOLOv8n-pose model with different weights have significantly different inference speeds?

Torch Conv2d results in both dimensions convolved

Avoid overlap of bipartite network nodes in ggraph

Kohya-SS SDXL LoRA Training Resets Steps Despite Successful State Loading

Trouble configuring R-group substitution in REINVENT 4 (AstraZeneca) — validation errors for RLConfig and ScorerConfig

ValueError: Only instances of keras.Layer can be added to a Sequential model when using TensorFlow Hub KerasLayer

Getting “Sizes of tensors must match” error when using ComfyUI WanVideoWrapper (wan2.2) to generate video

Keras Model throwing Error while integrating with frontend

Preventing GPU memory leak due to a custom neural network layer

Multimodel for image captioning with CNN and LSTM over flickr30k does not learn. How to fuse image features and word embeddings?

Hot Network Questions