27,258 questions
-3
votes
0
answers
28
views
How to generate and verify an INT8 calibration cache for trtexec on Jetson Nano (TensorRT 8.2.1.8) [closed]
I have a custom trained model converted to .engine file using trtexec for both FP32, FP16 on Jetson Nano and have also used them for inference(example code https://github.com/NVIDIA-AI-IOT/...
2
votes
2
answers
69
views
Decoder only model AI making repetitive responses
I am making a Decoder only transformer using Pytorch and my dataset of choice is the fullEnglish dataset from kaggle Plaintext Wikipedia (full English).
The problem is that my model output is ...
2
votes
1
answer
25
views
AttributeError: 'NoneType' object has no attribute 'blocks' when running Cache-DiT example with Wan2.2 model
I’m trying to use
Cache-DiT
to accelerate inference for the Wan2.2 model.
However, when I run the example script,
python run_wan_2.2_i2v.py --steps 28 --cache
I get the following error.
Namespace(...
0
votes
1
answer
33
views
Pretrained ESRGAN (.pb) gives reddish or purple image — is this a preprocessing issue or model issue?
I'm trying to use a pretrained ESRGAN model that I downloaded in .pb format.
The model runs without errors, but the output image has a noticeable reddish/purple tint instead of the correct colors.
...
0
votes
0
answers
38
views
Utilizing GPU with RNN models which takes it's output as input [torch]
I have a machine-translation model. In this model, I calculate a vector for a given sentence and I take this vector, aggregate with each generated output of RNN and put it into RNN again for ...
1
vote
0
answers
18
views
Why does the same YOLOv8n-pose model with different weights have significantly different inference speeds?
I’m testing YOLOv8n-pose models that share the exact same architecture, input size, hardware (GPU), framework, batch size, and precision settings. The only difference between them is the trained ...
1
vote
1
answer
118
views
Torch Conv2d results in both dimensions convolved
I have input shape to a convolution (50, 1, 7617, 10). Here, 7617 is word vectors as rows, and 10 is the number of words in columns. I want to convolve column-wise and obtain (2631, 1, 7617, 1), 1 ...
0
votes
0
answers
55
views
Avoid overlap of bipartite network nodes in ggraph
I'm plotting a bipartite (two-mode) network using igraph and ggraph.
But the nodes are overlapping a lot, even though there is still space in the graphic window.
I would like to plot this using ggraph,...
0
votes
0
answers
78
views
Kohya-SS SDXL LoRA Training Resets Steps Despite Successful State Loading
I am running SDXL LoRA training using Kohya's sd-scripts and accelerate. I have enabled --save_state and am trying to resume training, but the training steps always reset to 0, even though the log ...
0
votes
0
answers
80
views
Trouble configuring R-group substitution in REINVENT 4 (AstraZeneca) — validation errors for RLConfig and ScorerConfig
I’m using AstraZeneca’s REINVENT 4 (v4.6.27) to generate SMILES from a scaffold via R-group substitution, optimizing for 5-HT2A / D2 / 5-HT1A (maximize) and minimizing H1 / M1 / α1A, with DockStream ...
0
votes
1
answer
97
views
ValueError: Only instances of keras.Layer can be added to a Sequential model when using TensorFlow Hub KerasLayer
I’m trying to build a Keras Sequential model using a feature extractor from TensorFlow Hub, but I’m running into this error:
ValueError: Only instances of `keras.Layer` can be added to a Sequential ...
0
votes
1
answer
177
views
Getting “Sizes of tensors must match” error when using ComfyUI WanVideoWrapper (wan2.2) to generate video
I am trying to generate a video using Wan 2.2. My goal is to take a motion sequence from an input video and a single reference image, and then generate a new video where the character in the reference ...
2
votes
1
answer
113
views
Keras Model throwing Error while integrating with frontend
I trained a model on Colab for my final year project EfficientNetB0. After all the layer training, I tested it and its result was excellent, but now I want to integrate the model to the frontend web ...
0
votes
1
answer
117
views
Preventing GPU memory leak due to a custom neural network layer
I am using the MixStyle methodology for domain adaptation, and it involves using a custom layer that is inserted after every encoder stage. However, it is causing VRAM to grow linearly, which causes ...
3
votes
0
answers
73
views
Multimodel for image captioning with CNN and LSTM over flickr30k does not learn. How to fuse image features and word embeddings?
I'm working on an image captioning project using a simple CNN + LSTM architecture, as required by the course I'm studying. The full code is available here on GitHub (note: some parts are memory-...