20,016 questions
-2
votes
0
answers
16
views
Improving the training and f1, dice and IoU score of U-net architecture model
I am training a semantic segmentation model, with U-net architecture with encoders EfficientNet and MobileNet and the fi , dice and IoU are like 0.71,0.71 and 0.54 respectively. I am doing the ...
1
vote
1
answer
118
views
Torch Conv2d results in both dimensions convolved
I have input shape to a convolution (50, 1, 7617, 10). Here, 7617 is word vectors as rows, and 10 is the number of words in columns. I want to convolve column-wise and obtain (2631, 1, 7617, 1), 1 ...
0
votes
0
answers
131
views
Installation error while installing GroundingDino
I am trying to install the GroundingDino as instructed in the README file of their official GitHub repo, but I am facing the error below:
Obtaining file:///home/kgupta/workspace/Synthetic_Data_gen/...
0
votes
1
answer
103
views
Why does a LSTM pytorch model yield constant values?
I am training a LSTM model with data from yfinance. The process is really standard. I get the data with yf.download(ticker=ticker) where ticker='AAPL and do df.rolling(30, min_periods=1) to smooth the ...
0
votes
1
answer
117
views
Preventing GPU memory leak due to a custom neural network layer
I am using the MixStyle methodology for domain adaptation, and it involves using a custom layer that is inserted after every encoder stage. However, it is causing VRAM to grow linearly, which causes ...
-3
votes
1
answer
82
views
Can I visualize a neural network’s loss landscape to see if it’s stuck in a bad minimum? Any code example for this? [closed]
So, I’m trying to understand why sometimes neural networks get stuck during training. I heard people talk about ‘local minima’ and ‘saddle points,’ but I can’t really picture them. I want to actually ...
0
votes
0
answers
72
views
KFold cross-validation in Keras: model not resetting between folds (MobileNet backbone)
I am trying to perform KFold cross-validation on a Keras model. The first fold runs exactly as expected, but from the second fold onwards the model doesn’t seem to reset. The training behaves ...
2
votes
0
answers
162
views
TensorFlow/Keras model accumulates system and GPU RAM during training
I am training a model using TensorFlow/Keras using TensorFlow 2.19.0/Keras 3.10.0. During training, I monitor nvidia-smi and top, and the system RAM and the GPU RAM increase during the training period....
0
votes
1
answer
77
views
Differentiable weight setting in flax NNX
I'm doing some experiments with Flax NNX (not Linen!).
What I'm trying to do is compute the weights of a network using another network:
A hypernetwork receives some input parameters W and outputs a ...
3
votes
1
answer
112
views
Neural Network built from scratch using numpy isn't learning
I'm building a neural network from scratch using only Python and numpy, It's meant for classifying the MNIST data set, I got everything to work but the network isn't really learning, at epoch 0 it's ...
0
votes
1
answer
32
views
Model with ResNet blocks stuck at low accuracy
I am trying to implement classification of ECG segments from PTB-XL database (https://physionet.org/content/ptb-xl/1.0.3/). The architecture of the model which I am using is:
import torch
import torch....
0
votes
0
answers
65
views
Building NN from scratch, why does my NN not memorize a small sample size of training data? It ends up being a class distribution
No matter which input I give it after training, it still spits the class distribution.. whereas if I just remove the hidden layer and use a single layer nn, it works much better.
I know the proper ...
0
votes
0
answers
66
views
Tensorflow speed of tf.nn.conv2D used instead of opencv GaussianBlur
I'm trying to move some computer vision tasks to tensorflow. The most intensive ops are convolutions, like GaussianBlur. The timings I get using timeit suggest that the GPU equivalent is >10 x ...
-1
votes
1
answer
58
views
Randomized Search CV with Neural Network
I want to tune a neural network but since i would be doing a lot of test runs i decided to use Randomized search CV
I first started by freeing up some space.
# Reseting layers and freeing up some ...
2
votes
1
answer
91
views
Error forming mini-batch for network input
I am trying to predict vehicle trajectory, using t-30 data sequence to predict until t+30 trajectory. The data I have created is an neighbour occupancy matrix for each frames for each cars, this data ...