Skip to main content
-2 votes
0 answers
16 views

Improving the training and f1, dice and IoU score of U-net architecture model

I am training a semantic segmentation model, with U-net architecture with encoders EfficientNet and MobileNet and the fi , dice and IoU are like 0.71,0.71 and 0.54 respectively. I am doing the ...
Sillyasker's user avatar
1 vote
1 answer
118 views

Torch Conv2d results in both dimensions convolved

I have input shape to a convolution (50, 1, 7617, 10). Here, 7617 is word vectors as rows, and 10 is the number of words in columns. I want to convolve column-wise and obtain (2631, 1, 7617, 1), 1 ...
cuneyttyler's user avatar
  • 1,395
0 votes
0 answers
131 views

Installation error while installing GroundingDino

I am trying to install the GroundingDino as instructed in the README file of their official GitHub repo, but I am facing the error below: Obtaining file:///home/kgupta/workspace/Synthetic_Data_gen/...
Mahfuzur Mahim Rahman's user avatar
0 votes
1 answer
103 views

Why does a LSTM pytorch model yield constant values?

I am training a LSTM model with data from yfinance. The process is really standard. I get the data with yf.download(ticker=ticker) where ticker='AAPL and do df.rolling(30, min_periods=1) to smooth the ...
franjefriten's user avatar
0 votes
1 answer
117 views

Preventing GPU memory leak due to a custom neural network layer

I am using the MixStyle methodology for domain adaptation, and it involves using a custom layer that is inserted after every encoder stage. However, it is causing VRAM to grow linearly, which causes ...
Vedant Dalimkar's user avatar
-3 votes
1 answer
82 views

Can I visualize a neural network’s loss landscape to see if it’s stuck in a bad minimum? Any code example for this? [closed]

So, I’m trying to understand why sometimes neural networks get stuck during training. I heard people talk about ‘local minima’ and ‘saddle points,’ but I can’t really picture them. I want to actually ...
prithvisyam's user avatar
0 votes
0 answers
72 views

KFold cross-validation in Keras: model not resetting between folds (MobileNet backbone)

I am trying to perform KFold cross-validation on a Keras model. The first fold runs exactly as expected, but from the second fold onwards the model doesn’t seem to reset. The training behaves ...
pd_prince's user avatar
2 votes
0 answers
162 views

TensorFlow/Keras model accumulates system and GPU RAM during training

I am training a model using TensorFlow/Keras using TensorFlow 2.19.0/Keras 3.10.0. During training, I monitor nvidia-smi and top, and the system RAM and the GPU RAM increase during the training period....
CopyOfA's user avatar
  • 931
0 votes
1 answer
77 views

Differentiable weight setting in flax NNX

I'm doing some experiments with Flax NNX (not Linen!). What I'm trying to do is compute the weights of a network using another network: A hypernetwork receives some input parameters W and outputs a ...
Riccardo Rota's user avatar
3 votes
1 answer
112 views

Neural Network built from scratch using numpy isn't learning

I'm building a neural network from scratch using only Python and numpy, It's meant for classifying the MNIST data set, I got everything to work but the network isn't really learning, at epoch 0 it's ...
buzzbuzz20xx's user avatar
0 votes
1 answer
32 views

Model with ResNet blocks stuck at low accuracy

I am trying to implement classification of ECG segments from PTB-XL database (https://physionet.org/content/ptb-xl/1.0.3/). The architecture of the model which I am using is: import torch import torch....
zion's user avatar
  • 9
0 votes
0 answers
65 views

Building NN from scratch, why does my NN not memorize a small sample size of training data? It ends up being a class distribution

No matter which input I give it after training, it still spits the class distribution.. whereas if I just remove the hidden layer and use a single layer nn, it works much better. I know the proper ...
Turtleishly's user avatar
0 votes
0 answers
66 views

Tensorflow speed of tf.nn.conv2D used instead of opencv GaussianBlur

I'm trying to move some computer vision tasks to tensorflow. The most intensive ops are convolutions, like GaussianBlur. The timings I get using timeit suggest that the GPU equivalent is >10 x ...
Frank_Coumans's user avatar
-1 votes
1 answer
58 views

Randomized Search CV with Neural Network

I want to tune a neural network but since i would be doing a lot of test runs i decided to use Randomized search CV I first started by freeing up some space. # Reseting layers and freeing up some ...
Gilbert Quarshie's user avatar
2 votes
1 answer
91 views

Error forming mini-batch for network input

I am trying to predict vehicle trajectory, using t-30 data sequence to predict until t+30 trajectory. The data I have created is an neighbour occupancy matrix for each frames for each cars, this data ...
Barbaros Teoman Kosoglu's user avatar

15 30 50 per page
1
2 3 4 5
1335