1 vote
3 answers
2k views

I have a natural-language dataset on which I am performing classification. The TF-IDF model always performs better than a convolutional neural network model. I have performed a lot of hyperparameter ...
Advice
0 votes
0 replies
30 views

Hello everyone, I would like to request some information on best practices for training a neural network for image segmentation. I am conflicted between two approaches. Let's assume I have patches of ...
Advice
0 votes
8 replies
87 views

I am unable to visualise how variational autoencoders work graphically. I have understood the equations but am unable to figure out how this would all translate visually, like what is the purpose of P(...
Advice
0 votes
0 replies
36 views

I am training a PyTorch segmentation model using torch.optim.AdamW and monai.optimizers.WarmupCosineSchedule. My optimizer: optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4, ...
4 votes
1 answer
8k views

I am new to attention mechanisms and I want to learn more about them by working through some practical examples. I came across a Keras implementation of multi-head attention on this website: PyPI keras ...
Tooling
0 votes
0 replies
50 views

I am working on a project where I need to extract Arabic text from images and PDF documents. Currently, I am using Surya OCR, but I would like to explore other alternatives that might provide better ...
3 votes
1 answer
1k views

Context: I am currently running some experiments with LSTMs / GRUs in Keras. Anyhow, the following questions also relate to the general functionality of these networks, which means an answer does not ...
Advice
0 votes
2 replies
70 views

I recently started a course about machine learning as part of my master's degree that is apparently essentially a course in PyTorch. Since starting the course, I have learned that PyTorch is actually used a ...
3 votes
0 answers
401 views

The Department of Justice has recently released Volumes 09 and 10 of the Epstein files. Among them is a PDF: https://www.justice.gov/epstein/files/DataSet%209/EFTA01012650.pdf This PDF contains ...
Advice
1 vote
6 replies
145 views

I’m an undergraduate CS student working on a final project due in about a month, and I’m trying to design and implement a C++-based AI Neural Network Simulator integrated into a small game environment....
Advice
2 votes
1 reply
145 views

I am looking to grow my career in the AI/ML field, but I am currently feeling a bit confused about the right direction. At the moment, I am learning Python, NumPy, and Pandas. Could someone please ...
2 votes
2 answers
854 views

I am using a feature-column dataset in my code. In newer versions of TensorFlow (2.16.1 and later), there is no keras.layers.DenseFeatures class to prepare the input layer for the DNN. What is the ...
Tooling
2 votes
0 replies
52 views

It seems like Bayesian learning is now the go-to method for reducing overfitting in ML models. I've read Manning's NLP in Action, Deep Learning with Python, O'Reilly Deep Learning foundations, Hands on ...
-1 votes
1 answer
42 views

I am trying to predict Google's stock price using an LSTM model in PyTorch. However, after training my model and plotting the predicted results against the real values, I see periodic sharp downward spikes. ...
Best practices
0 votes
0 replies
39 views

I am training a WGAN-GP on the EuroSAT dataset, split into train/val/test sets of 18900/4050/4050 images. Since FID scores are widely used to evaluate GANs for image generation, I based my hyperparameter search on ...
