27,179 questions
Advice
0
votes
0
replies
30
views
Does a randomly shifted artificial border harm network training?
Hello everyone, I would like to request some information on best practices for training a neural network for image segmentation.
I am conflicted between two approaches. Let's assume I have patches of ...
Advice
1
vote
6
replies
145
views
AI Neural Network | C++ | Project Guidance --- Matrix Part
I’m an undergraduate CS student working on a final project due in about a month, and I’m trying to design and implement a C++-based AI Neural Network Simulator integrated into a small game environment....
Advice
0
votes
2
replies
70
views
Do you use raw pytorch or some wrapper or such?
I recently started a course on machine learning as part of my master's that is apparently essentially a course in PyTorch. Since starting the course I have learned that PyTorch is actually used a ...
Tooling
2
votes
0
replies
52
views
What are good books to read on Bayesian learning?
It seems like Bayesian learning is now the go-to method for reducing overfitting in ML models. I've read Manning's NLP in Action, Deep Learning with Python, O'Reilly Deep Learning foundations, Hands on ...
Advice
0
votes
0
replies
36
views
Regarding MONAI's WarmupCosineSchedule with AdamW: should scheduler.step() be called per batch, and does the optimizer lr define the peak lr?
I am training a PyTorch segmentation model and using:
torch.optim.AdamW
monai.optimizers.WarmupCosineSchedule
My optimizer:
optimizer = torch.optim.AdamW(
model.parameters(),
lr=1e-4,
...
-1
votes
1
answer
42
views
Why does my LSTM time series prediction show periodic sharp downward spikes on test data?
I am trying to predict Google's stock price using an LSTM model in PyTorch.
However, after training my model and plotting the predicted results against the real values, I see periodic sharp downward spikes.
...
Advice
1
vote
0
replies
86
views
Book recommendation for PyTorch
I am looking for a book on PyTorch that is suitable for beginners. I've used sklearn in the past for ML; it's a simple workflow for me: prepare the X and Y data, fit/train a model, and make ...
Advice
0
votes
8
replies
87
views
Variational Autoencoders Visualisation
I am unable to visualise how variational autoencoders work graphically. I have understood the equations but am unable to figure out how this would all translate visually, like what is the purpose of P(...
Advice
2
votes
1
replies
145
views
Need guidance for AI/ML
I am looking to grow my career in the AI/ML field, but I am currently feeling a bit confused about the right direction. At the moment, I am learning Python, NumPy, and Pandas.
Could someone please ...
Best practices
0
votes
3
replies
48
views
How to detect mislabeled images in a directory-based image classification dataset?
What is the best way to find mislabeled images in a dataset where labels are based on folders?
Each class is already separated into its own directory, but some images inside are clearly in the wrong ...
Best practices
0
votes
7
replies
107
views
What is the best way to detect lines in historical book pages?
I am working on an OCR project and need to create a dataset consisting of approximately 1247 pages from 6 books. I need to crop the images line by line and transcribe the text for training a model. ...
3
votes
0
answers
401
views
Training Tesseract to decode the Epstein Files
The Department of Justice has recently released Volumes 09 and 10 of the Epstein files. Among them is a PDF: https://www.justice.gov/epstein/files/DataSet%209/EFTA01012650.pdf
This PDF contains ...
-1
votes
0
answers
82
views
Is the following problem an issue with Google Colab or an error in the code?
I am trying to follow "Deep Learning with Python" by François Chollet.
I have copied and pasted the code at the end of this post from the GitHub page into Colab. I am getting a very different, ...
Advice
0
votes
1
replies
60
views
Can V-JEPA be used to detect audience engagement during a seminar from live video?
I am experimenting with the V-JEPA model developed by Meta for video understanding.
My goal is to analyze a live video stream of people attending a seminar and determine their engagement level (for ...
1
vote
1
answer
34
views
How to apply SHAP explanations to a CNN+BiLSTM model trained on severely imbalanced network intrusion detection data?
I am building an explainable Intrusion Detection System for IoT networks using a hybrid CNN+BiLSTM+Multi-Head Self-Attention architecture in Keras/TensorFlow. My dataset is CIC-BoT-IoT which has ...