Skip to main content

Questions tagged [computer-vision]

Computer Vision is a subfield of computer science which deals with analyzing and understanding images. This includes detection of objects like faces in images or segmenting images.

0 votes
0 answers
18 views

Is object aspect ratio truly important for resize robustness, or is this suggestion based on a misunderstanding — e.g., treating a very wide object as if it is “bigger” or “pixel-richer” than a square ...
vinvin's user avatar
  • 1
0 votes
1 answer
305 views

I'm trying to distill a YOLO11x detection model into a YOLO11n for inference speed improvements without sacrificing too much detection performance. For this, I just overloaded some functions in the ...
Simon Hergott's user avatar
3 votes
1 answer
62 views

I have ~1,000 pictures like this. I really want the long, thin rock core in the middle, but they all differ (slightly) in angle, have different lengths, shaped ends and rock colours vary. I tried ...
user24007's user avatar
  • 133
2 votes
0 answers
68 views

I am training an DensNet model on medical dataset which has gold standards as per annotation. After training i noticed accuracy is just 60%. Later i performed following changes but still no luck. ...
NIrbhay Mathur's user avatar
2 votes
0 answers
63 views

I'm working on a Raspberry Pi 4–based project involving the MLX90640 thermal camera breakout. The camera outputs a thermal heat map (a low-resolution infrared image of 32x24 pixels). My goal is to ...
Zak A's user avatar
  • 21
5 votes
1 answer
135 views

I’m working on an object detection system and I'm new to this field. Here i'm talking with respect to camera point of view. When a object is detected which is far from the camera, it appears small and ...
Basavaraj Kittali's user avatar
1 vote
1 answer
82 views

My requirement: Need to extract license plates without duplicates and store images in a folder,then apply ocr to extract text from images. What i have achieved: Iam able to detect license plates ...
Raj's user avatar
  • 11
0 votes
0 answers
43 views

I am working on my first object detection project and need to implement multi-object detection using ResNet-18 (I am restricted to using this architecture). My dataset follows the COCO format and ...
Daniel's user avatar
  • 11
0 votes
0 answers
39 views

I am working on 6D pose tracking, where the goal is to estimate how 3D position and orientation of an object changes from frame t-1 to t. Train/validation datasets are synthetic and come from a single ...
zak's user avatar
  • 81
0 votes
0 answers
52 views

I have an image of a one-line substation schema diagram that includes various components (like transformers, circuit breakers, etc.) and the connections between them. I’m looking for a way to convert ...
Necrosis's user avatar
1 vote
0 answers
29 views

I am using a 6D continuous rotation representation (e.g., two orthogonal vectors from a 3×3 rotation matrix) to predict camera rotations in panoramic video sequences. Since panoramic videos involve ...
yep123's user avatar
  • 111
1 vote
0 answers
45 views

I am currently building my first CNN network on my own for a regression task for which the network must predict the coordinates I am looking at on my screen based on an input image taken through my ...
bebel's user avatar
  • 175
0 votes
1 answer
41 views

I'm looking for images dataset which have multiple images per instance. For example, healthcare dataset, where each person is classified with a diagnosys and have several images describing them.
J. Doe's user avatar
  • 101
1 vote
0 answers
38 views

I have an example of a generated image containing words, as well as several red arrows pointing to certain characters. I need to get these characters from GPT, but when I ask "what characters do ...
user175111's user avatar
2 votes
0 answers
60 views

I've a large multi-class object detection image dataset. The goal is to use a Yolo(v11) model to be trained on the aforementioned dataset to solve the object detection task. My intuition says that the ...
Ramiro Hum-Sah's user avatar

15 30 50 per page
1
2 3 4 5
41