Newest 'https+python+web-scraping' Questions

0 votes

1 answer

70 views

Python Requests GET with Proxy - HTTPS scheme returns expected result but HTTP returns header

When setting any URL to use HTTPS as the scheme (i.e., https://), I get my desired response (i.e., page source), but any http url (i.e., http://) fails or I receive a header and I don't understand why ...

Simonhawk

73

asked Feb 12 at 1:20

1 vote

1 answer

110 views

ERR_TUNNEL_CONNECTION_FAILED for selenium webdriver HTTPS scraping attempt

I'm currently working on a scraping project and facing some issues with using a proxy from Webshare. This is my first time using a proxy, and I can't seem to access the target webpage. Every time I ...

Joshua Petersen

11

asked Jun 29, 2024 at 20:10

0 votes

1 answer

35 views

Taking a list from a webpage and Displaying It?

I am working on a program that would display a wide range of martial arts. I figured that web scraping would be the best way to do this so that as things grow and evolve, the program would, hopefully, ...

Nathan

1

asked Jun 4, 2024 at 23:19

0 votes

1 answer

293 views

ConnectionResetError with Python requests and urllib libraries when accessing specific URL

I'm encountering a ConnectionResetError when attempting to access a specific URL using both Python requests and urllib libraries. Despite providing appropriate headers, the connection is being ...

chandandanjo

13

asked Apr 11, 2024 at 12:53

0 votes

1 answer

46 views

Item not clickable

So I have a task I have a site and there is a button I need to click and take all the info from the next page that opens. My problem is that I cant access the item(I did some research on what the ...

Kori

1

asked May 6, 2023 at 16:56

1 vote

1 answer

113 views

How to get around anti-adblock html page while scraping using python requests library?

I'm trying to automate a clerical report that I have been assigned and a part of that involves getting the zip code from a street address. I have tried a bunch of stuff like the geopy library and I ...

snowball

155

asked Feb 7, 2023 at 15:03

1 vote

2 answers

76 views

how to parse data from a specific website

I'm trying to parse data from this page: https://rules.art/card/jmks-season-1-common You can find hereunder the basic code I'm using to try to retrieve some data (e.g. the card's name "JMK$")...

waticousin

11

asked Dec 30, 2022 at 11:54

0 votes

0 answers

69 views

Solution added to the answer: Web Scraping Python AttributeError: 'NoneType' object has no attribute 'group' Web Scraping Python

There are two python files. The file titled munilist.py looks as follows (with around 7000+ municipality name-URL pairs): munilist = ('porto-empedocle', 'https://openbilanci.it/armonizzati/bilanci/...

Pepa

159

asked Dec 5, 2022 at 1:20

0 votes

3 answers

55 views

Cannot do webscraping with Selenium because recognised as bot

I am trying to webscrape with Python "https://www.futbol24.com/" and I am recognised as bot. I tried everything, including the removal of signatures in the javascript of chromedriver.exe, or ...

Sav

156

asked Oct 27, 2022 at 8:59

0 votes

0 answers

61 views

Unsure of what this http response is (Phone Application Data Scraping)

I am trying to scrape messages off of this phone application (similar to a messaging application) and I was using burp suite to obtain https request information and all the information returned is in ...

mdssi

5

asked Aug 9, 2022 at 17:53

0 votes

1 answer

1k views

how can i reduce bandwidth usage when making hhtp/https requests to a website

I am running a small personal python service that uses a paid rotating proxy with limited bandwidth to make requests to websites and scrapes data from websites that have no api. My question is now how ...

critiqalfish

33

asked Jan 27, 2022 at 21:30

1 vote

2 answers

2k views

Python urllib request always results in Error 400: Bad Request

Thanks for reading. For a small reserach project, I'm trying to gather some data from KBB (www.kbb.com). However, I'm always getting a "urllib.error.HTTPError: HTTP Error 400: Bad Request" ...

dmort

301

asked Nov 10, 2021 at 19:14

0 votes

0 answers

44 views

Url requests not working while the flask app is hosted

I have a flask web app running a just-dial scraper code, In my code, I have to request multiple pages of the Justdial site to use it in the bs4 module to extract the data and fill it in the excel ...

Gurupra5ad

1

asked Jun 11, 2021 at 14:53

1 vote

1 answer

589 views

Extracting data from the ECB Data Warehouse

I would like to extract with Pandas a given value of the ECB Data Warehouse in this page. I have been going through the ECB help page but could not find the way to make it work. I have tried the ...

alexdico

37

asked Feb 1, 2021 at 15:46

0 votes

0 answers

83 views

Unable to scrape "https" in python 3.7 using beautifulsoup 4.6.3 (Anaconda)

Failing to scrape odds from sportsbet.com.au as it is throwing an exception. I cannot read the exception because the terminal closes too quickly. However, I believe the error is due to scraping from ...

Jacob Woodruff

1

asked Mar 13, 2019 at 0:20

Collectives™ on Stack Overflow

All Questions

Python Requests GET with Proxy - HTTPS scheme returns expected result but HTTP returns header

ERR_TUNNEL_CONNECTION_FAILED for selenium webdriver HTTPS scraping attempt

Taking a list from a webpage and Displaying It?

ConnectionResetError with Python requests and urllib libraries when accessing specific URL

Item not clickable

How to get around anti-adblock html page while scraping using python requests library?

how to parse data from a specific website

Solution added to the answer: Web Scraping Python AttributeError: 'NoneType' object has no attribute 'group' Web Scraping Python

Cannot do webscraping with Selenium because recognised as bot

Unsure of what this http response is (Phone Application Data Scraping)

how can i reduce bandwidth usage when making hhtp/https requests to a website

Python urllib request always results in Error 400: Bad Request

Url requests not working while the flask app is hosted

Extracting data from the ECB Data Warehouse

Unable to scrape "https" in python 3.7 using beautifulsoup 4.6.3 (Anaconda)

Hot Network Questions

Collectives™ on Stack Overflow

All Questions

Related Tags