Newest 'rss+python+xml' Questions

0 votes

3 answers

777 views

UnicodeEncodeError: 'utf-8' codec can't encode character '\ud83c' in position 0: surrogates not allowed

I am trying to parse "https://tre.tbe.taleo.net/tre01/ats/servlet/Rss?org=arobpers2&cws=42" but I am getting the error "UnicodeEncodeError: 'utf-8' codec can't encode character '\...

Asher Ross

195

asked Jun 3, 2024 at 6:23

1 vote

1 answer

114 views

Unsuccessful in using scrapy to load an already filtered RSS feed

For reference see my code below: import scrapy headers = \ {'Host': 'log.rlsbb.cc', 'User-Agent': 'Mozilla/5.0 (X11; Linux x86_64; rv:109.0) Gecko/20100101 Firefox/110.0', 'Accept': 'text/html,...

Ozooha Ozooha

83

asked Feb 22, 2023 at 2:22

4 votes

3 answers

6k views

Reading RSS feed in Python

I am trying to obtain the first title.text from this RSS feed: https://www.mmafighting.com/rss/current. The feed is up to date and operational. However, when I use the following code, it appears the ...

Dalej400

73

asked Jan 26, 2023 at 16:36

0 votes

0 answers

88 views

How do I write HTML code in Django's syndication feed framework?

I'm using Django's syndication feed framework to generate RSS for my website, referring to this document I have done the work according to the example it provides, the code is similar to the following,...

Yarin Zhang

1

asked Nov 14, 2022 at 8:56

0 votes

0 answers

42 views

I'm trying to store pubdate tag of xml into database using python. I'm using beautifulsoup for web crawler

<pubDate> <![CDATA[ Wed, 17 Aug 2022 14:32:47 +0530 ]]></pubDate> Above is the xml tag now how can I store this date tag into dbms? from bs4 import BeautifulSoup import requests ...

ASTHA JAIN

1

asked Aug 17, 2022 at 16:15

0 votes

2 answers

82 views

How do I return the first link in a non-list output

I am attempting to return only the first url that pops up when scraping "https://www.sec.gov/cgi-bin/browse-edgar?action=getcurrent&CIK=&type=8-k&company=&dateb=&owner=include&...

pilotso

39

asked Jul 18, 2022 at 15:55

1 vote

2 answers

71 views

How to scrape keywords that change every time?

I am trying to scrape a keyword in an xml document with BeautifulSoup but am unsure how to do so. The xml document contains "Central Index Key," which changes each time for each document ...

pilotso

39

asked Jul 15, 2022 at 18:24

1 vote

1 answer

158 views

Web scraper does not update/loop properly

I am trying to make a web scraper that refreshes infinitely every 5 seconds to update the output window with a new article with specific keywords when it is posted. However, this code only refreshes ...

pilotso

39

asked Jul 8, 2022 at 15:13

0 votes

1 answer

73 views

Return for specific title keyword with beautifoulsoup

I'm trying to create a web scraper that returns articles only if there is a certain keyword in the title from an rss feed (xml format). However, whenever I run the code it returns blank, even if the ...

pilotso

39

asked Jul 5, 2022 at 14:09

-1 votes

1 answer

115 views

Feedparser not returning values, only metadata

I'm using feedparser to get info from a public database (https://knesset.gov.il/Odata/ParliamentInfo.svc/KNS_Bill()). Each of my entries looks as follows When accessing specific properties: url = '...

Numy

1

asked May 14, 2022 at 14:22

1 vote

0 answers

55 views

Retrieving title of most recent post in an RSS feed as quickly as possible

I am writing a Python script that depends on being able to poll an RSS feed for updates as quickly as possible. The relevant information for my purposes is contained in the title of the post. I would ...

amiller3513

165

asked Oct 28, 2021 at 21:09

0 votes

1 answer

57 views

Parse specific item in XML by id

I'm a beginner so to improve myself i'm working on those kind of things. I'm trying to get a specific rss/xml item with it's id. Live XML/RSS example is here I want to get specific blog post content ...

spancer

15

asked Dec 10, 2020 at 13:53

1 vote

1 answer

61 views

I am doing RSS feed news scrapting using python3.7. I am not get the exact information. Help me to get the proper data

Here I am trying to get the news from the RSS feed and I am not getting the exact information. I am using the requests and BeautifulSoup to achieve the goal. I have the following object. <item> ...

Mehul Dhariyaparmar

123

asked Jun 19, 2020 at 9:37

1 vote

0 answers

104 views

removing relative links from rss feed in python django

When creating the news, relative links were added to the text of the news itself by [link_name] (downloads/generic/2020.04.1/) When I load this text through the standard rss feed handler class ...

kanvull

21

asked Jun 11, 2020 at 10:32

0 votes

0 answers

51 views

Inclusive RSS parsing in Python?

I'm parsing a set of rss feeds dynamically. This is my code which works for most sites. class ParseFeeds: @staticmethod def parse(source): logger = logging.getLogger(__name__) ...

Melissa Stewart

3,635

asked Feb 11, 2019 at 0:15

Collectives™ on Stack Overflow

All Questions

UnicodeEncodeError: 'utf-8' codec can't encode character '\ud83c' in position 0: surrogates not allowed

Unsuccessful in using scrapy to load an already filtered RSS feed

Reading RSS feed in Python

How do I write HTML code in Django's syndication feed framework?

I'm trying to store pubdate tag of xml into database using python. I'm using beautifulsoup for web crawler

How do I return the first link in a non-list output

How to scrape keywords that change every time?

Web scraper does not update/loop properly

Return for specific title keyword with beautifoulsoup

Feedparser not returning values, only metadata

Retrieving title of most recent post in an RSS feed as quickly as possible

Parse specific item in XML by id

I am doing RSS feed news scrapting using python3.7. I am not get the exact information. Help me to get the proper data

removing relative links from rss feed in python django

Inclusive RSS parsing in Python?

Hot Network Questions

Collectives™ on Stack Overflow

All Questions

Related Tags