Added a timeout parameter to the parse function #80

JPFrancoia · 2016-10-09T10:18:14Z

The default is set to 30 seconds.

I only saw #77 after I made my corrections. However PR 77 introduces a hardcoded parameter, and does not respect the API of feedparser.

I recommend using this PR instead of 77.

Usage:

    feed = feedparser.parse("http://feeds.rsc.org/rss/cc", timeout=1)

But old syntax will still work, of course:

    feed = feedparser.parse("http://feeds.rsc.org/rss/cc")

30 seconds.

peterashwell · 2016-10-24T20:58:31Z

cool 👍

rhoog · 2017-02-23T16:21:32Z

👍

kut · 2017-06-28T15:51:21Z

Agreed this would be awesome. should the default timeout actually be -1 so existing usages of feedparser.parse would behave exactly the same?

I've had to build a workaround using the requests library in the meantime, would love to switch back to using only feedparser, but there's no other clean way besides fussing with global/thread timeout.

keithfma · 2017-08-31T15:03:43Z

This feature would be very useful -- any plans to merge?

JPFrancoia · 2017-08-31T20:35:30Z

I think this repo is dead, isn't it ? This PR is one year old.

kurtmckee · 2018-12-22T04:08:47Z

My plan is to remove custom HTTP code from feedparser at some point in the future. Therefore I don't want to encourage people to depend on HTTP features in feedparser and would instead suggest using far more robust libraries like requests.

I have consistently rejected adding timeouts to feedparser. Please consider using a library like requests instead. 😄

peterashwell · 2018-12-22T21:54:51Z

ok, but dont expect people to find your library very useful when it hangs their scripts and programs

…

On Fri, Dec 21, 2018 at 11:08 PM Kurt McKee ***@***.***> wrote: My plan is to remove custom HTTP code from feedparser at some point in the future. Therefore I don't want to encourage people to depend on HTTP features in feedparser and would instead suggest using far more robust libraries like requests. I have consistently rejected adding timeouts to feedparser. Please consider using a library like requests instead. 😄 — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#80 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AAo7p5Nlxc3xkUtDUPRpMrMw69SJVfvoks5u7bBQgaJpZM4KR9zK> .

JPFrancoia · 2018-12-23T10:43:15Z

@kurtmckee so what's your advice on this one? use requests to fetch the page and then parse it with feedparser? In this case what's the point of feedparser, we could do it with any other xml parser, no?

buhtz · 2018-12-23T13:13:14Z

@peterashwell I think you missunderstood Kurts answer.

@JPFrancoia Your are part right. But using another xml parser would cause you a lot more work. Because feedparser does more then just parsing the xml. It interprete the data in the context of feeds.

IMO this packages is essential for me (currently working on a feedreader).

kurtmckee · 2018-12-23T15:39:19Z

@JPFrancoia, I do recommend using a strong HTTP client library like requests. feedparser's HTTP client was written at a time when the only game in town was the standard library, and it was painful to interact with. Adding an HTTP client to feedparser was really helpful to people.I It's been a decade and a half, and libraries have emerged with great features and usability, so I recommend using those libraries. requests is a good option for sure! feedparser's strength is not its combination of HTTP and XML; feedparser's strength is its ability to handle real-world edge cases, including mangled XML that compliant XML parsers would choke on and reject.

…

On December 23, 2018 10:43:15 AM UTC, JPFrancoia ***@***.***> wrote: @kurtmckee so what's your advice on this one? use requests to fetch the page and then parse it with feedparser? In this case what's the point of feedparser, we could do it with any other xml parser, no? -- You are receiving this because you were mentioned. Reply to this email directly or view it on GitHub: #80 (comment)

introspectionism · 2018-12-25T19:24:03Z

@kurtmckee
it sounds like a good idea to me, to remove the custom-made http code from feedparser. then the code base will get leaner. the focus is to parse the content, not doing http requests.

just re-wrote my implementation to use urllib.request to fetch the data, instead of parsing the data from a remote url.

feedparser is great at parsing feeds, not at fetching data. Following its author's recommendation[1], we are no longer relying on feedparser to fetch the feeds, and instead we use 'requests'. [1] kurtmckee/feedparser#80 Closes: #12

adbenitez · 2021-08-15T18:58:38Z

I had issues with connection hanging forever, my recommendation is to remove then the broken HTTP client, and expect the data to be passed directly to feedparser.parse(), otherwise fix the thing, it doesn't makes sense to ship something broken.

(btw, thanks for this library, it is really helpful :)

buhtz · 2021-08-16T10:27:01Z

@kurtmckee I know that there is currently no time schedule for removing the HTTP-request part from feedparser. But maybe it is a good idea to add a deprecation warning to the next release if someone use it.

…e/feedparser#80) as per feedparser author and pass data only to feedparser. Add exceptions to catch timeouts and sites having network connections not working

Added a timeout parameter to the parse function. The default is set to

de7aceb

30 seconds.

JPFrancoia mentioned this pull request Oct 9, 2016

Add default timeout of 30s to requests #77

Closed

This was referenced Oct 26, 2017

Added timeout argument #102

Closed

Maintained? #108

Closed

kurtmckee closed this Dec 22, 2018

marado mentioned this pull request Nov 16, 2019

feeparser hangs marado/hattai-fortune#12

Closed

membralala mentioned this pull request Jul 3, 2021

Minimal solution for RSS timeout exception digitalfabrik/integreat-cms#881

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added a timeout parameter to the parse function #80

Added a timeout parameter to the parse function #80

JPFrancoia commented Oct 9, 2016

peterashwell commented Oct 24, 2016

rhoog commented Feb 23, 2017

kut commented Jun 28, 2017

keithfma commented Aug 31, 2017

JPFrancoia commented Aug 31, 2017

kurtmckee commented Dec 22, 2018

peterashwell commented Dec 22, 2018 via email

JPFrancoia commented Dec 23, 2018

buhtz commented Dec 23, 2018

kurtmckee commented Dec 23, 2018 via email

introspectionism commented Dec 25, 2018

adbenitez commented Aug 15, 2021 •

edited

buhtz commented Aug 16, 2021

Added a timeout parameter to the parse function #80

Added a timeout parameter to the parse function #80

Conversation

JPFrancoia commented Oct 9, 2016

peterashwell commented Oct 24, 2016

rhoog commented Feb 23, 2017

kut commented Jun 28, 2017

keithfma commented Aug 31, 2017

JPFrancoia commented Aug 31, 2017

kurtmckee commented Dec 22, 2018

peterashwell commented Dec 22, 2018 via email

JPFrancoia commented Dec 23, 2018

buhtz commented Dec 23, 2018

kurtmckee commented Dec 23, 2018 via email

introspectionism commented Dec 25, 2018

adbenitez commented Aug 15, 2021 • edited

buhtz commented Aug 16, 2021

adbenitez commented Aug 15, 2021 •

edited