.c shouldn't scrape DuckDuckGo HTML #161

auscompgeek · 2015-04-03T05:21:41Z

The DuckDuckGo API returns calculations perfectly fine...

Moreover, why is it scraping HTML before trying the API?

auscompgeek · 2015-04-06T07:11:10Z

A side effect of attempting to scrape HTML before using the API: if BeautifulSoup isn't installed, it borks completely, where it could potentially get a result.

kaneda · 2015-04-06T12:46:29Z

@auscompgeek this is explicitly mentioned in the readme: https://github.com/myano/jenni

auscompgeek · 2015-04-07T12:51:41Z

@kaneda Yes, but it causes an exception, so it doesn't even attempt to get a result from the DDG API or from Wolfram|Alpha.

myano · 2015-04-29T19:47:59Z

The "HTML" is scraped first before the API, because a) many times the "HTML" pages has a simpler answer and b) sometimes it is faster. I believe there was another reason of why I originally did it that way, but I can't remember any other reasons as of now.

myano · 2015-07-08T03:37:09Z

Personally, I really wish the google results for .c were more reliable in jenni. They seem/work more reliably when done in a browser.

kaneda added Feature Low Priority labels Apr 3, 2015

myano added a commit that referenced this issue Apr 7, 2015

fixed issue #161

3fba328

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.c shouldn't scrape DuckDuckGo HTML #161

.c shouldn't scrape DuckDuckGo HTML #161

auscompgeek commented Apr 3, 2015

auscompgeek commented Apr 6, 2015

kaneda commented Apr 6, 2015

auscompgeek commented Apr 7, 2015

myano commented Apr 29, 2015

myano commented Jul 8, 2015

.c shouldn't scrape DuckDuckGo HTML #161

.c shouldn't scrape DuckDuckGo HTML #161

Comments

auscompgeek commented Apr 3, 2015

auscompgeek commented Apr 6, 2015

kaneda commented Apr 6, 2015

auscompgeek commented Apr 7, 2015

myano commented Apr 29, 2015

myano commented Jul 8, 2015