path: root/searx
Age | Commit message | Author
2021-02-25fix fetch_languages for bingMarc Abonce Seguin
Bing has a list of regions that it supports, and some of these regions may have more than one possible language. In some cases, like Switzerland, these languages are always shown as options, so there is no issue. But in other cases, like Andorra, Bing will only show one language at a time, either the region's default or the request's language if the latter is supported by that region. For example, if the HTTP request is in French, Andorra will appear as fr-AD, but if the same page is requested in any other language Andorra will appear as ca-AD. This is especially a problem when Bing assumes that the request is in English, because it overrides enough language codes to make several major languages like Arabic disappear from the languages.py file. To avoid that issue, I set the Accept-Language header to a language that's only supported in one region to hopefully avoid these overrides.
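A toy model of the override described above, as a sketch only. The `REGIONS` table is invented for illustration (the `AE` entry in particular is an assumption, not taken from Bing), and `displayed_codes` is a hypothetical helper, not part of searx.

```python
# Hypothetical data: region -> (default language, languages that replace the
# default when the request is made in one of them). Not real Bing data.
REGIONS = {
    "AD": ("ca", {"fr"}),
    "AE": ("ar", {"en"}),
}


def displayed_codes(request_language):
    """Return the language-region codes shown for a given request language."""
    codes = set()
    for region, (default, overrides) in REGIONS.items():
        # The request language wins only if the region supports it.
        language = request_language if request_language in overrides else default
        codes.add(f"{language}-{region}")
    return codes


# An English request hides "ar-AE"; a language that no multi-language region
# supports (here the placeholder "zz") leaves every default in place.
print(displayed_codes("en"))
print(displayed_codes("zz"))
```

In this model, picking a request language that appears in no override set is what keeps every region on its default code, which is the idea behind the Accept-Language choice in the commit.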
2021-02-24[mod] add utils/fetch_external_bangs.pyAlexandre Flament
Based on duckduckgo bangs. Store bangs in a trie to allow autocomplete (not in this commit).
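A minimal sketch of how a trie supports prefix completion of bangs. `BangTrie`, its methods, and the example targets are hypothetical. The data layout that fetch_external_bangs.py actually produces is not shown in this log and may differ.

```python
LEAF = None  # sentinel key marking the end of a stored bang


class BangTrie:
    """Character trie built from nested dicts."""

    def __init__(self):
        self.root = {}

    def insert(self, bang, target):
        node = self.root
        for char in bang:
            node = node.setdefault(char, {})
        node[LEAF] = target

    def complete(self, prefix):
        """Return every stored bang that starts with prefix, sorted."""
        node = self.root
        for char in prefix:
            if char not in node:
                return []
            node = node[char]
        found = []
        stack = [(prefix, node)]
        while stack:
            word, current = stack.pop()
            for key, child in current.items():
                if key is LEAF:
                    found.append(word)
                else:
                    stack.append((word + key, child))
        return sorted(found)


trie = BangTrie()
for bang in ("g", "gh", "gi", "w"):
    trie.insert(bang, "https://example.invalid/" + bang)  # placeholder targets
print(trie.complete("g"))
```

Walking down the trie once per typed character means completion cost depends on the prefix length and the number of matches, not on the total number of bangs.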
2021-02-23Merge pull request #2595 from dalf/update-wikidata-unitsAlexandre Flament
[mod] update wikidata_units.json and fetch_wikidata_units.py
2021-02-23[mod] update currencies.json and fetch_currencies.pyAlexandre Flament
Use a SPARQL request on Wikidata to get the list of currencies. currencies.json contains the translations for all supported searx languages. Supersedes #993.
2021-02-23[mod] update wikidata_units.json and fetch_wikidata_units.pyAlexandre Flament
The fetch_wikidata_units.py result won't change randomly. See comments in the script.
2021-02-22Fix paging of Bing ImagesNoémi Ványi
2021-02-20Added rumble.com video search engine. TODO video embedding.datagram1
Update rumble.py: some lines too long. Disable Rumble engine (disabled: True). PEP8 fix: change line spacing.
2021-02-16Merge pull request #2573 from unixfox/yggtorrentAlexandre Flament
update yggtorrent url + add it back
2021-02-15fix yggtorrent url + add it backEmilien Devos
2021-02-13Improve peertube searchingThorben Günther
At the moment, videos without a description are not shown; setting the default content to "" fixes this. Another current bug is that thumbnails are not displayed. This is caused by a double slash in the URL. To fix this, every trailing slash is now stripped (for backwards compatibility) and the API response is correctly parsed.
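The two fixes can be sketched as follows. The function names, the `description` key, and the example host are assumptions made for illustration, not the engine's actual code.

```python
def build_thumbnail_url(base_url, thumbnail_path):
    """Join instance URL and thumbnail path without producing a double slash."""
    # A configured base URL may or may not end with "/"; strip it either way.
    return base_url.rstrip("/") + thumbnail_path


def extract_content(video):
    """Fall back to an empty string when the video has no description."""
    return video.get("description") or ""


print(build_thumbnail_url("https://peertube.example/", "/static/thumbnails/a.jpg"))
print(repr(extract_content({"description": None})))
```

Stripping the slash at join time keeps existing configurations working whether or not they were written with a trailing slash.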
2021-02-12Merge pull request #2566 from dalf/remove-yandexAlexandre Flament
[remove] yandex engine
2021-02-12[fix] duckduckgo engine: "!ddg !g" do not redirect to googleAlexandre Flament
* searx understands "!ddg !g time" as: send "!g time" to DDG
* !g is a DDG bang for Google: DDG returns an HTTP redirect to Google
This commit adds the allows_redirect param so that the HTTP redirect is not followed. The DDG engine returns an empty result, as before, without the HTTP redirect.
2021-02-12Merge pull request #2562 from dalf/mod-json-engineAlexandre Flament
[mod] json_engine: add content_html_to_text and title_html_to_text
2021-02-12Merge pull request #2565 from dalf/upd-wikipediaAlexandre Flament
[upd] wikipedia engine: return an empty result on query with illegal characters
2021-02-12Merge pull request #2564 from dalf/fix-seznamAlexandre Flament
[fix] fix seznam engine
2021-02-12Merge pull request #2560 from dalf/fix-duckduckgoAlexandre Flament
Fix duckduckgo
2021-02-12Merge pull request #2557 from dalf/fix-raise_for_httperrorAlexandre Flament
Fix: activate raise_for_error by default
2021-02-11Merge pull request #2541 from return42/mediathekviewwebAlexandre Flament
[enh] add engine MediathekViewWeb (API)
2021-02-11[remove] yandex engineAlexandre Flament
2021-02-11[fix] fix seznam engineAlexandre Flament
no paging support
2021-02-11[upd] wikipedia engine: return an empty result on query with illegal charactersAlexandre Flament
on some queries (like an IT error message), wikipedia returns an HTTP error 400. this commit returns an empty result instead of showing an error to the user.
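A rough sketch of the behaviour described above, using a stand-in response object. `FakeResponse`, `parse_response`, and the payload keys are hypothetical and do not reproduce the engine's real response handling.

```python
from dataclasses import dataclass, field


@dataclass
class FakeResponse:
    """Stand-in for an HTTP response; only what the sketch needs."""
    status_code: int
    payload: dict = field(default_factory=dict)


def parse_response(resp):
    """Return a result list, or an empty list on HTTP 400."""
    if resp.status_code == 400:
        # Query contained characters the API rejects: no results, no error.
        return []
    return [{"title": resp.payload["title"], "url": resp.payload["url"]}]


print(parse_response(FakeResponse(400)))
```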
2021-02-10[mod] json_engine: add content_html_to_text and title_html_to_textAlexandre Flament
Some JSON APIs return HTML in either the title or the content. This commit adds two new parameters to the json_engine: content_html_to_text and title_html_to_text, False by default. If True, then searx.utils.html_to_text removes the HTML tags. Update the crossref, openairedatasets and openairepublications engines.
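A sketch of how two such flags could gate tag stripping. `strip_tags` is a simplified stand-in built on the standard library, not searx.utils.html_to_text, and `build_result` with its dict keys is hypothetical.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collect only the text nodes of an HTML fragment."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def strip_tags(markup):
    parser = _TextExtractor()
    parser.feed(markup)
    parser.close()
    # Collapse runs of whitespace left behind by removed tags.
    return " ".join("".join(parser.parts).split())


def build_result(item, title_html_to_text=False, content_html_to_text=False):
    """Copy title and content, stripping HTML only where requested."""
    title, content = item["title"], item["content"]
    if title_html_to_text:
        title = strip_tags(title)
    if content_html_to_text:
        content = strip_tags(content)
    return {"title": title, "content": content}


item = {"title": "<b>Foo</b> bar", "content": "<p>a &amp; b</p>"}
print(build_result(item, title_html_to_text=True, content_html_to_text=True))
```

Keeping both flags off by default leaves engines whose APIs return plain text untouched.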
2021-02-10Merge pull request #2544 from mrwormo/congresslibraryAlexandre Flament
[Engine] Add Library of Congress engine
2021-02-10Merge pull request #2556 from dalf/fix-apk-mirrorAlexandre Flament
[fix] fix apk_mirror engine
2021-02-09[mod] duckduckgo engine: better support of the language preferenceAlexandre Flament
After the main request, send a second to https://duckduckgo.com/t/sl_h See https://github.com/searx/searx/issues/2259
2021-02-09[mod] poolrequests: for one (user request, engine) always use the same HTTPAdapterAlexandre Flament
The duckduckgo engine requires an additional request after the results have been sent. This commit makes sure that the second request uses the same HTTPAdapter, that is, the same IP address and the same proxy.
2021-02-09[enh] add engine MediathekViewWeb (API)Markus Heiser
Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>
2021-02-09Add Library of Congress enginemrwormo
2021-02-09[upd] remove google_play_music engineAlexandre Flament
Google Play Music has been replaced by YouTube Music.
2021-02-09Fix: activate raise_for_error by defaultAlexandre Flament
Fix commit d703119d3a313a406482b121ee94c6afee3bc307 : Some engines need to parse the HTTP error but raise_for_error is always set to False in the "request" function.
2021-02-09[fix] fix apk_mirror engineAlexandre Flament
2021-02-09Merge pull request #2542 from kvch/fix-naver-engineAlexandre Flament
Fix XPATHs in Naver engine
2021-02-09Merge pull request #2554 from MarcAbonce/zh-variants-in-wikipediaAlexandre Flament
Add support for Chinese variants in Wikipedia
2021-02-08Updated webutils.highlight_content to ignore double-quotes when highlighting query partsDaniel Hones
2021-02-08add support for Chinese variants in WikipediaMarc Abonce Seguin
2021-02-07Fix naver engineNoémi Ványi
Closes #2540
2021-02-07[drop] metager - xpath engine won't work anymoreMarkus Heiser
The new version of MetaGer needs to reload the results (into an iframe) with a unique tag (see HTML response below). Implementing a dedicated metager-engine for searx makes no sense to me.

The great days of MetaGer seem to have ended. I remember the good old days when this project started in the 90s of the last century, but in the last few years it has become more and more crap. As the name suggests, MetaGer was made for Germans in the first place. They have added English and Spanish translations, but the i18n is very poor compared to what searx offers.

It's a pity, let's drop MetaGer.

This is the first response; the id (b82679980656899ba5a17ffd02a56846) is unique for each query:

    $ curl "https://metager.org/meta/meta.ger3?eingabe=foo&submit-query=&focus=web"
    <!DOCTYPE html>
    <html lang="en">
    <head>
      <meta charset="UTF-8">
      <link rel="stylesheet" href="/index.css?id=b82679980656899ba5a17ffd02a56846">
      <script src="/index.js?id=b82679980656899ba5a17ffd02a56846"></script>
      <title>foo - MetaGer</title>
      <meta name="viewport" content="width=device-width, initial-scale=1, maximum-scale=1" />
    </head>
    <body>
      <iframe id="mg-framed" src="https://metager.org/meta/meta.ger3?eingabe=foo&amp;submit-query=&amp;focus=web&amp;mgv=b82679980656899ba5a17ffd02a56846" autofocus="true" onload="this.contentWindow.focus();"></iframe>
    </body>

Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>
2021-02-07[feat] recoll: paged json supportHermógenes Oliveira
2021-02-04[fix] get correct locale with country from browserMarc Abonce Seguin
Some of our interface locales include uppercase country codes, which are separated by `_` instead of the more common `-`. Also, a browser's `Accept-Language` header could be in lowercase. This commit attempts to normalize those cases so a browser's language+country codes can better match our locales. This solution assumes that our UI locales have nothing more than a language and optionally a country. If we ever add a script-specific locale like `zh-Hant-TW`, this would have to change to accommodate that, but the idea would be pretty much the same as this fix.
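One way to sketch the normalization, under the same assumption that a tag has a language and at most a country. `normalize_locale` and `match_locale` are hypothetical names, and the fallback to the bare language is an added assumption rather than something the commit message states.

```python
def normalize_locale(tag):
    """Turn 'pt-br' or 'PT_br' into 'pt_BR'; leave bare languages lowercase."""
    parts = tag.replace("_", "-").split("-")
    language = parts[0].lower()
    if len(parts) > 1 and parts[1]:
        return f"{language}_{parts[1].upper()}"
    return language


def match_locale(accept_language_tag, ui_locales):
    """Return the matching UI locale, or None if there is no match."""
    candidate = normalize_locale(accept_language_tag)
    if candidate in ui_locales:
        return candidate
    # Assumed fallback: try the language without its country.
    language = candidate.split("_")[0]
    return language if language in ui_locales else None


ui_locales = {"en", "fr", "pt_BR", "zh_TW"}  # illustrative subset
print(match_locale("zh-tw", ui_locales))
print(match_locale("fr-ca", ui_locales))
```

A script subtag such as `Hant` would land in the country slot here, which is exactly the limitation the commit message points out.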
2021-02-04Add Creative Commons search enginemrwormo
2021-02-01[mod] dynamically set language_support variableAlexandre Flament
The language_support variable is set to True by default, and set to False in only 5 engines. Except for the documentation and the /config URL, this variable is not used. This commit removes the variable definition from the engines and sets the value according to the length of supported_languages: False when the length is 0, True otherwise. Closes #2485
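The rule reduces to a length check. A sketch with a hypothetical helper name, since the place where searx applies it is not shown in this log:

```python
def compute_language_support(supported_languages):
    """True when the engine declares at least one supported language."""
    return len(supported_languages) > 0


print(compute_language_support([]))
print(compute_language_support(["en", "fr"]))
```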
2021-01-28[fix] google: avoid unnecessary SearxEngineXPathException errorsMarkus Heiser
Avoid SearxEngineXPathException errors when parsing non valid results::

    .//div[@class="yuRUbf"]//a/@href index 0 not found
    Traceback (most recent call last):
      File "./searx/engines/google.py", line 274, in response
        url = eval_xpath_getindex(result, href_xpath, 0)
      File "./searx/searx/utils.py", line 608, in eval_xpath_getindex
        raise SearxEngineXPathException(xpath_spec, 'index ' + str(index) + ' not found')
    searx.exceptions.SearxEngineXPathException: .//div[@class="yuRUbf"]//a/@href index 0 not found

Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>
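The general pattern for avoiding such an exception is to ask for a default and skip the result when nothing matches. The sketch below uses plain lists instead of lxml and hypothetical names (`get_index`, `collect_urls`). Whether the commit uses this exact approach is an assumption.

```python
_NOT_SET = object()  # distinguishes "no default given" from default=None


class IndexNotFound(Exception):
    """Stand-in for an 'index not found' XPath error."""


def get_index(matches, index, default=_NOT_SET):
    """Return matches[index], the default if one was given, else raise."""
    if -len(matches) <= index < len(matches):
        return matches[index]
    if default is _NOT_SET:
        raise IndexNotFound(f"index {index} not found")
    return default


def collect_urls(per_result_matches):
    urls = []
    for matches in per_result_matches:
        url = get_index(matches, 0, None)
        if url is None:
            continue  # not a valid result block: skip it quietly
        urls.append(url)
    return urls


print(collect_urls([["https://a.example"], [], ["https://b.example"]]))
```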
2021-01-28[mod] checker: add some additional testsMarkus Heiser
BTW: fix indentation by 2 spaces. The additional tests have been commented out in the google engines to avoid triggering CAPTCHA issues. Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>
2021-01-28[fix] normalize the language & region aspects of all google enginesMarkus Heiser
BTW: make the engines ready for search.checker:

- replace eval_xpath by eval_xpath_getindex and eval_xpath_list
- google_images: remove outer try/except block

Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>
2021-01-24[data] make engines.languagesMarkus Heiser
Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>
2021-01-24[fix] google-videos: parse values for 'length' & 'author'Markus Heiser
The 'video.html' template from the 'oscar' design supports replacement for *author* and *length*. Google-videos does not have an author; the publisher info is used for the *author* instead. Hint: these replacements are not supported by the 'simple' design. Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>
2021-01-24[fix] revise of the google-Video engineMarkus Heiser
This revision is based on the methods developed in the revision of the google engine (see commit 410c2f9). Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>
2021-01-24[fix] google_news: avoid one HTTP redirect except for the English resultsAlexandre Flament
also add params['soft_max_redirects'] = 1 to avoid false error reporting in /stats/errors
2021-01-23[fix] google-news: query uses locale without country tagMarkus Heiser
Without a country-region tag, Google will redirect to correct the country tag [1]:

    SEARX_DEBUG=1 searx-checker -v "google news"
    ...
    https://news.google.com:443 "GET /search?q=computer&hl=en... HTTP/1.1" 302 0
    https://news.google.com:443 "GET /search?q=computer&hl=en-US&.... HTTP/1.1" 200 None
    ...

[1] https://github.com/searx/searx/pull/2483#issuecomment-765600849

Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>
2021-01-22[fix] revise of the google-news engineMarkus Heiser
This revision is based on the methods developed in the revision of the google engine (see commit 410c2f9). Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>