summaryrefslogtreecommitdiff
path: root/searx/engines
diff options
context:
space:
mode:
authorMarkus Heiser <markus.heiser@darmarIT.de>2025-05-13 10:18:28 +0200
committerGitHub <noreply@github.com>2025-05-13 10:18:28 +0200
commit90068660196d898896219d1df7a088348c5d3d14 (patch)
tree1903c593b48c5d5a66b2109cbe1fe1a748b15db7 /searx/engines
parent5d99373bc65c7087ee743a1fe44897bad6065338 (diff)
[fix] engine archlinux: avoid Anubis challenge by User-Agent "SearXNG" (#4779)
Of the archlinux wikis only wiki.archlinux.org has a has Anubis challenge. About Anubis[1]: > Anubis decides to present a challenge using this logic: > > - User-Agent contains "Mozilla" > ... > This should ensure that git clients, RSS readers, and other low-harm clients > can get through without issue .. [1] https://github.com/TecharoHQ/anubis/blob/6c0ff3f4d5e7a66ea85e5873e6ee4c28692091d7/docs/docs/design/how-anubis-works.mdx#challenge-presentation Suggested-by: @unixfox https://github.com/searxng/searxng/issues/4646#issuecomment-2855322406 Closes: https://github.com/searxng/searxng/issues/4646 Signed-off-by: Markus Heiser <markus.heiser@darmarit.de>
Diffstat (limited to 'searx/engines')
-rw-r--r--searx/engines/archlinux.py3
1 files changed, 3 insertions, 0 deletions
diff --git a/searx/engines/archlinux.py b/searx/engines/archlinux.py
index 820b31799..63de157dc 100644
--- a/searx/engines/archlinux.py
+++ b/searx/engines/archlinux.py
@@ -51,6 +51,9 @@ def request(query, params):
if netloc == main_wiki:
eng_lang: str = traits.get_language(sxng_lang, 'English') # type: ignore
query += ' (' + eng_lang + ')'
+ # wiki.archlinux.org is protected by anubis
+ # - https://github.com/searxng/searxng/issues/4646#issuecomment-2817848019
+ params['headers']['User-Agent'] = "SearXNG"
elif netloc == 'wiki.archlinuxcn.org':
base_url = 'https://' + netloc + '/wzh/index.php?'