-
I wanted to archive this locally , since the original is a bloomberg article. I got this :( Is there a way to archive from that site, so I have local copies? I can access the site no problem with the same IP, so this is definitely happening because of the archiving effort. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
Is your DNS set to 1.1.1.1? The owner of Archive.is hates Cloudflare and famously shows this "unsolvable captcha" page to troll people using Cloudflare DNS. Check if there's any chance your container, Docker VM, or host machine are set up to resolve DNS through 1.1.1.1. The simple waydocker compose run archivebox add 'https://one.one.one.one/help/'
# then view the snapshot output to see if it shows you resolving DNS through cloudflare The hard waydocker compose up -d
docker compose exec archivebox /bin/bash
apt update -qq
apt install -y dnsutils
nslookup archive.is
dig +trace archive.is Check for any cloudflare domains or IPs in the DNS trace. The FixChange the https://github.com/ArchiveBox/ArchiveBox/wiki/Configuration#curl_user_agent # see the default user agent
archivebox config --get CHROME_USER_AGENT
# change the user agent
archivebox config --set CHROME_USER_AGENT='Mozilla/5.0 ...' |
Beta Was this translation helpful? Give feedback.
Is your DNS set to 1.1.1.1? The owner of Archive.is hates Cloudflare and famously shows this "unsolvable captcha" page to troll people using Cloudflare DNS.
Check if there's any chance your container, Docker VM, or host machine are set up to resolve DNS through 1.1.1.1.
The simple way
The hard way
docker compose up -d docker compose exec archivebox /bin/bash apt update -qq apt install -y dnsutils nslookup archive.is dig +trace archive.is
Check for any cloudflare domains or IPs in the DNS trace.
The Fix
Change the
dns:
section indock…