Cornflake wrote:
I think you're right about the reason why a jpeg file is returned - when the browser isn't recognised as one supporting WebP a jpeg is substituted. I guessed at the Google user agent name but it's proven by using "Monkey Poo" instead: a jpeg is still retrieved.
Yeah, it seems less likely that the user-agent is the problem. I read the man page: the default user-agent wget sends is "Wget/version". If Cloudflare accepts "Monkey Poo", that suggests it must be deliberately blocking wget, rather than using a whitelist of acceptable user-agents.
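If anyone wants to repeat the user-agent test without wget, here is a rough Python sketch. It sends a chosen User-Agent and reports whether the bytes that come back are a JPEG or a WebP by checking the file signature, so it doesn't rely on the file extension or the Content-Type header. The function names and the example URL are made up for illustration, and I'm assuming the server answers a plain GET; if Cloudflare serves a challenge page or a 403 instead, you'll get "unknown" or an HTTPError.

```python
import urllib.request


def sniff_image_format(data: bytes) -> str:
    """Identify JPEG or WebP from the leading bytes of a file."""
    # JPEG files start with the bytes FF D8 FF.
    if data[:3] == b"\xff\xd8\xff":
        return "jpeg"
    # WebP is a RIFF container: "RIFF", 4 size bytes, then "WEBP".
    if data[:4] == b"RIFF" and data[8:12] == b"WEBP":
        return "webp"
    return "unknown"


def build_request(url: str, user_agent: str) -> urllib.request.Request:
    """Build a GET request that carries the given User-Agent string."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


def fetch_format(url: str, user_agent: str, timeout: float = 10.0) -> str:
    """Fetch the URL and report the format of what was actually served."""
    req = build_request(url, user_agent)
    # Raises urllib.error.HTTPError on a 403 or similar block.
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return sniff_image_format(resp.read(16))


# Example use (hypothetical URL, substitute the real image address):
# for ua in ("Wget/version", "Monkey Poo", "Mozilla/5.0"):
#     print(ua, "->", fetch_format("https://example.com/image.webp", ua))
```

Comparing the result across a few user-agent strings from the same IP should show whether the substitution depends on the user-agent at all.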
Cornflake wrote:
there is also an issue with the source site refusing USA IPs.
A VPN certainly could be involved. The captcha challenge could have been triggered by using an IP associated with VPNs.
If I read Fenn's posts correctly, he managed to retrieve the image, as a WebP, after completing the captcha challenge. Even if Cloudflare deals with troublesome IPs with temporary range bans, I'd still start looking elsewhere for the problem first (particularly JavaScript settings), but keep the VPN in mind.