Skip to content

Conversation

@ArshansGithub
Copy link
Contributor

@ArshansGithub ArshansGithub commented May 12, 2023

added feature for GOOGLE_ABUSE_EXEMPTION cookie which bypasses the captcha presented by google when passed into search query (optional feature)

added feature for GOOGLE_ABUSE_EXEMPTION cookie which bypasses the captcha presented by google
@opsdisk
Copy link
Owner

opsdisk commented May 13, 2023

Hi @ArshansGithub - thanks for submitting the PR. The GOOGLE_ABUSE_EXEMPTION header is interesting...I haven't run into that before. Can you provide some more background on how they are collected/scraped? Are they part of a paid Google API offering?

@ArshansGithub
Copy link
Contributor Author

Hey @opsdisk
Whenever google forces a captcha (to prove you're not a robot) when you solve the captcha it places the GOOGLE_ABUSE_EXEMPTION cookie into the browser to be used in further google searches. By harvesting this cookie we can place it in the scraper and further minimize rate limits, etc. No this isn't part of a Google API offering.

@opsdisk
Copy link
Owner

opsdisk commented May 15, 2023

Appreciate the background @ArshansGithub - give me a few days to review the PR.

@ArshansGithub
Copy link
Contributor Author

Any updates?

1 similar comment
@ArshansGithub
Copy link
Contributor Author

Any updates?

@opsdisk
Copy link
Owner

opsdisk commented Jun 10, 2023

Apologies it took longer to get this one. Appreciate the PR @ArshansGithub!

@opsdisk opsdisk closed this Jun 10, 2023
@opsdisk opsdisk reopened this Jun 10, 2023
@opsdisk opsdisk merged commit 985487d into opsdisk:master Jun 10, 2023
@ArshansGithub
Copy link
Contributor Author

Awesome stuff, thanks for the approval.

@ArshansGithub ArshansGithub deleted the patch-1 branch June 11, 2023 00:44
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants