GHC 2021-04-26

4 comments.

, https://git.io/JOdsu in jarun/googler
Clicking on "I agree" takes you to [https://consent.google.com](https://consent.google.com/m?continue=https://www.google.com/search?ie%3DUTF-8%26oe%3DUTF-8%26q%3DBiden%26sei%3Drt1Yvv03QqmJw3%2B861xOtA%26tbm%3Dnws&gl=DE&hl=de&pc=srp&src=1&et=12), which sets a cookie like this along with a 303 redirect, FWIW:

```http
set-cookie: NID=214=DeX...<long string omitted>..._qE; expires=Tue, 26-Oct-2021 03:35:42 GMT; path=/; domain=.google.com; Secure; HttpOnly; SameSite=none
```

On NID cookie: https://policies.google.com/technologies/cookies?hl=en-US#:~:text=For%20example%2C%20most,user%E2%80%99s%20last%20use.

, https://git.io/JOdsz in jarun/googler
Translated:

<img src="https://user-images.githubusercontent.com/4149852/116025435-0df4b680-a683-11eb-847d-094bff255f89.png" width="300">

It's just some annoying cookie consent crap. Some EU user needs figure out what cookie to add to suppress this.

, https://git.io/JOdI3 in jarun/googler
Wait it's actually some kind of notice...

<img width="558" alt="notice" src="https://user-images.githubusercontent.com/4149852/116020145-4c38a880-a678-11eb-979a-84140330dcac.png">

So the good news is we don't actually need to successfully parse this tag soup, as the content is meaningless. The bad news is I'm not sure how to come up with a way to get around this when I don't even get it in the first place.

Probably need user contribution.

, https://git.io/JOdIs in jarun/googler
Yes, this is wrongly closed tag soup, can't even prettier it:

```console
$ prettier -w googler-response-3vjkj5_r.html
googler-response-3vjkj5_r.html
[error] googler-response-3vjkj5_r.html: SyntaxError: Unexpected closing tag "body". It may happen when the tag has already been closed by another tag. For more info see https://www.w3.org/TR/html5/syntax.html#closing-elements-that-have-implied-end-tags (1067:346)
[error]   1065 | ]
[error]   1066 | ]
[error] > 1067 | , sideChannel: {}});</script><script id="wiz_jd" nonce="bzE0bpnLpLEKxNbvHL7Y6w">if (window['_wjdc']) {const wjd = {}; window['_wjdc'](wjd); delete window['_wjdc'];}</script><script aria-hidden="true" nonce="bzE0bpnLpLEKxNbvHL7Y6w">window.wiz_progress&&window.wiz_progress(); window.stopScanForCss&&window.stopScanForCss(); ccTick('bl');</script></body></html>
[error]        |                                                                                                                                                                                                                                                                                                                                                          ^^^^^^^
```

I'll see what I can do other than introducing a full blown HTML5 parser later.