This repository was archived by the owner on Nov 6, 2023. It is now read-only.

Commit bea7426

galenide authored and J0WI committed
[Archive.org] Expanded coverage, removed nonexistent domains, documented Political-TV-Ad-Archive.xml ruleset (#7697)
1 parent 44c0a6d commit bea7426

1 file changed

Lines changed: 36 additions & 31 deletions

src/chrome/content/rules/Internet-Archive.xml

@@ -6,27 +6,30 @@
  - Archive.org_Way_Back_Machine.xml
  - Archive-It.org.xml
  - Open_Library.org.xml
+ - Political-TV-Ad-Archive.xml


- Nonfunctional subdomains:
+ Problematic domains:

- - pgdp01.us ²
-
- ² Refused
-
-
- Problematic subdomains:
-
- - blog.openlibrary *
-
- * Works, cert only matches *.archive.org
+ - ait.blog.archive.org (cert mismatch)
+ - crawler.archive.org (refused)
+ - gmeta[0-7].us.archive.org (refused)
+ - graphite.us.archive.org (refused)
+ - lending.us.archive.org (time out)
+ - *.s3dns.us.archive.org (cert mismatch)
+ - www\d\d.us.archive.org (cert mismatch)


  Fully covered hosts in *archive.org:

  - (www.)?
+ - analytics
  - blog
- - blog.openlibrary (→ blog.openlibrary.org)
+ - bt1
+ - bt2
+ - builds
+ - developers
+ - store
  - *.us

  -->
@@ -35,31 +38,33 @@
  <!-- Direct rewrites:
  -->
  <target host="archive.org" />
+ <target host="www.archive.org" />
+ <target host="analytics.archive.org" />
  <target host="blog.archive.org" />
+ <target host="bt1.archive.org" />
+ <target host="bt2.archive.org" />
+ <target host="builds.archive.org" />
+ <target host="developers.archive.org" />
+ <target host="store.archive.org" />
  <target host="*.us.archive.org" />
- <target host="www.archive.org" />
-
- <!-- Complications:
- -->
- <target host="blog.openlibrary.archive.org" />
-
- <exclusion pattern="^http://(?:graphite|pgdp01|s3|lending)\.us\.archive\.org/" />
-
- <test url="http://graphite.us.archive.org/" />
- <test url="http://pgdp01.us.archive.org/" />
- <test url="http://s3.us.archive.org/" />
- <test url="http://lending.us.archive.org/" />
-
- <!-- The rule does not handle web anyway:
-
- <exclusion pattern="^http://web\.archive\.org/" /-->


- <securecookie host="^\.archive\.org$" name=".+" />
+ <securecookie host="^(www|analytics|bt[12]|builds|developers|store)\.archive\.org$"
+ name=".+" />


- <rule from="^http://blog\.openlibrary\.archive\.org/"
- to="https://blog.openlibrary.org/" />
+ <test url="http://ia301543.us.archive.org/" />
+ <exclusion pattern="^http://gmeta[0-7]\.us\.archive\.org/" />
+ <test url="http://gmeta4.us.archive.org/" />
+ <exclusion pattern="^http://(graphite|lending)\.us\.archive\.org/" />
+ <test url="http://graphite.us.archive.org/" />
+ <test url="http://lending.us.archive.org/" />
+ <exclusion pattern="^http://([\w-]+\.)+s3dns\.us\.archive\.org/" />
+ <test url="http://stats.ia902706.s3dns.us.archive.org/" />
+ <test url="http://hom.3d.ia802600.s3dns.us.archive.org/" />
+ <test url="http://djaj-zine.ia601509.s3dns.us.archive.org/" />
+ <exclusion pattern="^http://www\d\d\.us\.archive\.org/" />
+ <test url="http://www17.us.archive.org/" />

  <rule from="^http:"
  to="https:" />
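
To see how the targets, exclusions, and the single rule in the updated ruleset interact, here is a minimal Python sketch. It is a simplified model and not the extension's actual matching code: the function names `host_matches` and `rewrite` are hypothetical, the target and pattern values are copied from the diff, and it assumes that a left-side wildcard target such as `*.us.archive.org` covers subdomains at any depth.

```python
import re

# Values copied from the added (+) and context lines of the diff.
TARGETS = [
    "archive.org", "www.archive.org", "analytics.archive.org",
    "blog.archive.org", "bt1.archive.org", "bt2.archive.org",
    "builds.archive.org", "developers.archive.org", "store.archive.org",
    "*.us.archive.org",
]
EXCLUSIONS = [
    r"^http://gmeta[0-7]\.us\.archive\.org/",
    r"^http://(graphite|lending)\.us\.archive\.org/",
    r"^http://([\w-]+\.)+s3dns\.us\.archive\.org/",
    r"^http://www\d\d\.us\.archive\.org/",
]
RULE_FROM, RULE_TO = r"^http:", "https:"


def host_matches(host, target):
    """Hypothetical helper: does `host` fall under a <target host="..."> entry?"""
    # Assumption: a left-side wildcard covers subdomains at any depth.
    if target.startswith("*."):
        return host.endswith(target[1:])
    return host == target


def rewrite(url):
    """Return the rewritten URL, or the input unchanged if no rule applies."""
    m = re.match(r"^https?://([^/:]+)", url)
    if m is None or not any(host_matches(m.group(1), t) for t in TARGETS):
        return url  # host is not covered by any target
    if any(re.search(p, url) for p in EXCLUSIONS):
        return url  # an exclusion pattern opts this URL out
    return re.sub(RULE_FROM, RULE_TO, url, count=1)


if __name__ == "__main__":
    for u in ("http://ia301543.us.archive.org/",
              "http://gmeta4.us.archive.org/",
              "http://hom.3d.ia802600.s3dns.us.archive.org/"):
        print(u, "->", rewrite(u))
```

In this model only the first URL is upgraded to HTTPS. The other two fall under the `*.us.archive.org` target but are caught by an exclusion, which is why the commit pairs each exclusion with at least one test URL.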
