I tried finding information on what indexer they are using. Are they using their own?
Edit: the readme says this:
The commoncrawl organization for crawling the web and making the dataset readily available. Even though we have our own crawler now, commoncrawl has been a huge help in the early stages of development.
Personally, I would put some sort of notice regarding these on affected projects, but I don’t think it’s enough to warrant slapping an anti-feature flag on them just because of the author’s choice of code repository hosting provider or CDN.