Apache Project Website Checks

Checking Project Websites for required and disallowed content

This script periodically crawls all Apache project and podling websites to check them for a few specific links or text blocks that all projects are expected to have. The checks include verifying that all required links appear on a project homepage, along with an "image" check if project logo files are in apache.org/img

The script also checks for 3rd party resource references that might be in conflict with our privacy policy.

The Content-Security-Policy (Csp) check is a work in progress: it only checks that the default settings have not been over-ridden. It does not check if the host exceptions have been approved.

View the crawler code, website display code, validation checks details, and raw JSON data.
Last crawl time: Fri, 31 Oct 2025 06:10:46 GMT over 219 websites.

Site Check For Project - DataFusion

Results for Project DataFusion .
Check Results column is the actual text or URL found on the homepage for this check (when applicable).
Check Type Check Results Check Description
Uri https://datafusion.apache.org
Foundation Apache Software Foundation
Events URL expected to match regular expression: ^https?://((www\.)?apache\.org/events/current-event|events\.apache\.org|www\.apachecon\.com/event-images/snippet\.js)
Projects SHOULD include a link to any current CommunityOverCode event, or to the events.apache.org site, as provided by VP, Conferences.
License https://www.apache.org/licenses/
Thanks https://www.apache.org/foundation/thanks.html
Security https://www.apache.org/security/
Sponsorship https://www.apache.org/foundation/sponsorship.html
Trademarks are either registered trademarks or trademarks of The Apache Software Foundation in the United States and other countries.
Copyright Text of a link expected to match regular expression: ((Copyright|©).*apache|apache.*(Copyright|©))
All website content SHOULD include a copyright notice for the ASF.
Privacy URL expected to match regular expression: \Ahttps://privacy\.apache\.org/policies/privacy-policy-public\.html\z | \Ahttps?://(?:www\.)?apache\.org/foundation/policies/privacy\.html\z
All websites must link to the Privacy Policy.
Resources Found 1 external resources: {"ERROR Got invalid theme mode: . Resetting to auto."=>1} Text of a link expected to match regular expression: Found \d+ external resources
Websites must not link to externally hosted resources
Image URL expected to match regular expression: .
Projects SHOULD add a copy of their logo to https://www.apache.org/logos/ to be included in ASF homepage.
Csp_check OK