-
Couldn't load subscription status.
- Fork 398
[CI] Lychee: automated check of links in doc #1258
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
|
Hi,
So, as per image above, Lychee currently checks 152 on the Airflow doc site and find no errors. |
Note that complete Airflow site also includes 10s of millions of |
|
More information how our docs are built (and architecture) https://github.com/apache/airflow/blob/main/docs/README.md |
|
Ah - sorry it's much more than 10M lines, but it's just 265 K .html files. Still a lot. I am running some bulk update on those now - so counted them :D |
|
Hi, I'll extend my workflow code tomorrow to exactly list all the links that are checked by Lychee. |
|
Hi, I changed my code to have a much deeper scan but I now get lots of errors due to obsolete links mostly in release notes (old JIRA links, etc.) . So, I'll close this for now and try to come up with a smarter approach to detect real issues only. |
Hi,
adding a github workflow to validate - at least daily - that all links from Airflow Doc are correct.
This workflow checks all links from Airflow doc via Lychee: https://github.com/lycheeverse/lychee
Specific aspects:
is no activity on this repo itself.
When tested on 'https://airflow.apache.org/docs/' from my repo clone, it delivers this report:
Didier