Several sanitization steps for blue links
This MR introduces multiple processing steps to sanitize blue links that yield null topic QIDs.
Closes https://phabricator.wikimedia.org/T323523.
Highlights:
- lowercase the first character. Comply with the expected markup to maximize redirects resolution and topic QIDs lookup
- filter empty-string links due to
mwp.strip_code
calls - handle category links
- handle media link. This has a significant impact in execution time, so pass a boolean flag to enable it
See the Phab ticket and commit messages for more details.