======= Modules ======= .. toctree:: :maxdepth: 1 intro extract_text azure_ocr clean_html clean_xml clean_csv extract_entities correctness_text lang_detect analyze_text_statistics text_similarity beautiful_html html_to_markdown