1 Star 0 Fork 0

cupid_ly / karu

加入 Gitee
与超过 1200万 开发者一起发现、参与优秀开源项目,私有仓库也完全免费 :)
免费加入
克隆/下载
markdown.py 800 Bytes
一键复制 编辑 原始数据 按行查看 历史
yu.liu 提交于 2022-10-29 21:29 . 添加html转markdown脚本
import html2text
import requests
import re
from bs4 import BeautifulSoup
class MarkDown:
def __init__(self, url):
resp = requests.get(url)
self.html = resp.content.decode(encoding='UTF-8')
print(self.html)
self.soup = BeautifulSoup(self.html, 'html.parser')
print(self.soup.title.string)
def to_file(self):
self.markdownText = html2text.html2text(self.html)
invalid_chars = '[\\\/:*?"<>|]'
replace_char = '-'
filename = re.sub(invalid_chars, replace_char, self.soup.title.string)
with open(f"{filename}.md", mode="wb") as file:
file.write(self.markdownText.encode(encoding='UTF-8'))
if __name__ == '__main__':
url = "https://semver.org/lang/zh-CN/"
md = MarkDown(url)
md.to_file()
Python
1
https://gitee.com/liuyu_cupid/karu.git
git@gitee.com:liuyu_cupid/karu.git
liuyu_cupid
karu
karu
master

搜索帮助