Skip to content
This repository has been archived by the owner on May 25, 2024. It is now read-only.

xLog-ArticleURLCrawler is a tool based on the xLog platform for crawling blog article URLs. It aims to automatically extract all article URLs from xLog blogs, providing users with a convenient and quick way to obtain article links.

License

Notifications You must be signed in to change notification settings

endercatone/xLog-ArticleURLCrawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

xLog-ArticleURLCrawler

xLog-ArticleURLCrawler 是一个用于从基于 xLog 的博客中爬取文章 URL 的 Python 脚本。

本项目使用ChatGPT协助开发

运行流程

  1. config.ini读取博客的URL
  2. 通过xLog的RSS订阅功能获取文章名和文章URL
  3. 多线程获取真实URL
  4. 写到article.json

使用方法

  1. 克隆项目到本地:
git clone https://github.com/endercatone/xLog-ArticleURLCrawler.git
  1. 安装依赖:
pip install feedparser requests
  1. 在终端中运行脚本:
python main.py
  1. 根据提示输入博客的 URL,例如:
博客URL: https://example.com
  1. 等待脚本执行完成。获取到的文章 URL 将保存在当前目录下的 article.json 文件中。

  2. 打开 article.json 文件,即可查看获取到的文章标题和对应的 URL。

许可证

该项目使用 MIT 许可证。详细信息请参阅 LICENSE 文件。

About

xLog-ArticleURLCrawler is a tool based on the xLog platform for crawling blog article URLs. It aims to automatically extract all article URLs from xLog blogs, providing users with a convenient and quick way to obtain article links.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages