Commit Graph

407 Commits

Author SHA1 Message Date
Relakkes c26270146f fix: 修复ip代理池设计bug 2024-04-13 18:04:33 +08:00
Relakkes e367129b5f fix: 修复pr#229的小bug 2024-04-13 13:48:29 +08:00
程序员阿江-Relakkes a341dc2aff
Merge pull request #229 from Tianci-King/main
feat(core): 新增控制爬虫参数起始页面的页数start_page;perf(argparse): 向命令行解析器添加程序参数…
2024-04-13 13:37:35 +08:00
程序员阿江-Relakkes da949f0c7a
Merge pull request #226 from leantli/feat/sub_comments
feat: 支持爬取小红书二级评论
2024-04-13 01:17:55 +08:00
leantli 6cabece01a chore: remove redundant line breaks 2024-04-12 18:18:01 +08:00
leantli ad01dfba95 feat: 轻量化支持爬取小红书二级评论 2024-04-12 17:32:20 +08:00
Tianci-King 1115b0d90c feat(core): 新增控制爬虫 参数起始页面的页数start_page;perf(argparse): 向命令行解析器添加程序参数起始页面页数和关键字 2024-04-12 00:52:47 +08:00
leantli 81a9946afd feat: 支持爬取小红书二级评论 2024-04-11 17:16:13 +08:00
程序员阿江-Relakkes bba9841c26
Merge pull request #223 from Ermeng98/main
新增对微博博客内照片获取的支持 文件存放路径data/weibo/images
2024-04-11 00:11:04 +08:00
Er_Meng 9cd6efb916 使用isort对引用进行格式化排序 修改微博获取图片默认配置关闭 2024-04-10 09:54:28 +08:00
Ermeng 52c720591f
Merge branch 'NanmiCoder:main' into main 2024-04-10 09:51:14 +08:00
程序员阿江-Relakkes cab91474ad
Merge pull request #224 from Klu5ure/main
docs: 恢复文档中误删的字
2024-04-09 23:14:14 +08:00
Relakkes 488acd04df fix: #222 2024-04-09 23:10:51 +08:00
Klu5ure 4ed5d4d73e docs: 恢复文档中误删的字 2024-04-09 23:00:13 +08:00
Er_Meng 16413c3074 新增对微博博客内照片获取的支持 文件存放路径data/weibo/images 2024-04-09 17:21:52 +08:00
Relakkes 5c409c6f0c docs: 群二维码失效,更新 2024-04-08 23:41:53 +08:00
Relakkes 8f02da73ad fix: #219
docs: update README.md
2024-04-08 00:19:50 +08:00
程序员阿江-Relakkes b54f168c6b
Merge pull request #217 from Klu5ure/main
docs: 修改错别字
2024-04-07 23:11:19 +08:00
Your Name fd7c597305 docs: 修改错别字 2024-04-07 22:09:05 +08:00
Relakkes d392747fe7 fix: 移除orm的所有内容 2024-04-06 23:51:03 +08:00
程序员阿江-Relakkes 1627ae4344
Merge pull request #216 from NanmiCoder/feature/refactor_db
数据存储DB重构,移除tortoise-orm依赖
2024-04-06 22:21:30 +08:00
Relakkes f9cbeb2b14 chore: 移除tortise-orm依赖 2024-04-06 22:14:03 +08:00
Relakkes 0c8484c334 feat: db数据存储重构完成 2024-04-06 22:11:10 +08:00
Relakkes de4a437dd7 style: rename image name 2024-04-06 12:23:55 +08:00
Relakkes 27f3461e31 docs: update README.md 2024-04-06 10:58:35 +08:00
Relakkes dde3c0429e refactor: IP代理池重构 2024-04-06 10:58:35 +08:00
程序员阿江-Relakkes d0c578c2bf
Merge pull request #213 from Styunlen/patch-1
chore: fix wrong log output when weibo crawler finished
2024-04-06 00:51:36 +08:00
Styunlen 40daa8d6f3
chore: fix wrong log output when weibo crawler finished
Scripts output "Bilibili crawler finished" when Weibo crawler finished.
2024-04-06 00:41:05 +08:00
程序员阿江-Relakkes 5495d76fdb
Merge pull request #212 from chunpat/fixbug_qrcode_show
Remove duplication Qrcode Show
2024-04-05 22:02:06 +08:00
chunpat 6422500e32 Remove duplication Qrcode Show 2024-04-05 21:24:06 +08:00
程序员阿江-Relakkes 87e3d0848a
Merge pull request #209 from xbsheng/chore/text
chore: 修改错别字
2024-04-04 18:11:42 +08:00
xbsheng 2d24cf0f44 chore: 修改错别字 2024-04-04 17:32:23 +08:00
程序员阿江-Relakkes 6160428909
Merge pull request #202 from zhihuiio/patch-1
Update playwright version for compiling
2024-04-04 13:12:50 +08:00
程序员阿江-Relakkes 208978b88f
Merge pull request #206 from leantli/fix/max_notes_count
fix: 修复爬取视频/帖子最大数设置值较低导致不爬取的问题
2024-04-04 00:46:42 +08:00
leantli 68a60faa7f chore: 简化判断方式 2024-04-04 00:11:22 +08:00
Relakkes 3de57459c1 docs: 增加交流2群二维码 2024-04-03 13:40:38 +08:00
Relakkes 85382ac7ed docs:更新许可证协议,不允许商用。 2024-04-03 12:34:34 +08:00
leantli 133f978477 fix: 修复爬取视频/帖子最大数设置值较低导致不爬取的问题 2024-04-03 12:18:23 +08:00
Relakkes e4836847cd docs: update README.md 2024-04-01 22:08:26 +08:00
zhihuiio 085702820a
Update requirements.txt 2024-04-01 20:28:50 +08:00
程序员阿江-Relakkes 285ab7abe6
Update README.md 2024-04-01 12:54:53 +08:00
Relakkes bb732ed2ff docs: 增加MediaCrawler视频课程链接 2024-03-31 23:51:37 +08:00
Relakkes 6d8c8fb22e feat: add mysql table 2024-03-31 15:05:34 +08:00
程序员阿江-Relakkes 807af1abed
Merge pull request #201 from Schofi/main
Update README.md
2024-03-31 14:40:11 +08:00
Relakkes e950e0d6e3 feat: add abstract api client to all platform 2024-03-30 21:27:25 +08:00
Relakkes 67ec49498a refactor: rename xhs to xiaohongshu 2024-03-30 21:17:33 +08:00
Schofi 81317482f8
Update README.md
python版本补充
2024-03-30 18:29:41 +08:00
Relakkes aa257aab51 docs: update README.md 2024-03-30 14:14:10 +08:00
relakkes aca1924bd7
Merge pull request #183 from BaoZhuhan/fix-markdown
fix : 修复”常见问题.md“
2024-03-19 23:13:34 +08:00
BaoZhuhan 1389ad0f81 Merge branch 'fix-markdown' of github.com:BaoZhuhan/MediaCrawler into fix-markdown 2024-03-19 22:59:33 +08:00