MediaCrawler/media_platform/kuaishou/graphql.py

# 声明：本代码仅供学习和研究目的使用。使用者应遵守以下原则：  
# 1. 不得用于任何商业用途。  
# 2. 使用时应遵守目标平台的使用条款和robots.txt规则。  
# 3. 不得进行大规模爬取或对平台造成运营干扰。  
# 4. 应合理控制请求频率，避免给目标平台带来不必要的负担。   
# 5. 不得用于任何非法或不当的用途。
#   
# 详细许可条款请参阅项目根目录下的LICENSE文件。  
# 使用本代码即表示您同意遵守上述原则和LICENSE中的所有条款。  


# 快手的数据传输是基于GraphQL实现的
# 这个类负责获取一些GraphQL的schema
from typing import Dict


class KuaiShouGraphQL:
    graphql_queries: Dict[str, str]= {}

    def __init__(self):
        self.graphql_dir = "media_platform/kuaishou/graphql/"
        self.load_graphql_queries()

    def load_graphql_queries(self):
        graphql_files = ["search_query.graphql", "video_detail.graphql", "comment_list.graphql", "vision_profile.graphql","vision_profile_photo_list.graphql","vision_profile_user_list.graphql","vision_sub_comment_list.graphql"]

        for file in graphql_files:
            with open(self.graphql_dir + file, mode="r") as f:
                query_name = file.split(".")[0]
                self.graphql_queries[query_name] = f.read()

    def get(self, query_name: str) -> str:
        return self.graphql_queries.get(query_name, "Query not found")
chore: 增加代码学习声明，严格禁止非法、禁止商业、不当用途 2024-10-19 16:43:25 +00:00			`# 声明：本代码仅供学习和研究目的使用。使用者应遵守以下原则：`
			`# 1. 不得用于任何商业用途。`
			`# 2. 使用时应遵守目标平台的使用条款和robots.txt规则。`
			`# 3. 不得进行大规模爬取或对平台造成运营干扰。`
			`# 4. 应合理控制请求频率，避免给目标平台带来不必要的负担。`
			`# 5. 不得用于任何非法或不当的用途。`
			`#`
			`# 详细许可条款请参阅项目根目录下的LICENSE文件。`
			`# 使用本代码即表示您同意遵守上述原则和LICENSE中的所有条款。`


feat: 搜索接口调试完成 2023-11-24 16:02:33 +00:00			`# 快手的数据传输是基于GraphQL实现的`
			`# 这个类负责获取一些GraphQL的schema`
feat: 增加 IP 代理的最新实现 2023-12-02 08:14:36 +00:00			`from typing import Dict`
feat: 搜索接口调试完成 2023-11-24 16:02:33 +00:00

			`class KuaiShouGraphQL:`
feat: 快手视频评论爬取done；数据保存到DB、CSV done 2023-11-26 13:43:39 +00:00			`graphql_queries: Dict[str, str]= {}`
feat: 搜索接口调试完成 2023-11-24 16:02:33 +00:00
			`def __init__(self):`
			`self.graphql_dir = "media_platform/kuaishou/graphql/"`
			`self.load_graphql_queries()`

			`def load_graphql_queries(self):`
快手指定创作者主页和二级评论 2024-06-13 06:49:07 +00:00			`graphql_files = ["search_query.graphql", "video_detail.graphql", "comment_list.graphql", "vision_profile.graphql","vision_profile_photo_list.graphql","vision_profile_user_list.graphql","vision_sub_comment_list.graphql"]`
feat: 搜索接口调试完成 2023-11-24 16:02:33 +00:00
			`for file in graphql_files:`
			`with open(self.graphql_dir + file, mode="r") as f:`
			`query_name = file.split(".")[0]`
			`self.graphql_queries[query_name] = f.read()`

			`def get(self, query_name: str) -> str:`
			`return self.graphql_queries.get(query_name, "Query not found")`