{"id":204926,"date":"2025-05-29T14:57:43","date_gmt":"2025-05-29T06:57:43","guid":{"rendered":"https:\/\/server.hk\/cnblog\/204926\/"},"modified":"2025-05-29T14:57:43","modified_gmt":"2025-05-29T06:57:43","slug":"beautifulsoup%e4%b8%adfind_all%e6%8f%90%e5%8f%96%e5%85%83%e7%b4%a0%e5%8c%85%e5%90%ab%e5%9b%9e%e8%bd%a6%e7%ac%a6%e5%a6%82%e4%bd%95%e5%a4%84%e7%90%86%ef%bc%9f","status":"publish","type":"post","link":"https:\/\/server.hk\/cnblog\/204926\/","title":{"rendered":"How to Handle Newlines in Elements Extracted with find_all in BeautifulSoup?"},"content":{"rendered":"<h1>How to Handle Newlines in Elements Extracted with find_all in BeautifulSoup?<\/h1>\n<p><img decoding=\"async\" src=\"https:\/\/www.17golang.com\/uploads\/20241119\/1732006316673c51ac67084.jpg\" class=\"aligncenter\"><\/p>\n<p><strong>Handling newlines in elements extracted with find_all in bs4<\/strong><\/p>\n<p>When BeautifulSoup's find_all function extracts elements from a page, an element whose content contains a newline is still returned as a single element, but the text it yields keeps that newline. Printed as-is, one value spans several lines and can look as if the element had been split into multiple elements. This is a nuisance when you only want the element's text.<\/p>\n<p>To fix this, post-process the string returned by the .get_text() method: call the string's replace method to swap each newline character ('\\n') for a space. Replacing it with an empty string also removes the line break, but it joins the words on either side of it.<\/p>\n<p>The following code shows the change:<\/p>\n<pre>from urllib.request import urlopen\nfrom bs4 import BeautifulSoup\n\n# Fetch and parse the sample page\nhtml = urlopen('http:\/\/www.pythonscraping.com\/pages\/warandpeace.html')\nbs = BeautifulSoup(html.read(), 'html.parser')\n\n# Each matching span comes back as a single Tag\nname_list = bs.find_all('span', {'class': 'green'})\nfor name in name_list:\n    # Replace newlines inside the text with a space before printing\n    print(name.get_text().replace('\\n', ' '))<\/pre>\n<p>With this change, every newline in the element text becomes a space, so the string that is printed no longer contains newline characters and each name appears on a single line.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>BeautifulSoup\u4e2dfi&#46;&#46;&#46;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[4925],"tags":[],"class_list":["post-204926","post","type-post","status-publish","format-standard","hentry","category-4925"],"_links":{"self":[{"href":"https:\/\/server.hk\/cnblog\/wp-json\/wp\/v2\/posts\/204926","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/server.hk\/cnblog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/server.hk\/cnblog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/server.hk\/cnblog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/server.hk\/cnblog\/wp-json\/wp\/v2\/comments?post=204926"}],"version-history":[{"count":0,"href":"https:\/\/server.hk\/cnblog\/wp-json\/wp\/v2\/posts\/204926\/revisions"}],"wp:attachment":[{"href":"https:\/\/server.hk\/cnblog\/wp-json\/wp\/v2\/media?parent=204926"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/server.hk\/cnblog\/wp-json\/wp\/v2\/categories?post=204926"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/server.hk\/cnblog\/wp-json\/wp\/v2\/tags?post=204926"}],"
curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}