{"id":205107,"date":"2025-05-29T17:07:22","date_gmt":"2025-05-29T09:07:22","guid":{"rendered":"https:\/\/server.hk\/cnblog\/205107\/"},"modified":"2025-05-29T17:07:22","modified_gmt":"2025-05-29T09:07:22","slug":"whisper-%e5%9c%a8-mac-mini-%e4%b8%8a%e5%ae%89%e8%a3%85%e8%bf%90%e8%a1%8c%e6%9c%89%e9%97%ae%e9%a2%98%ef%bc%8c%e6%9c%89%e5%93%aa%e4%ba%9b-python-%e8%af%ad%e9%9f%b3%e8%af%86%e5%88%ab%e5%ba%93%e5%8f%af","status":"publish","type":"post","link":"https:\/\/server.hk\/cnblog\/205107\/","title":{"rendered":"Whisper has problems installing and running on a Mac mini: which Python speech recognition libraries can replace it?"},"content":{"rendered":"<h1>Whisper has problems installing and running on a Mac mini: which Python speech recognition libraries can replace it?<\/h1>\n<p><img decoding=\"async\" src=\"https:\/\/www.17golang.com\/uploads\/20241123\/17323691926741db286cf63.jpg\" class=\"aligncenter\"><\/p>\n<p><strong>Choosing a Python speech recognition library<\/strong><\/p>\n<p>For speech-to-text work, the Python library Whisper may run into compatibility problems, especially when it is installed and run on a Mac mini. As an alternative, consider the SpeechRecognition library.<\/p>\n<p>The SpeechRecognition library wraps several speech recognition backends, including the Google Web Speech API, Microsoft Bing Voice Recognition, and IBM Speech to Text. These backends differ in features and accuracy, so you can pick the one that matches your requirements.<\/p>\n<p>With SpeechRecognition, a basic recognition pass takes only a few lines:<\/p>\n<pre>import speech_recognition as sr  # sr.Microphone also requires PyAudio\n\n# Create the recognizer object\nr = sr.Recognizer()\n\n# Capture audio from the default microphone\nwith sr.Microphone() as source:\n    audio = r.listen(source)\n\n# Send the audio to the Google Web Speech API backend (needs network access)\ntry:\n    text = r.recognize_google(audio)\n    print(text)\nexcept sr.UnknownValueError:\n    print(\"Could not understand the audio\")\nexcept sr.RequestError as e:\n    print(f\"Could not reach the recognition service: {e}\")<\/pre>\n<p>Other alternatives and related resources include:<\/p>\n<ul>\n<li><strong>Vosk<\/strong>: an open-source, lightweight Python library that uses Kaldi models for speech recognition.<\/li>\n<li><strong>DeepSpeech<\/strong>: Mozilla's open-source speech recognition engine built on TensorFlow, aimed at high accuracy.<\/li>\n<li><strong>LibriSpeech<\/strong>: a large open speech dataset used to train speech recognition models. It is a corpus rather than a library, so it complements the tools above instead of replacing Whisper.<\/li>\n<\/ul>\n<p>Depending on your specific needs and platform, these options may offer a more suitable speech recognition solution.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Whisper on Mac 
mi&#46;&#46;&#46;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[4925],"tags":[],"class_list":["post-205107","post","type-post","status-publish","format-standard","hentry","category-4925"],"_links":{"self":[{"href":"https:\/\/server.hk\/cnblog\/wp-json\/wp\/v2\/posts\/205107","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/server.hk\/cnblog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/server.hk\/cnblog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/server.hk\/cnblog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/server.hk\/cnblog\/wp-json\/wp\/v2\/comments?post=205107"}],"version-history":[{"count":0,"href":"https:\/\/server.hk\/cnblog\/wp-json\/wp\/v2\/posts\/205107\/revisions"}],"wp:attachment":[{"href":"https:\/\/server.hk\/cnblog\/wp-json\/wp\/v2\/media?parent=205107"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/server.hk\/cnblog\/wp-json\/wp\/v2\/categories?post=205107"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/server.hk\/cnblog\/wp-json\/wp\/v2\/tags?post=205107"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}