<?xml version="1.0" encoding="utf-8"?>
<!-- generator="FeedCreator 1.7.2-ppt DokuWiki" -->
<?xml-stylesheet href="http://www.pythonclub.org/lib/exe/css.php?s=feed" type="text/css"?>
<rdf:RDF
    xmlns="http://purl.org/rss/1.0/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
    xmlns:dc="http://purl.org/dc/elements/1.1/">
    <channel rdf:about="http://www.pythonclub.org/feed.php">
        <title>Python Club modules:beautifulsoup</title>
        <description></description>
        <link>http://www.pythonclub.org/</link>
        <image rdf:resource="http://www.pythonclub.org/lib/tpl/dokuwiki/images/favicon.ico" />
        <dc:date>2026-05-02T09:49:41+00:00</dc:date>
        <items>
            <rdf:Seq>
                <rdf:li rdf:resource="http://www.pythonclub.org/modules/beautifulsoup/encode?rev=1275441528&amp;do=diff"/>
                <rdf:li rdf:resource="http://www.pythonclub.org/modules/beautifulsoup/start?rev=1301112074&amp;do=diff"/>
                <rdf:li rdf:resource="http://www.pythonclub.org/modules/beautifulsoup/tricks?rev=1275441528&amp;do=diff"/>
            </rdf:Seq>
        </items>
    </channel>
    <image rdf:about="http://www.pythonclub.org/lib/tpl/dokuwiki/images/favicon.ico">
        <title>Python Club</title>
        <link>http://www.pythonclub.org/</link>
        <url>http://www.pythonclub.org/lib/tpl/dokuwiki/images/favicon.ico</url>
    </image>
    <item rdf:about="http://www.pythonclub.org/modules/beautifulsoup/encode?rev=1275441528&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2010-06-02T01:18:48+00:00</dc:date>
        <title>Encoding Handling in BeautifulSoup</title>
        <link>http://www.pythonclub.org/modules/beautifulsoup/encode?rev=1275441528&amp;do=diff</link>
        <description>Encoding Handling in BeautifulSoup

BeautifulSoup works with Unicode internally: it automatically detects the encoding of the input document and converts it to Unicode.
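
The idea of normalizing input bytes to Unicode, with an explicitly supplied encoding taking precedence (the role the fromEncoding argument plays), can be sketched with the standard library alone. This is only an illustration and not BeautifulSoup's actual detection code: the helper name to_unicode, its parameters, and the fallback list are all assumptions made for this example.

```python
def to_unicode(raw, from_encoding=None, fallbacks=("utf-8", "gbk", "latin-1")):
    """Decode bytes to str, trying an explicit encoding before the fallbacks.

    Hypothetical helper for illustration; not part of BeautifulSoup.
    Returns a (text, encoding_used) tuple.
    """
    # An explicitly supplied encoding is tried first, then the assumed fallbacks.
    candidates = ([from_encoding] if from_encoding else []) + list(fallbacks)
    for encoding in candidates:
        try:
            return raw.decode(encoding), encoding
        except (UnicodeDecodeError, LookupError):
            continue  # wrong or unknown encoding: try the next candidate
    raise ValueError("none of the candidate encodings could decode the input")


raw = "中文教程".encode("gbk")
text, used = to_unicode(raw)  # no hint: utf-8 fails on these bytes, gbk succeeds
hinted_text, hinted_used = to_unicode(raw, from_encoding="gbk")
```

Because latin-1 accepts any byte sequence, keeping it last in the assumed fallback list means a wrong guess yields garbled text rather than an error, which is one reason an explicit encoding is worth passing when it is known.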

BeautifulSoup Encoding Detection Order

BeautifulSoup detects the encoding in the following order:

	*  The fromEncoding argument passed when creating the Soup object;</description>
    </item>
    <item rdf:about="http://www.pythonclub.org/modules/beautifulsoup/start?rev=1301112074&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2011-03-26T04:01:14+00:00</dc:date>
        <title>Beautiful Soup Chinese Tutorial</title>
        <link>http://www.pythonclub.org/modules/beautifulsoup/start?rev=1301112074&amp;do=diff</link>
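        <description>Beautiful Soup Chinese Tutorial

Beautiful Soup is a Python module for processing HTML/XML, and it is quite powerful. I recently read through its documentation carefully and finally understood some of it.
I plan to study it properly and, along the way, collect some Beautiful Soup usage notes on this wiki, since that documentation really is not great.</description>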
    </item>
    <item rdf:about="http://www.pythonclub.org/modules/beautifulsoup/tricks?rev=1275441528&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2010-06-02T01:18:48+00:00</dc:date>
        <title>BeautifulSoup Tricks</title>
        <link>http://www.pythonclub.org/modules/beautifulsoup/tricks?rev=1275441528&amp;do=diff</link>
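        <description>BeautifulSoup Tricks

Getting all the text under a node


# BeautifulSoup 3 API; soup is assumed to be an already-parsed BeautifulSoup object (or any tag within it)
texts = soup.findAll(text=True)  # every text node beneath the node
all_text = &quot;\n&quot;.join(texts)</description>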
    </item>
</rdf:RDF>
