当前位置：移动技术网 > IT编程>脚本编程>Python > python使用BeautifulSoup分析网页信息的方法

python使用BeautifulSoup分析网页信息的方法

2018年07月05日 | 移动技术网IT编程 | 我要评论

本文实例讲述了python使用BeautifulSoup分析网页信息的方法。分享给大家供大家参考。具体如下：

这段python代码查找网页上的所有链接，分析所有的span标签，并查找class包含titletext的span的内容

#import the library used to query a website
import urllib2

#specify the url you want to query
url = "http://www.python.org"

#Query the website and return the html to the variable 'page'
page = urllib2.urlopen(url)

#import the Beautiful soup functions to parse the data returned from the website
from BeautifulSoup import BeautifulSoup

#Parse the html in the 'page' variable, and store it in Beautiful Soup format
soup = BeautifulSoup(page)

#to print the soup.head is the head tag and soup.head.title is the title tag
print soup.head
print soup.head.title

#to print the length of the page, use the len function
print len(page)

#create a new variable to store the data you want to find.
tags = soup.findAll('a')

#to print all the links
print tags

#to get all titles and print the contents of each title
titles = soup.findAll('span', attrs = { 'class' : 'titletext' })
for title in allTitles:
print title.contents

希望本文所述对大家的Python程序设计有所帮助。

您可能感兴趣的文章:

如对本文有疑问，点击进行留言回复！！

用python实现学生管理系统

学生管理系统相信大家学各种语言的时候，练习总是会写各种管理系统吧，管理系统主要有对数据的增删查改操作，原理不难，适合作为练手的小程序数据的结构要保存数据就需要数... [阅读全文]
Python按照先后顺序，对列表进行多条件自定义排序

需求：对指定的列表，按照以下顺序排序：①先按照【编号】从小到大进行排序②再按照列表中包含【方案、扩初、施工图、后... [阅读全文]
Python经典入门100题 (21-30题)

Python入门练手，有这100题就够了！ [阅读全文]
python实现LRU算法

LRU算法python实现学习mysql数据库时，了解了一下ib_buffer_pool的存储机制，使用LRU... [阅读全文]
Python学习笔记——主要函数及基本使用（与C的对比）

实时更新中…文章目录实时更新中...1.函数1.1 title()函数-以首字母大写的方式显示每个单词1.2 合... [阅读全文]
线性回归—梯度下降python实现

import numpy as npimport pandas as pd导入数据data=pd.read_cs... [阅读全文]
python中 if语句（分支结构）使用方式

python中if语句有三种使用形式：if单分支结构（if），if双分支结构（if-else），if多分支结构（... [阅读全文]
KNN算法的理解以及Python实现

参考大佬文章https://blog.csdn.net/c406495762/article/details/7... [阅读全文]
Day03_数据类型介绍&Python运算符&IF分支

一. 数据类型数值类型: int, float, complex字符串: str布尔类型: bool只有2个值:... [阅读全文]
10. 说说Python的某些有意思的库（下）

嘿各位，上次说的那些库是不是不过瘾？那是，真正好玩的还没给你看呢！ [阅读全文]

网友评论


验证码：

python使用BeautifulSoup分析网页信息的方法

2018年07月05日 | 移动技术网IT编程 | 我要评论

您可能感兴趣的文章:

相关文章:

网友评论