How to get html tag value in python

I'm newbie to python. Here is my code working on python 2.7.5

import urllib2
import sys       

url ="mydomain.com"
usock = urllib2.urlopen[url]
data = usock.read[]
usock.close[]

print data

Getting HTML markup like that and it works.

What I want to do is, to get value from inside tag. for ex. I need data value from this example:

Data

How to do it?

asked Sep 6, 2013 at 11:38

You can use a HTML parser module such as BeautifulSoup:

from bs4 import BeautifulSoup as BS
url ="mydomain.com"
usock = urllib2.urlopen[url]
data = usock.read[]
usock.close[]
soup = BS[data]
print soup.find['font', {'class':'big'}].text

This finds a tag with a class="big". It then prints its content.

answered Sep 6, 2013 at 11:39

TerryATerryA

56.9k11 gold badges117 silver badges137 bronze badges

Using lxml:

import urllib2
import lxml.html

url ="mydomain.com"

usock = urllib2.urlopen[url]
data = usock.read[]
usock.close[]
for font in lxml.html.fromstring[data].cssselect['font.big']:
    print font.text

>>> import lxml.html
>>> root = lxml.html.fromstring['Data']
>>> [font.text for font in root.cssselect['font.big']]
['Data']

answered Sep 6, 2013 at 11:40

falsetrufalsetru

343k57 gold badges683 silver badges606 bronze badges

View Discussion

Improve Article

Save Article

Read

Discuss

View Discussion

Improve Article

Save Article

Prerequisites: Beautifulsoup

In this article, we will discuss how beautifulsoup can be employed to find a tag with the given attribute value in an HTML document.

Approach:

Import module.
Scrap data from a webpage.
Parse the string scraped to HTML.
Use find[] function to find the attribute and tag.
Print the result.

Syntax: find[attr_name=”value”]

Below are some implementations of the above approach:

Example 1:

Python3

from bs4 import BeautifulSoup

markup =

soup = BeautifulSoup[markup, 'html.parser']

div_bs4 = soup.find[id = "container"]

print[div_bs4.name]

Output:

div

Example 2:

Python3

from bs4 import BeautifulSoup

soup = BeautifulSoup[markup, 'html.parser']

print[div_bs4.name]

Output:

Example 3:

Python3

from bs4 import BeautifulSoup

markup =

soup = BeautifulSoup[markup, 'html.parser']

div_bs4 = soup.find[class_ = "gfg"]

print[div_bs4.name]

Output:

Toplist mới

Top 7 tết mậu thân năm 1968 đã diễn ra sự kiện gì ở miền nam nước ta 2023

5 tháng trước

Top 13 luyện từ và câu: dấu gạch ngang lớp 4 trang 45 2023

5 tháng trước

Top 6 trong mặt phẳng oxy ảnh của đường thẳng d 3x y 4=0 2023

5 tháng trước

Top 6 thử thách thần chết thuyết minh phần 2 2023

5 tháng trước

Top 4 vở bài tập tiếng việt lớp 3 tập 2 chính tả trang 15 2023

5 tháng trước

Top 5 áo khoác nam quảng châu cao cấp 2023

5 tháng trước

Top 4 nội dung nào sau đây không phải là trách nhiệm của đơn vị đầu mối cung cấp thông tin 2023

5 tháng trước

Top 9 mẫu đồng phục công sở đẹp 2022 2023

5 tháng trước

Top 5 ốp lưng iphone 13 pro bảo vệ camera 2023

5 tháng trước

Python3

Python3

Python3

Bài Viết Liên Quan

Toplist mới

Bài mới nhất

Chủ Đề