한빛미디어 파이썬 증권데이터 분석 4장 수정

IT/python

한빛미디어 파이썬 증권데이터 분석 4장 수정

fraha 2021. 2. 19. 15:52

책에 나오대로 아래 처럼하면

import pandas as pd
from urllib.request import urlopen
from bs4 import BeautifulSoup
from matplotlib import pyplot as plt

url = 'https://finance.naver.com/item/sise_day.nhn?code=068270&page=1'
with urlopen(url) as doc:
    html = BeautifulSoup(doc, 'lxml') 
    pgrr = html.find('td', class_='pgRR')
    s = str(pgrr.a['href']).split('=')
    last_page = s[-1]

이렇게 나온다.

'NoneType' object has no attribute 'a'

네이버에서 무분별한 스크래핑을 막기 위해

패킷헤더에 브라우저 정보가 없으면 접근을 차단하고 있다.

따라서 아래처럼 해 주면 된다.

import pandas as pd
from matplotlib import pyplot as plt
from bs4 import BeautifulSoup
from urllib.request import Request, urlopen

url = 'https://finance.naver.com/item/sise_day.nhn?code=068270&page=1'
req = Request(url, headers={'User-Agent': 'Mozilla/5.0'})
with urlopen(req) as doc:
    html = BeautifulSoup(doc, 'lxml')
    pgrr = html.find('td', class_='pgRR')
    s = str(pgrr.a['href']).split('=')
    last_page = s[-1]
print(last_page)

현재글한빛미디어 파이썬 증권데이터 분석 4장 수정

잡다한 일상 fraha 님의 블로그입니다.

델마당, IR Controller, 해피퍼피, 엑시언트 가격, aligo, 알리고, SSMS, 현대 엑시언트프로, RJ45, 채사장, 쿠팡정산보류, POS프린터, 희옷표백, Delphi7, 이거 없었으면 에어컨 새로 살뻔, 델파이, happypuppy, delphi, 18톤, IR통합리모컨,

250x250

Today :
Yesterday :

일	월	화	수	목	금	토
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30

내 블로그 - 관리자 홈 전환	`Q` `Q`
새 글 쓰기	`W` `W`

글 수정 (권한 있는 경우)	`E` `E`
댓글 영역으로 이동	`C` `C`

이 페이지의 URL 복사	`S` `S`
맨 위로 이동	`T` `T`
티스토리 홈 이동	`H` `H`
단축키 안내	`Shift` + `/` `⇧` + `/`

잡다한 일상

한빛미디어 파이썬 증권데이터 분석 4장 수정

'IT/python'의 다른글

티스토리툴바

단축키

내 블로그

블로그 게시글

모든 영역