赞
踩
最近在学习Python3爬虫,看了这本书《Python3网络爬虫开发实战》(并非打广告),看到了里面提到一个例子,爬取X眼电影的数据,今天试着自己实战一下。
主要是参考了以下资料:
1.书籍:《Python3网络爬虫开发实战》
如果侵权,请联系本人,谢谢 。
环境那些自行配置,废话少说,上代码。。。。。
开发工具:PyCharm
运行环境:Mac系统 Python3
- import requests
- import re
如果导入报错,先核实项目是否引入对应的包了,如图所示
- def get_douban_top(url):
- headers = {
- 'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_4) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/81.0.4044.138 Safari/537.36'
- }
-
- response = requests.get(url, headers=headers)
- if response.status_code == 200:
- return response.text
- return None
ps:请求头直接去网页复制即可,如图所示:
-
- def main():
- url = 'https://movie.douban.com/top250'
- html = get_douban_top(url)
-
- print(html)
-
-
- main()
完整代码:
- import requests
- import re
-
-
- def get_douban_top(url):
- headers = {
- 'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_4) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/81.0.4044.138 Safari/537.36'
- }
-
- response = requests.get(url, headers=headers)
- if response.status_code == 200:
- return response.text
- return None
-
-
- def main():
- url = 'https://movie.douban.com/top250'
- html = get_douban_top(url)
-
- print(html)
-
-
- main()
运行结果:
- /Users/wubaihua/PythonThreewDemo/bin/python /Users/wubaihua/Desktop/Dev/PythonProject/PythonThreewDemo/DouBanTop.py
- <!DOCTYPE html>
- <html lang="zh-CN" class="ua-mac ua-webkit">
- <head>
- <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
- <meta name="renderer" content="webkit">
- <meta name="referrer" content="always">
- <meta name="google-site-verification" content="ok0wCgT20tBBgo9_zat2iAcimtN4Ftf5ccsh092Xeyw" />
- <title>
- 豆瓣电影 Top 250
- </title>
-
- <meta name="baidu-site-verification" content="cZdR4xxR7RxmM4zE" />
- <meta http-equiv="Pragma" content="no-cache">
- <meta http-equiv="Expires" content="Sun, 6 Mar 2005 01:00:00 GMT">
-
- <link rel="apple-touch-icon" href="https://img3.doubanio.com/f/movie/d59b2715fdea4968a450ee5f6c95c7d7a2030065/pics/movie/apple-touch-icon.png">
- <link href="https://img3.doubanio.com/f/shire/3e5dfc68b0f376484c50cf08a58bbca3700911dc/css/douban.css" rel="stylesheet" type="text/css">
- <link href="https://img3.doubanio.com/f/shire/ae3f5a3e3085968370b1fc63afcecb22d3284848/css/separation/_all.css" rel="stylesheet" type="text/css">
- <link href="https://img3.doubanio.com/f/movie/8864d3756094f5272d3c93e30ee2e324665855b0/css/movie/base/init.css" rel="stylesheet">
- <script type="text/javascript">var _head_start = new Date();</script>
- <script type="text/javascript" src="https://img3.doubanio.com/f/movie/0495cb173e298c28593766009c7b0a953246c5b5/js/movie/lib/jquery.js"></script>
- <script type="text/javascript" src="https://img3.doubanio.com/f/shire/5ecaf46d6954d5a30bc7d99be86ae34031646e00/js/douban.js"></script>
- <script type="text/javascript" src="https://img3.doubanio.com/f/shire/0efdc63b77f895eaf85281fb0e44d435c6239a3f/js/separation/_all.js"></script>
-
- <link href="https://img3.doubanio.com/f/movie/2c95f768ea74284b900c04c0209b0a44f0a0de52/css/movie/top_movies.css" rel="stylesheet" type="text/css" />
- <script type="text/javascript" src="https://img3.doubanio.com/f/shire/77323ae72a612bba8b65f845491513ff3329b1bb/js/do.js" data-cfg-autoload="false"></script>
- <script type='text/javascript'>
- Do.ready(function(){
- $("#mine-selector input[type='checkbox']").click(function(){
- var val = $(this).is(":checked")?$(this).val():"";
- window.location.href = '/top250?filter=' + val;
- })
- })
- </script>
-
- <style type="text/css">
- .site-nav-logo img{margin-bottom:0;}
- </style>
- <style type="text/css">img { max-width: 100%; }</style>
- <script type="text/javascript"></script>
- <link rel="stylesheet" href="https://img3.doubanio.com/misc/mixed_static/562925b5e3824700.css">
-
- <link rel="shortcut icon" href="https://img3.doubanio.com/favicon.ico" type="image/x-icon">
- </head>
-
- <body>
-
- <script type="text/javascript">var _body_start = new Date();</script>
-
-
-
-
-
-
- <link href="//img3.doubanio.com/dae/accounts/resources/1f8d15e/shire/bundle.css" rel="stylesheet" type="text/css">
-
-
-
- <div id="db-global-nav" class="global-nav">
- <div class="bd">
-
- <div class="top-nav-info">
- <a href="https://accounts.douban.com/passport/login?source=movie" class="nav-login" rel="nofollow">登录/注册</a>
- </div>
-
-
- <div class="top-nav-doubanapp">
- <a href="https://www.douban.com/doubanapp/app?channel=top-nav" class="lnk-doubanapp">下载豆瓣客户端</a>
- <div id="doubanapp-tip">
- <a href="https://www.douban.com/doubanapp/app?channel=qipao" class="tip-link">豆瓣 <span class="version">6.0</span> 全新发布</a>
- <a href="javascript: void 0;" class="tip-close">×</a>
- </div>
- <div id="top-nav-appintro" class="more-items">
- <p class="appintro-title">豆瓣</p>
- <p class="qrcode">扫码直接下载</p>
- <div class="download">
- <a href="https://www.douban.com/doubanapp/redirect?channel=top-nav&direct_dl=1&download=iOS">iPhone</a>
- <span>·</span>
- <a href="https://www.douban.com/doubanapp/redirect?channel=top-nav&direct_dl=1&download=Android" class="download-android">Android</a>
- </div>
- </div>
- </div>
-
-
-
-
- <div class="global-nav-items">
- <ul>
- <li class="">
- <a href="https://www.douban.com" target="_blank" data-moreurl-dict="{"from":"top-nav-click-main","uid":"0"}">豆瓣</a>
- </li>
- <li class="">
- <a href="https://book.douban.com" target="_blank" data-moreurl-dict="{"from":"top-nav-click-book","uid":"0"}">读书</a>
- </li>
- <li class="on">
- <a href="https://movie.douban.com" data-moreurl-dict="{"from":"top-nav-click-movie","uid":"0"}">电影</a>
- </li>
- <li class="">
- <a href="https://music.douban.com" target="_blank" data-moreurl-dict="{"from":"top-nav-click-music","uid":"0"}">音乐</a>
- </li>
- <li class="">
- <a href="https://www.douban.com/location" target="_blank" data-moreurl-dict="{"from":"top-nav-click-location","uid":"0"}">同城</a>
- </li>
- <li class="">
- <a href="https://www.douban.com/group" target="_blank" data-moreurl-dict="{"from":"top-nav-click-group","uid":"0"}">小组</a>
- </li>
- <li class="">
- <a href="https://read.douban.com/?dcs=top-nav&dcm=douban" target="_blank" data-moreurl-dict="{"from":"top-nav-click-read","uid":"0"}">阅读</a>
- </li>
- <li class="">
- <a href="https://douban.fm/?from_=shire_top_nav" target="_blank" data-moreurl-dict="{"from":"top-nav-click-fm","uid":"0"}">FM</a>
- </li>
- <li class="">
- <a href="https://time.douban.com/?dt_time_source=douban-web_top_nav" target="_blank" data-moreurl-dict="{"from":"top-nav-click-time","uid":"0"}">时间</a>
- </li>
- <li class="">
- <a href="https://market.douban.com/?utm_campaign=douban_top_nav&utm_source=douban&utm_medium=pc_web" target="_blank" data-moreurl-dict="{"from":"top-nav-click-market","uid":"0"}">豆品</a>
- </li>
- </ul>
- </div>
-
- </div>
- </div>
- <script>
- ;window._GLOBAL_NAV = {
- DOUBAN_URL: "https://www.douban.com",
- N_NEW_NOTIS: 0,
- N_NEW_DOUMAIL: 0
- };
- </script>
-
-
-
- <script src="//img3.doubanio.com/dae/accounts/resources/1f8d15e/shire/bundle.js" defer="defer"></script>
-
-
-
-
-
-
-
-
- <link href="//img3.doubanio.com/dae/accounts/resources/1f8d15e/movie/bundle.css" rel="stylesheet" type="text/css">
-
-
-
-
- <div id="db-nav-movie" class="nav">
- <div class="nav-wrap">
- <div class="nav-primary">
- <div class="nav-logo">
- <a href="https://movie.douban.com">豆瓣电影</a>
- </div>
- <div class="nav-search">
- <form action="https://search.douban.com/movie/subject_search" method="get">
- <fieldset>
- <legend>搜索:</legend>
- <label for="inp-query">
- </label>
- <div class="inp"><input id="inp-query" name="search_text" size="22" maxlength="60" placeholder="搜索电影、电视剧、综艺、影人" value=""></div>
- <div class="inp-btn"><input type="submit" value="搜索"></div>
- <input type="hidden" name="cat" value="1002" />
- </fieldset>
- </form>
- </div>
- </div>
- </div>
- <div class="nav-secondary">
-
-
- <div class="nav-items">
- <ul>
- <li ><a href="https://movie.douban.com/cinema/nowplaying/"
- >影讯&购票</a>
- </li>
- <li ><a href="https://movie.douban.com/explore"
- >选电影</a>
- </li>
- <li ><a href="https://movie.douban.com/tv/"
- >电视剧</a>
- </li>
- <li ><a href="https://movie.douban.com/chart"
- >排行榜</a>
- </li>
- <li ><a href="https://movie.douban.com/tag/"
- >分类</a>
- </li>
- <li ><a href="https://movie.douban.com/review/best/"
- >影评</a>
- </li>
- <li ><a href="https://movie.douban.com/annual/2019?source=navigation"
- >2019年度榜单</a>
- </li>
- <li ><a href="https://m.douban.com/standbyme/annual2019?source=navigation"
- target="_blank"
- >2019书影音报告</a>
- </li>
- </ul>
- </div>
-
- <a href="https://movie.douban.com/annual/2019?source=movie_navigation" class="movieannual"></a>
- </div>
- </div>
-
- <script id="suggResult" type="text/x-jquery-tmpl">
- <li data-link="{{= url}}">
- <a href="{{= url}}" onclick="moreurl(this, {from:'movie_search_sugg', query:'{{= keyword }}', subject_id:'{{= id}}', i: '{{= index}}', type: '{{= type}}'})">
- <img src="{{= img}}" width="40" />
- <p>
- <em>{{= title}}</em>
- {{if year}}
- <span>{{= year}}</span>
- {{/if}}
- {{if sub_title}}
- <br /><span>{{= sub_title}}</span>
- {{/if}}
- {{if address}}
- <br /><span>{{= address}}</span>
- {{/if}}
- {{if episode}}
- {{if episode=="unknow"}}
- <br /><span>集数未知</span>
- {{else}}
- <br /><span>共{{= episode}}集</span>
- {{/if}}
- {{/if}}
- </p>
- </a>
- </li>
- </script>
-
-
-
-
- <script src="//img3.doubanio.com/dae/accounts/resources/1f8d15e/movie/bundle.js" defer="defer"></script>
-
-
-
-
-
-
- <div id="wrapper">
-
-
-
- <div id="content">
-
- <h1>豆瓣电影 Top 250</h1>
-
- <div class="grid-16-8 clearfix">
-
-
- <div class="article">
-
-
-
-
-
-
-
-
- <div class="opt mod">
- <div class="tabs">
-
-
-
- </div>
- <span id="mine-selector">
- <input type="checkbox" value="unwatched">我没看过的
- </span>
- </div>
-
-
-
- <ol class="grid_view">
- <li>
- <div class="item">
- <div class="pic">
- <em class="">1</em>
- <a href="https://movie.douban.com/subject/1292052/">
- <img width="100" alt="肖申克的救赎" src="https://img3.doubanio.com/view/photo/s_ratio_poster/public/p480747492.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/1292052/" class="">
- <span class="title">肖申克的救赎</span>
- <span class="title"> / The Shawshank Redemption</span>
- <span class="other"> / 月黑高飞(港) / 刺激1995(台)</span>
- </a>
-
-
- <span class="playable">[可播放]</span>
- </div>
- <div class="bd">
- <p class="">
- 导演: 弗兰克·德拉邦特 Frank Darabont 主演: 蒂姆·罗宾斯 Tim Robbins /...<br>
- 1994 / 美国 / 犯罪 剧情
- </p>
-
-
- <div class="star">
- <span class="rating5-t"></span>
- <span class="rating_num" property="v:average">9.7</span>
- <span property="v:best" content="10.0"></span>
- <span>2028164人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">希望让人自由。</span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">2</em>
- <a href="https://movie.douban.com/subject/1291546/">
- <img width="100" alt="霸王别姬" src="https://img3.doubanio.com/view/photo/s_ratio_poster/public/p2561716440.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/1291546/" class="">
- <span class="title">霸王别姬</span>
- <span class="other"> / 再见,我的妾 / Farewell My Concubine</span>
- </a>
-
-
- <span class="playable">[可播放]</span>
- </div>
- <div class="bd">
- <p class="">
- 导演: 陈凯歌 Kaige Chen 主演: 张国荣 Leslie Cheung / 张丰毅 Fengyi Zha...<br>
- 1993 / 中国大陆 中国香港 / 剧情 爱情 同性
- </p>
-
-
- <div class="star">
- <span class="rating5-t"></span>
- <span class="rating_num" property="v:average">9.6</span>
- <span property="v:best" content="10.0"></span>
- <span>1502834人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">风华绝代。</span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">3</em>
- <a href="https://movie.douban.com/subject/1292720/">
- <img width="100" alt="阿甘正传" src="https://img9.doubanio.com/view/photo/s_ratio_poster/public/p1484728154.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/1292720/" class="">
- <span class="title">阿甘正传</span>
- <span class="title"> / Forrest Gump</span>
- <span class="other"> / 福雷斯特·冈普</span>
- </a>
-
-
- <span class="playable">[可播放]</span>
- </div>
- <div class="bd">
- <p class="">
- 导演: 罗伯特·泽米吉斯 Robert Zemeckis 主演: 汤姆·汉克斯 Tom Hanks / ...<br>
- 1994 / 美国 / 剧情 爱情
- </p>
-
-
- <div class="star">
- <span class="rating5-t"></span>
- <span class="rating_num" property="v:average">9.5</span>
- <span property="v:best" content="10.0"></span>
- <span>1535358人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">一部美国近现代史。</span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">4</em>
- <a href="https://movie.douban.com/subject/1295644/">
- <img width="100" alt="这个杀手不太冷" src="https://img3.doubanio.com/view/photo/s_ratio_poster/public/p511118051.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/1295644/" class="">
- <span class="title">这个杀手不太冷</span>
- <span class="title"> / Léon</span>
- <span class="other"> / 杀手莱昂 / 终极追杀令(台)</span>
- </a>
-
-
- </div>
- <div class="bd">
- <p class="">
- 导演: 吕克·贝松 Luc Besson 主演: 让·雷诺 Jean Reno / 娜塔莉·波特曼 ...<br>
- 1994 / 法国 / 剧情 动作 犯罪
- </p>
-
-
- <div class="star">
- <span class="rating45-t"></span>
- <span class="rating_num" property="v:average">9.4</span>
- <span property="v:best" content="10.0"></span>
- <span>1727521人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">怪蜀黍和小萝莉不得不说的故事。</span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">5</em>
- <a href="https://movie.douban.com/subject/1292063/">
- <img width="100" alt="美丽人生" src="https://img3.doubanio.com/view/photo/s_ratio_poster/public/p2578474613.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/1292063/" class="">
- <span class="title">美丽人生</span>
- <span class="title"> / La vita è bella</span>
- <span class="other"> / 一个快乐的传说(港) / Life Is Beautiful</span>
- </a>
-
-
- <span class="playable">[可播放]</span>
- </div>
- <div class="bd">
- <p class="">
- 导演: 罗伯托·贝尼尼 Roberto Benigni 主演: 罗伯托·贝尼尼 Roberto Beni...<br>
- 1997 / 意大利 / 剧情 喜剧 爱情 战争
- </p>
-
-
- <div class="star">
- <span class="rating5-t"></span>
- <span class="rating_num" property="v:average">9.5</span>
- <span property="v:best" content="10.0"></span>
- <span>965268人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">最美的谎言。</span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">6</em>
- <a href="https://movie.douban.com/subject/1292722/">
- <img width="100" alt="泰坦尼克号" src="https://img9.doubanio.com/view/photo/s_ratio_poster/public/p457760035.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/1292722/" class="">
- <span class="title">泰坦尼克号</span>
- <span class="title"> / Titanic</span>
- <span class="other"> / 铁达尼号(港 / 台)</span>
- </a>
-
-
- <span class="playable">[可播放]</span>
- </div>
- <div class="bd">
- <p class="">
- 导演: 詹姆斯·卡梅隆 James Cameron 主演: 莱昂纳多·迪卡普里奥 Leonardo...<br>
- 1997 / 美国 / 剧情 爱情 灾难
- </p>
-
-
- <div class="star">
- <span class="rating45-t"></span>
- <span class="rating_num" property="v:average">9.4</span>
- <span property="v:best" content="10.0"></span>
- <span>1485880人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">失去的才是永恒的。 </span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">7</em>
- <a href="https://movie.douban.com/subject/1291561/">
- <img width="100" alt="千与千寻" src="https://img1.doubanio.com/view/photo/s_ratio_poster/public/p2557573348.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/1291561/" class="">
- <span class="title">千与千寻</span>
- <span class="title"> / 千と千尋の神隠し</span>
- <span class="other"> / 神隐少女(台) / 千与千寻的神隐</span>
- </a>
-
-
- <span class="playable">[可播放]</span>
- </div>
- <div class="bd">
- <p class="">
- 导演: 宫崎骏 Hayao Miyazaki 主演: 柊瑠美 Rumi Hîragi / 入野自由 Miy...<br>
- 2001 / 日本 / 剧情 动画 奇幻
- </p>
-
-
- <div class="star">
- <span class="rating45-t"></span>
- <span class="rating_num" property="v:average">9.4</span>
- <span property="v:best" content="10.0"></span>
- <span>1589709人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">最好的宫崎骏,最好的久石让。 </span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">8</em>
- <a href="https://movie.douban.com/subject/1295124/">
- <img width="100" alt="辛德勒的名单" src="https://img3.doubanio.com/view/photo/s_ratio_poster/public/p492406163.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/1295124/" class="">
- <span class="title">辛德勒的名单</span>
- <span class="title"> / Schindler's List</span>
- <span class="other"> / 舒特拉的名单(港) / 辛德勒名单</span>
- </a>
-
-
- <span class="playable">[可播放]</span>
- </div>
- <div class="bd">
- <p class="">
- 导演: 史蒂文·斯皮尔伯格 Steven Spielberg 主演: 连姆·尼森 Liam Neeson...<br>
- 1993 / 美国 / 剧情 历史 战争
- </p>
-
-
- <div class="star">
- <span class="rating5-t"></span>
- <span class="rating_num" property="v:average">9.5</span>
- <span property="v:best" content="10.0"></span>
- <span>782438人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">拯救一个人,就是拯救整个世界。</span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">9</em>
- <a href="https://movie.douban.com/subject/3541415/">
- <img width="100" alt="盗梦空间" src="https://img9.doubanio.com/view/photo/s_ratio_poster/public/p513344864.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/3541415/" class="">
- <span class="title">盗梦空间</span>
- <span class="title"> / Inception</span>
- <span class="other"> / 潜行凶间(港) / 全面启动(台)</span>
- </a>
-
-
- <span class="playable">[可播放]</span>
- </div>
- <div class="bd">
- <p class="">
- 导演: 克里斯托弗·诺兰 Christopher Nolan 主演: 莱昂纳多·迪卡普里奥 Le...<br>
- 2010 / 美国 英国 / 剧情 科幻 悬疑 冒险
- </p>
-
-
- <div class="star">
- <span class="rating45-t"></span>
- <span class="rating_num" property="v:average">9.3</span>
- <span property="v:best" content="10.0"></span>
- <span>1468131人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">诺兰给了我们一场无法盗取的梦。</span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">10</em>
- <a href="https://movie.douban.com/subject/3011091/">
- <img width="100" alt="忠犬八公的故事" src="https://img9.doubanio.com/view/photo/s_ratio_poster/public/p524964016.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/3011091/" class="">
- <span class="title">忠犬八公的故事</span>
- <span class="title"> / Hachi: A Dog's Tale</span>
- <span class="other"> / 忠犬小八(台) / 秋田犬八千(港)</span>
- </a>
-
-
- <span class="playable">[可播放]</span>
- </div>
- <div class="bd">
- <p class="">
- 导演: 莱塞·霍尔斯道姆 Lasse Hallström 主演: 理查·基尔 Richard Ger...<br>
- 2009 / 美国 英国 / 剧情
- </p>
-
-
- <div class="star">
- <span class="rating45-t"></span>
- <span class="rating_num" property="v:average">9.4</span>
- <span property="v:best" content="10.0"></span>
- <span>1019680人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">永远都不能忘记你所爱的人。</span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">11</em>
- <a href="https://movie.douban.com/subject/1292001/">
- <img width="100" alt="海上钢琴师" src="https://img9.doubanio.com/view/photo/s_ratio_poster/public/p2574551676.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/1292001/" class="">
- <span class="title">海上钢琴师</span>
- <span class="title"> / La leggenda del pianista sull'oceano</span>
- <span class="other"> / 声光伴我飞(港) / 一九零零的传奇</span>
- </a>
-
-
- <span class="playable">[可播放]</span>
- </div>
- <div class="bd">
- <p class="">
- 导演: 朱塞佩·托纳多雷 Giuseppe Tornatore 主演: 蒂姆·罗斯 Tim Roth / ...<br>
- 1998 / 意大利 / 剧情 音乐
- </p>
-
-
- <div class="star">
- <span class="rating45-t"></span>
- <span class="rating_num" property="v:average">9.3</span>
- <span property="v:best" content="10.0"></span>
- <span>1224081人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">每个人都要走一条自己坚定了的路,就算是粉身碎骨。 </span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">12</em>
- <a href="https://movie.douban.com/subject/1292064/">
- <img width="100" alt="楚门的世界" src="https://img3.doubanio.com/view/photo/s_ratio_poster/public/p479682972.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/1292064/" class="">
- <span class="title">楚门的世界</span>
- <span class="title"> / The Truman Show</span>
- <span class="other"> / 真人Show(港) / 真人戏</span>
- </a>
-
-
- <span class="playable">[可播放]</span>
- </div>
- <div class="bd">
- <p class="">
- 导演: 彼得·威尔 Peter Weir 主演: 金·凯瑞 Jim Carrey / 劳拉·琳妮 Lau...<br>
- 1998 / 美国 / 剧情 科幻
- </p>
-
-
- <div class="star">
- <span class="rating45-t"></span>
- <span class="rating_num" property="v:average">9.3</span>
- <span property="v:best" content="10.0"></span>
- <span>1092363人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">如果再也不能见到你,祝你早安,午安,晚安。</span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">13</em>
- <a href="https://movie.douban.com/subject/3793023/">
- <img width="100" alt="三傻大闹宝莱坞" src="https://img3.doubanio.com/view/photo/s_ratio_poster/public/p579729551.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/3793023/" class="">
- <span class="title">三傻大闹宝莱坞</span>
- <span class="title"> / 3 Idiots</span>
- <span class="other"> / 三个傻瓜(台) / 作死不离3兄弟(港)</span>
- </a>
-
-
- <span class="playable">[可播放]</span>
- </div>
- <div class="bd">
- <p class="">
- 导演: 拉库马·希拉尼 Rajkumar Hirani 主演: 阿米尔·汗 Aamir Khan / 卡...<br>
- 2009 / 印度 / 剧情 喜剧 爱情 歌舞
- </p>
-
-
- <div class="star">
- <span class="rating45-t"></span>
- <span class="rating_num" property="v:average">9.2</span>
- <span property="v:best" content="10.0"></span>
- <span>1360715人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">英俊版憨豆,高情商版谢耳朵。</span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">14</em>
- <a href="https://movie.douban.com/subject/2131459/">
- <img width="100" alt="机器人总动员" src="https://img3.doubanio.com/view/photo/s_ratio_poster/public/p1461851991.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/2131459/" class="">
- <span class="title">机器人总动员</span>
- <span class="title"> / WALL·E</span>
- <span class="other"> / 太空奇兵·威E(港) / 瓦力(台)</span>
- </a>
-
-
- <span class="playable">[可播放]</span>
- </div>
- <div class="bd">
- <p class="">
- 导演: 安德鲁·斯坦顿 Andrew Stanton 主演: 本·贝尔特 Ben Burtt / 艾丽...<br>
- 2008 / 美国 / 科幻 动画 冒险
- </p>
-
-
- <div class="star">
- <span class="rating45-t"></span>
- <span class="rating_num" property="v:average">9.3</span>
- <span property="v:best" content="10.0"></span>
- <span>967355人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">小瓦力,大人生。</span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">15</em>
- <a href="https://movie.douban.com/subject/1291549/">
- <img width="100" alt="放牛班的春天" src="https://img3.doubanio.com/view/photo/s_ratio_poster/public/p1910824951.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/1291549/" class="">
- <span class="title">放牛班的春天</span>
- <span class="title"> / Les choristes</span>
- <span class="other"> / 歌声伴我心(港) / 唱诗班男孩</span>
- </a>
-
-
- <span class="playable">[可播放]</span>
- </div>
- <div class="bd">
- <p class="">
- 导演: 克里斯托夫·巴拉蒂 Christophe Barratier 主演: 热拉尔·朱尼奥 Gé...<br>
- 2004 / 法国 瑞士 德国 / 剧情 音乐
- </p>
-
-
- <div class="star">
- <span class="rating45-t"></span>
- <span class="rating_num" property="v:average">9.3</span>
- <span property="v:best" content="10.0"></span>
- <span>946898人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">天籁一般的童声,是最接近上帝的存在。 </span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">16</em>
- <a href="https://movie.douban.com/subject/1889243/">
- <img width="100" alt="星际穿越" src="https://img3.doubanio.com/view/photo/s_ratio_poster/public/p2206088801.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/1889243/" class="">
- <span class="title">星际穿越</span>
- <span class="title"> / Interstellar</span>
- <span class="other"> / 星际启示录(港) / 星际效应(台)</span>
- </a>
-
-
- <span class="playable">[可播放]</span>
- </div>
- <div class="bd">
- <p class="">
- 导演: 克里斯托弗·诺兰 Christopher Nolan 主演: 马修·麦康纳 Matthew Mc...<br>
- 2014 / 美国 英国 加拿大 冰岛 / 剧情 科幻 冒险
- </p>
-
-
- <div class="star">
- <span class="rating45-t"></span>
- <span class="rating_num" property="v:average">9.3</span>
- <span property="v:best" content="10.0"></span>
- <span>1095565人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">爱是一种力量,让我们超越时空感知它的存在。</span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">17</em>
- <a href="https://movie.douban.com/subject/1292213/">
- <img width="100" alt="大话西游之大圣娶亲" src="https://img9.doubanio.com/view/photo/s_ratio_poster/public/p2455050536.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/1292213/" class="">
- <span class="title">大话西游之大圣娶亲</span>
- <span class="title"> / 西遊記大結局之仙履奇緣</span>
- <span class="other"> / 西游记完结篇仙履奇缘 / 齐天大圣西游记</span>
- </a>
-
-
- <span class="playable">[可播放]</span>
- </div>
- <div class="bd">
- <p class="">
- 导演: 刘镇伟 Jeffrey Lau 主演: 周星驰 Stephen Chow / 吴孟达 Man Tat Ng...<br>
- 1995 / 中国香港 中国大陆 / 喜剧 爱情 奇幻 古装
- </p>
-
-
- <div class="star">
- <span class="rating45-t"></span>
- <span class="rating_num" property="v:average">9.2</span>
- <span property="v:best" content="10.0"></span>
- <span>1074098人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">一生所爱。</span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">18</em>
- <a href="https://movie.douban.com/subject/5912992/">
- <img width="100" alt="熔炉" src="https://img9.doubanio.com/view/photo/s_ratio_poster/public/p1363250216.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/5912992/" class="">
- <span class="title">熔炉</span>
- <span class="title"> / 도가니</span>
- <span class="other"> / 无声呐喊(港) / 漩涡</span>
- </a>
-
-
- </div>
- <div class="bd">
- <p class="">
- 导演: 黄东赫 Dong-hyuk Hwang 主演: 孔侑 Yoo Gong / 郑有美 Yu-mi Jung /...<br>
- 2011 / 韩国 / 剧情
- </p>
-
-
- <div class="star">
- <span class="rating45-t"></span>
- <span class="rating_num" property="v:average">9.3</span>
- <span property="v:best" content="10.0"></span>
- <span>661701人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">我们一路奋战不是为了改变世界,而是为了不让世界改变我们。</span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">19</em>
- <a href="https://movie.douban.com/subject/25662329/">
- <img width="100" alt="疯狂动物城" src="https://img1.doubanio.com/view/photo/s_ratio_poster/public/p2315672647.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/25662329/" class="">
- <span class="title">疯狂动物城</span>
- <span class="title"> / Zootopia</span>
- <span class="other"> / 优兽大都会(港) / 动物方城市(台)</span>
- </a>
-
-
- <span class="playable">[可播放]</span>
- </div>
- <div class="bd">
- <p class="">
- 导演: 拜伦·霍华德 Byron Howard / 瑞奇·摩尔 Rich Moore 主演: 金妮弗·...<br>
- 2016 / 美国 / 喜剧 动画 冒险
- </p>
-
-
- <div class="star">
- <span class="rating45-t"></span>
- <span class="rating_num" property="v:average">9.2</span>
- <span property="v:best" content="10.0"></span>
- <span>1281626人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">迪士尼给我们营造的乌托邦就是这样,永远善良勇敢,永远出乎意料。</span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">20</em>
- <a href="https://movie.douban.com/subject/1307914/">
- <img width="100" alt="无间道" src="https://img3.doubanio.com/view/photo/s_ratio_poster/public/p2564556863.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/1307914/" class="">
- <span class="title">无间道</span>
- <span class="title"> / 無間道</span>
- <span class="other"> / Infernal Affairs / Mou gaan dou</span>
- </a>
-
-
- <span class="playable">[可播放]</span>
- </div>
- <div class="bd">
- <p class="">
- 导演: 刘伟强 / 麦兆辉 主演: 刘德华 / 梁朝伟 / 黄秋生<br>
- 2002 / 中国香港 / 剧情 犯罪 悬疑
- </p>
-
-
- <div class="star">
- <span class="rating45-t"></span>
- <span class="rating_num" property="v:average">9.2</span>
- <span property="v:best" content="10.0"></span>
- <span>875223人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">香港电影史上永不过时的杰作。</span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">21</em>
- <a href="https://movie.douban.com/subject/1291560/">
- <img width="100" alt="龙猫" src="https://img9.doubanio.com/view/photo/s_ratio_poster/public/p2540924496.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/1291560/" class="">
- <span class="title">龙猫</span>
- <span class="title"> / となりのトトロ</span>
- <span class="other"> / 邻居托托罗 / 邻家的豆豆龙</span>
- </a>
-
-
- <span class="playable">[可播放]</span>
- </div>
- <div class="bd">
- <p class="">
- 导演: 宫崎骏 Hayao Miyazaki 主演: 日高法子 Noriko Hidaka / 坂本千夏 Ch...<br>
- 1988 / 日本 / 动画 奇幻 冒险
- </p>
-
-
- <div class="star">
- <span class="rating45-t"></span>
- <span class="rating_num" property="v:average">9.2</span>
- <span property="v:best" content="10.0"></span>
- <span>909974人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">人人心中都有个龙猫,童年就永远不会消失。</span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">22</em>
- <a href="https://movie.douban.com/subject/1291841/">
- <img width="100" alt="教父" src="https://img9.doubanio.com/view/photo/s_ratio_poster/public/p616779645.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/1291841/" class="">
- <span class="title">教父</span>
- <span class="title"> / The Godfather</span>
- <span class="other"> / Mario Puzo's The Godfather</span>
- </a>
-
-
- <span class="playable">[可播放]</span>
- </div>
- <div class="bd">
- <p class="">
- 导演: 弗朗西斯·福特·科波拉 Francis Ford Coppola 主演: 马龙·白兰度 M...<br>
- 1972 / 美国 / 剧情 犯罪
- </p>
-
-
- <div class="star">
- <span class="rating45-t"></span>
- <span class="rating_num" property="v:average">9.3</span>
- <span property="v:best" content="10.0"></span>
- <span>665009人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">千万不要记恨你的对手,这样会让你失去理智。</span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">23</em>
- <a href="https://movie.douban.com/subject/1849031/">
- <img width="100" alt="当幸福来敲门" src="https://img1.doubanio.com/view/photo/s_ratio_poster/public/p1312700628.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/1849031/" class="">
- <span class="title">当幸福来敲门</span>
- <span class="title"> / The Pursuit of Happyness</span>
- <span class="other"> / 寻找快乐的故事(港) / 追求快乐</span>
- </a>
-
-
- <span class="playable">[可播放]</span>
- </div>
- <div class="bd">
- <p class="">
- 导演: 加布里尔·穆奇诺 Gabriele Muccino 主演: 威尔·史密斯 Will Smith ...<br>
- 2006 / 美国 / 剧情 传记 家庭
- </p>
-
-
- <div class="star">
- <span class="rating45-t"></span>
- <span class="rating_num" property="v:average">9.1</span>
- <span property="v:best" content="10.0"></span>
- <span>1086829人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">平民励志片。 </span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">24</em>
- <a href="https://movie.douban.com/subject/3319755/">
- <img width="100" alt="怦然心动" src="https://img1.doubanio.com/view/photo/s_ratio_poster/public/p501177648.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/3319755/" class="">
- <span class="title">怦然心动</span>
- <span class="title"> / Flipped</span>
- <span class="other"> / 萌动青春 / 青春萌动</span>
- </a>
-
-
- <span class="playable">[可播放]</span>
- </div>
- <div class="bd">
- <p class="">
- 导演: 罗伯·莱纳 Rob Reiner 主演: 玛德琳·卡罗尔 Madeline Carroll / 卡...<br>
- 2010 / 美国 / 剧情 喜剧 爱情
- </p>
-
-
- <div class="star">
- <span class="rating45-t"></span>
- <span class="rating_num" property="v:average">9.1</span>
- <span property="v:best" content="10.0"></span>
- <span>1266040人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">真正的幸福是来自内心深处。</span>
- </p>
- </div>
- </div>
- </div>
- </li>
- <li>
- <div class="item">
- <div class="pic">
- <em class="">25</em>
- <a href="https://movie.douban.com/subject/6786002/">
- <img width="100" alt="触不可及" src="https://img9.doubanio.com/view/photo/s_ratio_poster/public/p1454261925.jpg" class="">
- </a>
- </div>
- <div class="info">
- <div class="hd">
- <a href="https://movie.douban.com/subject/6786002/" class="">
- <span class="title">触不可及</span>
- <span class="title"> / Intouchables</span>
- <span class="other"> / 闪亮人生(港) / 逆转人生(台)</span>
- </a>
-
-
- </div>
- <div class="bd">
- <p class="">
- 导演: 奥利维·那卡什 Olivier Nakache / 艾力克·托兰达 Eric Toledano 主...<br>
- 2011 / 法国 / 剧情 喜剧
- </p>
-
-
- <div class="star">
- <span class="rating45-t"></span>
- <span class="rating_num" property="v:average">9.2</span>
- <span property="v:best" content="10.0"></span>
- <span>710625人评价</span>
- </div>
-
- <p class="quote">
- <span class="inq">满满温情的高雅喜剧。</span>
- </p>
- </div>
- </div>
- </div>
- </li>
- </ol>
-
-
-
-
-
-
-
- <div class="paginator">
- <span class="prev">
- <前页
- </span>
-
-
-
- <span class="thispage">1</span>
-
- <a href="?start=25&filter=" >2</a>
-
-
- <a href="?start=50&filter=" >3</a>
-
-
- <a href="?start=75&filter=" >4</a>
-
-
- <a href="?start=100&filter=" >5</a>
-
-
- <a href="?start=125&filter=" >6</a>
-
-
- <a href="?start=150&filter=" >7</a>
-
-
- <a href="?start=175&filter=" >8</a>
-
-
- <a href="?start=200&filter=" >9</a>
-
-
- <a href="?start=225&filter=" >10</a>
-
- <span class="next">
- <link rel="next" href="?start=25&filter="/>
- <a href="?start=25&filter=" >后页></a>
- </span>
-
- <span class="count">(共250条)</span>
- </div>
-
-
-
- </div>
- <div class="aside">
-
- <p class="pl">
- 豆瓣用户每天都在对“看过”的电影进行“很差”到“力荐”的评价,豆瓣根据每部影片看过的人数以及该影片所得的评价等综合数据,通过算法分析产生豆瓣电影 Top 250。
- </p>
-
- <div id="dale_movie_top250_bottom_right"></div>
-
- <!-- douban ad begin -->
-
-
-
-
-
-
- <div class="mobile-app-entrance block5 app-movie">
- <a class="entrance-link" href="https://www.douban.com/doubanapp/frodo">
- <div class="entrance-qrcode">
- <img src="https://img3.doubanio.com/f/movie/a02f6ed325fc52e220f299d51e730c422e2bcd16/pics/movie/douban_app_ad/qrcode.png" alt="扫码下载豆瓣 App" width="80" height="80" />
- </div>
- <div class="entrance-info">
- <span class="app-icon icon-movie"></span>
- <span class="main-title">豆瓣</span>
- <span class="sub-title">让好电影来找你</span>
- </div>
- </a>
- </div>
-
- <!-- douban ad end -->
-
-
- </div>
- <div class="extra">
-
- </div>
- </div>
- </div>
-
-
- <div id="footer">
- <div class="footer-extra"></div>
-
- <span id="icp" class="fleft gray-link">
- © 2005-2020 douban.com, all rights reserved 北京豆网科技有限公司
- </span>
-
- <a href="https://www.douban.com/hnypt/variformcyst.py" style="display: none;"></a>
-
- <span class="fright">
- <a href="https://www.douban.com/about">关于豆瓣</a>
- · <a href="https://www.douban.com/jobs">在豆瓣工作</a>
- · <a href="https://www.douban.com/about?topic=contactus">联系我们</a>
- · <a href="https://www.douban.com/about/legal">法律声明</a>
-
- · <a href="https://help.douban.com/?app=movie" target="_blank">帮助中心</a>
- · <a href="https://www.douban.com/doubanapp/">移动应用</a>
- · <a href="https://www.douban.com/partner/">豆瓣广告</a>
- </span>
-
- </div>
-
- </div>
- <!-- COLLECTED JS -->
-
-
- <link rel="stylesheet" type="text/css" href="https://img3.doubanio.com/f/shire/8377b9498330a2e6f056d863987cc7a37eb4d486/css/ui/dialog.css" />
- <link rel="stylesheet" type="text/css" href="https://img3.doubanio.com/f/movie/4aca95d66d37ec0712b3d19973b5d8feb75f2f05/css/movie/mod/reg_login_pop.css" />
- <script type="text/javascript" src="https://img3.doubanio.com/f/shire/77323ae72a612bba8b65f845491513ff3329b1bb/js/do.js" data-cfg-autoload="false"></script>
- <script type="text/javascript" src="https://img3.doubanio.com/f/shire/383a6e43f2108dc69e3ff2681bc4dc6c72a5ffb0/js/ui/dialog.js"></script>
- <script type="text/javascript">
- var HTTPS_DB='https://www.douban.com';
- var account_pop={open:function(o,e){e?referrer="?referrer="+encodeURIComponent(e):referrer="?referrer="+window.location.href;var n="",i="",t=448;n="用户登录",i="https://accounts.douban.com/passport/login_popup?source=movie";var r=document.location.protocol+"//"+document.location.hostname,a=dui.Dialog({width:340,title:n,height:t,cls:"account_pop",isHideTitle:!0,modal:!0,content:"<iframe scrolling='no' frameborder='0' width='340' height='"+t+"' src='"+i+"' name='"+r+"'></iframe>"},!0),c=a.node;if(c.undelegate(),c.delegate(".dui-dialog-close","click",function(){var o=$("body");o.find("#login_msk").hide(),o.find(".account_pop").remove()}),$(window).width()<478){var d="";"reg"===o?d=HTTPS_DB+"/accounts/register"+referrer:"login"===o&&(d=HTTPS_DB+"/accounts/login"+referrer),window.location.href=d}else a.open();$(window).bind("message",function(o){"https://accounts.douban.com"===o.originalEvent.origin&&(c.find("iframe").css("height",o.originalEvent.data),c.height(o.originalEvent.data),a.update())})}};Douban&&Douban.init_show_login&&(Douban.init_show_login=function(o){var e=$(o);e.click(function(){var o=e.data("ref")||"";return account_pop.open("login",o),!1})}),Do(function(){$("body").delegate(".pop_register","click",function(o){o.preventDefault();var e=$(this).data("ref")||"";return account_pop.open("reg",e),!1}),$("body").delegate(".pop_login","click",function(o){o.preventDefault();var e=$(this).data("ref")||"";return account_pop.open("login",e),!1})});
- </script>
-
-
-
-
-
-
-
-
- <script type="text/javascript">
- (function (global) {
- var newNode = global.document.createElement('script'),
- existingNode = global.document.getElementsByTagName('script')[0],
- adSource = '//erebor.douban.com/',
- userId = '',
- browserId = 'Pp8Rh7_QuUA',
- criteria = '3:/top250',
- preview = '',
- debug = false,
- adSlots = ['dale_movie_top250_bottom_right'];
-
- global.DoubanAdRequest = {src: adSource, uid: userId, bid: browserId, crtr: criteria, prv: preview, debug: debug};
- global.DoubanAdSlots = (global.DoubanAdSlots || []).concat(adSlots);
-
- newNode.setAttribute('type', 'text/javascript');
- newNode.setAttribute('src', '//img1.doubanio.com/dnRiY2ZoMy9mL2FkanMvNTRjNWIyYmFlNjFkZTJlYTBlN2VmZDk1ODc2MGY3ZTk2OGZjNmQwNS9hZC5yZWxlYXNlLmpz');
- newNode.setAttribute('async', true);
- existingNode.parentNode.insertBefore(newNode, existingNode);
- })(this);
- </script>
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
- <script type="text/javascript">
- var _paq = _paq || [];
- _paq.push(['trackPageView']);
- _paq.push(['enableLinkTracking']);
- (function() {
- var p=(('https:' == document.location.protocol) ? 'https' : 'http'), u=p+'://fundin.douban.com/';
- _paq.push(['setTrackerUrl', u+'piwik']);
- _paq.push(['setSiteId', '100001']);
- var d=document, g=d.createElement('script'), s=d.getElementsByTagName('script')[0];
- g.type='text/javascript';
- g.defer=true;
- g.async=true;
- g.src=p+'://img3.doubanio.com/dae/fundin/piwik.js';
- s.parentNode.insertBefore(g,s);
- })();
- </script>
-
- <script type="text/javascript">
- var setMethodWithNs = function(namespace) {
- var ns = namespace ? namespace + '.' : ''
- , fn = function(string) {
- if(!ns) {return string}
- return ns + string
- }
- return fn
- }
-
- var gaWithNamespace = function(fn, namespace) {
- var method = setMethodWithNs(namespace)
- fn.call(this, method)
- }
-
- var _gaq = _gaq || []
- , accounts = [
- { id: 'UA-7019765-1', namespace: 'douban' }
- , { id: 'UA-7019765-19', namespace: '' }
- ]
- , gaInit = function(account) {
- gaWithNamespace(function(method) {
- gaInitFn.call(this, method, account)
- }, account.namespace)
- }
- , gaInitFn = function(method, account) {
- _gaq.push([method('_setAccount'), account.id]);
- _gaq.push([method('_setSampleRate'), '5']);
-
-
- _gaq.push([method('_addOrganic'), 'google', 'q'])
- _gaq.push([method('_addOrganic'), 'baidu', 'wd'])
- _gaq.push([method('_addOrganic'), 'soso', 'w'])
- _gaq.push([method('_addOrganic'), 'youdao', 'q'])
- _gaq.push([method('_addOrganic'), 'so.360.cn', 'q'])
- _gaq.push([method('_addOrganic'), 'sogou', 'query'])
- if (account.namespace) {
- _gaq.push([method('_addIgnoredOrganic'), '豆瓣'])
- _gaq.push([method('_addIgnoredOrganic'), 'douban'])
- _gaq.push([method('_addIgnoredOrganic'), '豆瓣网'])
- _gaq.push([method('_addIgnoredOrganic'), 'www.douban.com'])
- }
-
- if (account.namespace === 'douban') {
- _gaq.push([method('_setDomainName'), '.douban.com'])
- }
-
- _gaq.push([method('_setCustomVar'), 1, 'responsive_view_mode', 'desktop', 3])
-
- _gaq.push([method('_setCustomVar'), 2, 'login_status', '0', 2]);
-
- _gaq.push([method('_trackPageview')])
- }
-
- for(var i = 0, l = accounts.length; i < l; i++) {
- var account = accounts[i]
- gaInit(account)
- }
-
-
- ;(function() {
- var ga = document.createElement('script');
- ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 'http://www') + '.google-analytics.com/ga.js';
- ga.setAttribute('async', 'true');
- document.documentElement.firstChild.appendChild(ga);
- })()
- </script>
-
-
-
-
-
-
-
-
-
-
-
- <!-- dae-web-movie--default-b59d5cf95-vzn86-->
-
- <script>_SPLITTEST=''</script>
- </body>
-
- </html>
-
-
-
-
- Process finished with exit code 0
由此可以 这是直接抓取了整个页面,很明显这个不是我们想要的结果,接下来我们对此进行优化
4.正则表达式解析html
- def parse_douban_top(html):
- regex = '<em class="">(\d+)</em>.*?<span class="title">(.*?)</span>.*?<p class="">(.*?)</p>.*?<span class="rating_num" property="v:average">(.*?)</span>.*?<span>(.*?)</span>.*?<span class="inq">(.*?)</span>'
-
- jsonResult = re.compile(
- regex, re.S
- )
-
- items = re.findall(jsonResult, html)
-
- for item in items:
-
- content = ""
-
- for every_list in item[2].split():
- # split()通过指定分隔符对字符串进行切片,因为职员表处有些标签需要我们进行处理
- content = content + "".join(every_list)
- content = re.sub(' ', ' ', content)
- content = re.sub('<br>', ' ', content)
-
-
- yield {
- "index": item[0],
- "name": item[1],
- "describe": content,
- "star": item[3],
- "evaluate": item[4],
- "title": item[5]
- }
其中
<em class="">(\d+)</em>.*?<span class="title">(.*?)</span>.*?<p class="">(.*?)</p>.*?<span class="rating_num" property="v:average">(.*?)</span>.*?<span>(.*?)</span>.*?<span class="inq">(.*?)</span>
这个是针对豆瓣进行内容进行解析
完整代码:
- import requests
- import re
-
-
- def get_douban_top(url):
- headers = {
- 'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_4) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/81.0.4044.138 Safari/537.36'
- }
-
- response = requests.get(url, headers=headers)
- if response.status_code == 200:
- return response.text
- return None
-
-
- def main():
- url = 'https://movie.douban.com/top250'
- html = get_douban_top(url)
-
- for item in parse_douban_top(html):
- print(item)
-
-
-
-
- def parse_douban_top(html):
- regex = '<em class="">(\d+)</em>.*?<span class="title">(.*?)</span>.*?<p class="">(.*?)</p>.*?<span class="rating_num" property="v:average">(.*?)</span>.*?<span>(.*?)</span>.*?<span class="inq">(.*?)</span>'
-
- jsonResult = re.compile(
- regex, re.S
- )
-
- items = re.findall(jsonResult, html)
-
- for item in items:
-
- content = ""
-
- for every_list in item[2].split():
- # split()通过指定分隔符对字符串进行切片,因为职员表处有些标签需要我们进行处理
- content = content + "".join(every_list)
- content = re.sub(' ', ' ', content)
- content = re.sub('<br>', ' ', content)
-
-
- yield {
- "index": item[0],
- "name": item[1],
- "describe": content,
- "star": item[3],
- "evaluate": item[4],
- "title": item[5]
- }
-
-
- main()
运行结果:
结束,希望能够帮助到像我这样的初学者。
Copyright © 2003-2013 www.wpsshop.cn 版权所有,并保留所有权利。