Sina Weibo: Analyzing and Scraping the Follow-up JSON Data

Author: 键盘狂人 | 2024-02-03 15:14:12

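This post scrapes the dynamic Weibo content that loads when you scroll down the page; the server returns it as JSON.
Each scroll has to send different request parameters to get a different page of results, and I have not bothered to analyze those parameters.
The URL parameters are determined by the parameters of the last post in the first page's data. That step is not integrated here: this is only a demonstration that assumes the parameters are already given, and the pieces still need to be combined into one program later.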
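As a minimal sketch of how the request URL could be assembled instead of hard-coded, the helper below rebuilds the sample URL used in this post from a parameter dictionary. The names `build_feed_url`, `DEFAULT_PARAMS`, `overrides` and `rnd_ms` are hypothetical. Which parameters actually have to change per page is not analyzed in this post, so the helper only lets the caller override any of them, and treating `__rnd` as a millisecond timestamp is an assumption based on the sample value.

```python
import time
from urllib.parse import urlencode

BASE_URL = "https://weibo.com/p/aj/v6/mblog/mbloglist"

# Query parameters copied from the sample URL in this post, in the same order.
DEFAULT_PARAMS = {
    "ajwvr": "6",
    "domain": "100505",
    "refer_flag": "0000015010_",
    "from": "feed",
    "loc": "avatar",
    "is_all": "1",
    "pagebar": "0",
    "pl_name": "Pl_Official_MyProfileFeed__21",
    "id": "1005051892059383",
    "script_uri": "/yishuwuyu",
    "feed_type": "0",
    "page": "1",
    "pre_page": "1",
    "domain_op": "100505",
}


def build_feed_url(overrides=None, rnd_ms=None):
    """Return the feed URL; `overrides` replaces any default parameter."""
    params = dict(DEFAULT_PARAMS)
    params.update(overrides or {})
    if rnd_ms is None:
        # Assumption: __rnd is the current time in milliseconds.
        rnd_ms = int(round(time.time() * 1000))
    params["__rnd"] = str(rnd_ms)
    # safe="/" keeps the slash in script_uri unescaped, as in the sample URL.
    return BASE_URL + "?" + urlencode(params, safe="/")
```

Calling `build_feed_url(rnd_ms=1564487703911)` reproduces the sample URL, and `build_feed_url({"page": "2"})` shows how a different value would be passed once the real per-page parameters are known.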

#!/usr/bin/env python3
# -*- coding: UTF-8 -*-

# Fetch the Sina Weibo posts that load after scrolling down; the endpoint returns JSON.
import time

import requests
from bs4 import BeautifulSoup

import Auth as head  # the author's own helper module (not shown in this post)


def getnextinfo():
    # Current time in milliseconds, printed for reference only;
    # the URL below still carries a hard-coded __rnd value.
    now_ms = int(round(time.time() * 1000))
    print(now_ms)
    url = 'https://weibo.com/p/aj/v6/mblog/mbloglist?ajwvr=6&domain=100505&refer_flag=0000015010_&from=feed&loc=avatar&is_all=1&pagebar=0&pl_name=Pl_Official_MyProfileFeed__21&id=1005051892059383&script_uri=/yishuwuyu&feed_type=0&page=1&pre_page=1&domain_op=100505&__rnd=1564487703911'

    header = head.getheader()

    response = requests.get(url, headers=header)
    payload = response.json()
    # The 'data' field holds an HTML fragment containing the posts.
    table = payload['data']
    # print(payload['data'])
    print('1' * 55)
    # 'table' is already a str, so from_encoding is not needed (bs4 would ignore it with a warning).
    soup = BeautifulSoup(table, 'html.parser')
    # Walk the children of the first <div> and look for the post text in each one.
    for child in soup.div.children:
        item = BeautifulSoup(str(child), 'html.parser')
        text_div = item.find('div', class_='WB_text W_f14')
        if text_div:
            print(text_div.text)
            print('x' * 48)


if __name__ == '__main__':
    head.getmianinfo()  # defined in the Auth module (not shown in this post)
    getnextinfo()
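The parsing step can also be tried without a network request or login headers. The sketch below applies the same traversal (children of the first `div`, then the `WB_text W_f14` block inside each) to a small hand-written fragment. `SAMPLE_HTML`, the `WB_cardwrap` wrapper class and the function name `extract_post_texts` are made up for illustration; the real `data` field may be structured differently.

```python
from bs4 import BeautifulSoup
from bs4.element import Tag

# Hypothetical stand-in for the HTML fragment found in the JSON 'data' field.
SAMPLE_HTML = """
<div>
  <div class="WB_cardwrap"><div class="WB_text W_f14"> first post </div></div>
  <div class="WB_cardwrap"><div class="WB_text">not a main post text</div></div>
  <div class="WB_cardwrap"><div class="WB_text W_f14">second <a href="#">post</a></div></div>
</div>
"""


def extract_post_texts(fragment):
    """Return the text of every 'WB_text W_f14' block under the first <div>."""
    soup = BeautifulSoup(fragment, "html.parser")
    if soup.div is None:
        return []
    texts = []
    for child in soup.div.children:
        if not isinstance(child, Tag):
            continue  # skip the whitespace strings between tags
        # A class_ value containing a space is matched against the exact class string.
        text_div = child.find("div", class_="WB_text W_f14")
        if text_div is not None:
            texts.append(text_div.get_text(" ", strip=True))
    return texts


if __name__ == "__main__":
    for text in extract_post_texts(SAMPLE_HTML):
        print(text)
```

Keeping the parsing in a function that takes a string means it can be checked against saved responses before it is wired to `requests.get`.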
