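In the raw arXiv dataset, the authors field of each paper is a single string in which the authors are separated by commas, so we first need to split it as follows:
'''
C. Bal\\'azs, E. L. Berger, P. M. Nadolsky, C.-P. Yuan
# is split into the following, where \\ is the escape character
C. Bal'azs
E. L. Berger
P. M. Nadolsky
C.-P. Yuan
'''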
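The split described above can be sketched in a few lines. This is a minimal illustration, not the tutorial's own code: `split_authors` is a hypothetical helper, and it assumes that names are separated by commas and/or the word "and", which matches rows such as "Tanaya Guha and Rabab K. Ward" in this dataset. Affiliations in parentheses that contain commas would not be handled.

```python
import re
from typing import List


def split_authors(authors: str) -> List[str]:
    """Split a raw arXiv `authors` string into individual names.

    Assumption: names are separated by commas and/or the word "and".
    """
    # Collapse line breaks and repeated spaces first
    flat = " ".join(authors.split())
    # Split on ", ", ", and " or a standalone " and "
    parts = re.split(r",\s*(?:and\s+)?|\s+and\s+", flat)
    return [p.strip() for p in parts if p.strip()]


# "\\" in the source code is a single backslash in the actual string
raw = "C. Bal\\'azs, E. L. Berger, P. M. Nadolsky, C.-P. Yuan"
names = split_authors(raw)
print(names)
```

Because real author strings are irregular, this kind of splitting is fragile, which is one reason to prefer the pre-parsed field when it is available.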
# Of course, the authors_parsed field in the raw dataset already contains the parsed author information, so we can use it directly for the statistics that follow.
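Strings are among the most commonly used data types in Python and are created with single (') or double (") quotes. Python has no separate character type, so even a single character is stored as a string. Square brackets slice a string, as in this example:
var1 = 'Hello Datawhale!'
var2 = "Python Everywhere!"
print("var1[-10:]: ", var1[-10:])
print("var2[0:6]: ", var2[0:6])
var1[-10:]:  Datawhale!
var2[0:6]:  Python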
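The slicing rules behind this example can be checked with a short sketch. It reuses the same sample string and relies only on standard Python slice semantics; the variable names are illustrative.

```python
s = 'Hello Datawhale!'

# Slices are half-open: s[start:stop] includes start and excludes stop
first_word = s[0:5]
# Negative indices count from the end of the string
last_ten = s[-10:]
# Omitting both bounds with a step of -1 walks the string backwards
reversed_s = s[::-1]

print(first_word)   # Hello
print(last_ten)     # Datawhale!
print(len(s[0:7]))  # 7, because a slice holds stop - start characters
```

The half-open convention is why a label such as `var2[0:6]` must match the slice actually taken: `var2[0:7]` would include the space after "Python".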
Python also provides many built-in functions and standard-library modules that are convenient to use:
import os
os.getcwd()
'C:\\Users\\Administrator\\Desktop\\datawhale\\数据分析之学术前沿分析'
import json
import pandas as pd

data = []
# The file is in JSON Lines format, so parse it one record at a time
with open('./arxiv-metadata-oai-2019.json/arxiv-metadata-oai-2019.json', 'r') as f:
    for line in f:
        d = json.loads(line)
        # To keep processing simple, read only three fields
        d = {'authors': d['authors'], 'categories': d['categories'], 'authors_parsed': d['authors_parsed']}
        data.append(d)
data = pd.DataFrame(data)
# Select the papers whose categories include cs.CV
data2 = data[data['categories'].apply(lambda x: 'cs.CV' in x)]
# Preview the filtered rows before concatenating all authors
data2
| | authors | categories | authors_parsed |
---|---|---|---|
531 | Mahesh Pal | cs.NE cs.CV | [[Pal, Mahesh, ]] |
1408 | Serguei A. Mokhov, Stephen Sinclair, Ian Cl\'e... | cs.SD cs.CL cs.CV cs.MM cs.NE | [[Mokhov, Serguei A., , for the MARF R&D Group... |
3231 | Chris Aholt, Bernd Sturmfels, Rekha Thomas | math.AG cs.CV | [[Aholt, Chris, ], [Sturmfels, Bernd, ], [Thom... |
4120 | Jos\'e I. Ronda, Antonio Vald\'es and Guillerm... | cs.CV | [[Ronda, José I., ], [Valdés, Antonio, ], [Gal... |
4378 | Tanaya Guha and Rabab K. Ward | cs.CV | [[Guha, Tanaya, ], [Ward, Rabab K., ]] |
... | ... | ... | ... |
167912 | Zilong Ji, Xiaolong Zou, Tiejun Huang, Si Wu | cs.CV cs.LG | [[Ji, Zilong, ], [Zou, Xiaolong, ], [Huang, Ti... |
167913 | Tristan Sylvain, Linda Petrini, Devon Hjelm | cs.CV cs.LG | [[Sylvain, Tristan, ], [Petrini, Linda, ], [Hj... |
167914 | Jonathan Ho, Nal Kalchbrenner, Dirk Weissenbor... | cs.CV | [[Ho, Jonathan, ], [Kalchbrenner, Nal, ], [Wei... |
167918 | Chia-Mu Yu, Ching-Tang Chang, Yen-Wu Ti | cs.CV | [[Yu, Chia-Mu, ], [Chang, Ching-Tang, ], [Ti, ... |
167964 | Dian Chen and Brady Zhou and Vladlen Koltun an... | cs.RO cs.AI cs.CV cs.LG | [[Chen, Dian, ], [Zhou, Brady, ], [Koltun, Vla... |
11168 rows × 3 columns
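The filter above uses a substring test (`'cs.CV' in x`) on the space-separated categories string. A stricter variant, sketched here with a hypothetical helper `has_category`, splits the string and compares whole category codes. For cs.CV the two approaches most likely select the same rows, but for a shorter or overlapping code a substring test could match unintended categories.

```python
def has_category(categories: str, target: str) -> bool:
    """Return True if `target` is one of the space-separated category codes."""
    return target in categories.split()


print(has_category('cs.NE cs.CV', 'cs.CV'))   # True
print(has_category('math.AG cs.CV', 'cs.C'))  # False
print('cs.C' in 'math.AG cs.CV')              # True: the substring test is looser
```

With the DataFrame from this tutorial, the helper could be applied as `data[data['categories'].apply(lambda x: has_category(x, 'cs.CV'))]`.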
all_authors = sum(data2['authors_parsed'], [])
# After this step, all_authors is one flat list in which each element is the parsed name of a single author. We start by counting how often each full name occurs.
all_authors
[['Pal', 'Mahesh', ''], ['Mokhov', 'Serguei A.', '', 'for the MARF R&D Group'], ['Sinclair', 'Stephen', '', 'for the MARF R&D Group'], ['Clément', 'Ian', '', 'for the MARF R&D Group'], ['Nicolacopoulos', 'Dimitrios', '', 'for the MARF R&D Group'], ['Aholt', 'Chris', ''], ['Sturmfels', 'Bernd', ''], ['Thomas', 'Rekha', ''], ['Ronda', 'José I.', ''], ['Valdés', 'Antonio', ''], ['Gallego', 'Guillermo', ''], ['Guha', 'Tanaya', ''], ['Ward', 'Rabab K.', ''], ['Olaizola', 'Igor G.', ''], ['Quartulli', 'Marco', ''], ['Florez', 'Julian', ''], ['Sierra', 'Basilio', ''], ['Xie', 'Xiaohua', ''], ['Xu', 'Kai', ''], ['Mitra', 'Niloy J.', ''], ['Cohen-Or', 'Daniel', ''], ['Chen', 'Baoquan', ''], ['Guha', 'Tanaya', ''], ['Nezhadarya', 'Ehsan', ''], ['Ward', 'Rabab K', ''], ['Gao', 'Fei', ''], ['Tao', 'Dacheng', ''], ['Gao', 'Xinbo', ''], ['Li', 'Xuelong', ''], ['Sun', 'Yuli', ''], ['Tao', 'Jinxu', ''], ['Liu', 'Conggui', ''], ['Sun', 'Yuli', ''], ['Tao', 'Jinxu', ''], ['Soorma', 'Neha', '', 'M.TECH'], ['Singh', 'Jaikaran', '', 'Department of Electronics and Communication, SSSIST, Sehore,\n M.P. India'], ['Tiwari', 'Mukesh', '', 'Department of Electronics and Communication, SSSIST, Sehore,\n M.P. India'], ['Poling', 'Bryan', ''], ['Lerman', 'Gilad', ''], ['Szlam', 'Arthur', ''], ['Chung', 'Moo K.', ''], ['Hanson', 'Jamie L.', ''], ['Ye', 'Jieping', ''], ['Davidson', 'Richard J.', ''], ['Pollak', 'Seth D.', ''], ['Li', 'Junhua', ''], ['Struzik', 'Zbigniew', ''], ['Zhang', 'Liqing', ''], ['Cichocki', 'Andrzej', ''], ['Gilani', 'Syed Zulqarnain', ''], ['Mian', 'Ajmal', ''], ['Shafait', 'Faisal', ''], ['Reid', 'Ian', ''], ['Li', 'Junhua', ''], ['Li', 'Chao', ''], ['Cichocki', 'Andrzej', ''], ['van Gennip', 'Yves', ''], ['Athavale', 'Prashant', ''], ['Gilles', 'Jérôme', ''], ['Choksi', 'Rustum', ''], ['Vitanyi', 'P. M. 
B.', '', 'CWI and University of Amsterdam'], ['Vitale', 'Jonathan', ''], ['Williams', 'Mary-Anne', ''], ['Johnston', 'Benjamin', ''], ['Boccignone', 'Giuseppe', ''], ['Borji', 'Ali', ''], ['Cheng', 'Ming-Ming', ''], ['Hou', 'Qibin', ''], ['Jiang', 'Huaizu', ''], ['Li', 'Jia', ''], ['Strauß', 'Tobias', '', 'for the University of Rostock - CITlab'], ['Grüning', 'Tobias', '', 'for the University of Rostock - CITlab'], ['Leifert', 'Gundram', '', 'for the University of Rostock - CITlab'], ['Labahn', 'Roger', '', 'for the University of Rostock - CITlab'], ['Leifert', 'Gundram', '', 'for the University of Rostock - CITlab'], ['Grüning', 'Tobias', '', 'for the University of Rostock - CITlab'], ['Strauß', 'Tobias', '', 'for the University of Rostock - CITlab'], ['Labahn', 'Roger', '', 'for the University of Rostock - CITlab'], ['Cohen', 'Taco S.', ''], ['Welling', 'Max', ''], ['Torres', 'Wuilian', ''], ['Rueda-Toicen', 'Antonio', ''], ['Melo', 'E. F.', ''], ['de Oliveira', 'H. M.', ''], ['Oh', 'Tae-Hyun', ''], ['Tai', 'Yu-Wing', ''], ['Bazin', 'Jean-Charles', ''], ['Kim', 'Hyeongwoo', ''], ['Kweon', 'In So', ''], ['Li', 'Xiangru', ''], ['Lu', 'Yu', ''], ['Comte', 'Georges', ''], ['Luo', 'Ali', ''], ['Zhao', 'Yongheng', ''], ['Wang', 'Yongjun', ''], ['Lai', 'Hanjiang', ''], ['Pan', 'Yan', ''], ['Liu', 'Ye', ''], ['Yan', 'Shuicheng', ''], ['Felzenszwalb', 'Pedro F.', ''], ['Svaiter', 'Benar F.', ''], ['Arrigoni', 'Federica', ''], ['Fusiello', 'Andrea', ''], ['Rossi', 'Beatrice', ''], ['Fragneto', 'Pasqualina', ''], ['Mandal', 'Subhamoy', ''], ['Sudarshan', 'Viswanath Pamulakanty', ''], ['Nagaraj', 'Yeshaswini', ''], ['Ben', 'Xose Luis Dean', ''], ['Razansky', 'Daniel', ''], ['Boccignone', 'Giuseppe', ''], ['Parizi', 'Sobhan Naderi', ''], ['He', 'Kun', ''], ['Aghajani', 'Reza', ''], ['Sclaroff', 'Stan', ''], ['Felzenszwalb', 'Pedro', ''], ['Isikdogan', 'F.', ''], ['Bovik', 'A. 
C.', ''], ['Passalacqua', 'P.', ''], ['Bohi', 'Amine', ''], ['Prandi', 'Dario', ''], ['Guis', 'Vincente', ''], ['Bouchara', 'Frédéric', ''], ['Gauthier', 'Jean-Paul', ''], ['Hafemann', 'Luiz G.', ''], ['Sabourin', 'Robert', ''], ['Oliveira', 'Luiz S.', ''], ['Oh', 'Tae-Hyun', ''], ['Matsushita', 'Yasuyuki', ''], ['Tai', 'Yu-Wing', ''], ['Kweon', 'In So', ''], ['Tsakiris', 'Manolis C.', ''], ['Vidal', 'Rene', ''], ['Tsakiris', 'Manolis C.', ''], ['Vidal', 'Rene', ''], ['Mandal', 'Subhamoy', ''], ['Deán-Ben', 'Xosé Luís', ''], ['Razansky', 'Daniel', ''], ['Palmieri', 'Luigi', ''], ['Rudenko', 'Andrey', ''], ['Arras', 'Kai O.', ''], ['Ginosar', 'Shiry', ''], ['Rakelly', 'Kate', ''], ['Sachs', 'Sarah', ''], ['Yin', 'Brian', ''], ['Lee', 'Crystal', ''], ['Krahenbuhl', 'Philipp', ''], ['Efros', 'Alexei A.', ''], ['McClure', 'Patrick', ''], ['Kriegeskorte', 'Nikolaus', ''], ['Fu', 'Yanwei', ''], ['Huang', 'De-An', ''], ['Sigal', 'Leonid', ''], ['Huang', 'Shaoli', ''], ['Xu', 'Zhe', ''], ['Tao', 'Dacheng', ''], ['Zhang', 'Ya', ''], ['Khosravi', 'Mohammad Reza', ''], ['Sharif-Yazd', 'Mohammad', ''], ['Moghimi', 'Mohammad Kazem', ''], ['Keshavarz', 'Ahmad', ''], ['Rostami', 'Habib', ''], ['Mansouri', 'Suleiman', ''], ['Huttunen', 'Heikki', ''], ['Yancheshmeh', 'Fatemeh Shokrollahi', ''], ['Chen', 'Ke', ''], ['Iyer', 'Rahul Radhakrishnan', ''], ['Parekh', 'Sanjeel', ''], ['Mohandoss', 'Vikas', ''], ['Ramsurat', 'Anush', ''], ['Raj', 'Bhiksha', ''], ['Singh', 'Rita', ''], ['Sudarshan', 'Viswanath P', ''], ['Weiser', 'Tobias', ''], ['Chintala', 'Phalgun', ''], ['Mandal', 'Subhamoy', ''], ['Dutta', 'Rahul', ''], ['Gaya', 'Joel D. 
O.', ''], ['Codevilla', 'Felipe', ''], ['Duarte', 'Amanda C.', ''], ['Drews-Jr', 'Paulo L.', ''], ['Botelho', 'Silvia S.', ''], ['Abdulkhaev', 'Alisher', ''], ['Yilmaz', 'Ozgur', ''], ['Liu', 'Fuqiang', ''], ['Bi', 'Fukun', ''], ['Chen', 'Liang', ''], ['Markuš', 'Nenad', ''], ['Pandžić', 'Igor S.', ''], ['Ahlberg', 'Jörgen', ''], ['Granstrom', 'Karl', ''], ['Baum', 'Marcus', ''], ['Reuter', 'Stephan', ''], ['Savinov', 'Nikolay', ''], ['Haene', 'Christian', ''], ['Ladicky', 'Lubor', ''], ['Pollefeys', 'Marc', ''], ['Ponti', 'Moacir', ''], ['Riva', 'Mateus', ''], ['Barina', 'David', ''], ['Kula', 'Michal', ''], ['Zemcik', 'Pavel', ''], ['Granstrom', 'Karl', ''], ['Fatemi', 'Maryam', ''], ['Svensson', 'Lennart', ''], ['Chen', 'Yanxiang', ''], ['Hu', 'Yuxing', ''], ['Zhang', 'Luming', ''], ['Li', 'Ping', ''], ['Zhang', 'Chao', ''], ['Triki', 'Amal Rannen', ''], ['Blaschko', 'Matthew B.', ''], ['Chua', 'Jeroen', ''], ['Felzenszwalb', 'Pedro F.', ''], ['Konyushkova', 'Ksenia', ''], ['Sznitman', 'Raphael', ''], ['Fua', 'Pascal', ''], ['Zamzmi', 'Ghada', ''], ['Goldgof', 'Dmitry', ''], ['Kasturi', 'Rangachar', ''], ['Sun', 'Yu', ''], ['Ashmeade', 'Terri', ''], ['Gallego', 'Guillermo', ''], ['Lund', 'Jon E. 
A.', ''], ['Mueggler', 'Elias', ''], ['Rebecq', 'Henri', ''], ['Delbruck', 'Tobi', ''], ['Scaramuzza', 'Davide', ''], ['Arablouei', 'Reza', ''], ['Goan', 'Ethan', ''], ['Gensemer', 'Stephen', ''], ['Kusy', 'Branislav', ''], ['Al-Shabi', 'Mundher', ''], ['Cheah', 'Wooi Ping', ''], ['Connie', 'Tee', ''], ['Zha', 'Zhiyuan', ''], ['Wen', 'Bihan', ''], ['Zhang', 'Jiachao', ''], ['Zhou', 'Jiantao', ''], ['Zhu', 'Ce', ''], ['Spampinato', 'Concetto', ''], ['Palazzo', 'Simone', ''], ['Kavasidis', 'Isaak', ''], ['Giordano', 'Daniela', ''], ['Shah', 'Mubarak', ''], ['Souly', 'Nasim', ''], ['Aizenbud', 'Yariv', ''], ['Shkolnisky', 'Yoel', ''], ['Han', 'Lei', ''], ['Sun', 'Juanzhen', ''], ['Zhang', 'Wei', ''], ['Xiu', 'Yuanyuan', ''], ['Feng', 'Hailei', ''], ['Lin', 'Yinjing', ''], ['Coninx', 'Alexandre', ''], ['Bessière', 'Pierre', ''], ['Droulez', 'Jacques', ''], ['Clement', 'Lee', ''], ['Peretroukhin', 'Valentin', ''], ['Kelly', 'Jonathan', ''], ['Liu', 'Yi', ''], ['Liu', 'Jingwei', ''], ['Prangnell', 'Lee', ''], ['Peretroukhin', 'Valentin', ''], ['Clement', 'Lee', ''], ['Kelly', 'Jonathan', ''], ['Cai', 'Xiaohao', ''], ['Wallis', 'Christopher G. R.', ''], ['Chan', 'Jennifer Y. H.', ''], ['McEwen', 'Jason D.', ''], ['Oliveira', 'P. A. M.', ''], ['Cintra', 'R. J.', ''], ['Bayer', 'F. M.', ''], ['Kulasekera', 'S.', ''], ['Madanayake', 'A.', ''], ['Coutinho', 'V. 
A.', ''], ['Selvaraju', 'Ramprasaath R.', ''], ['Cogswell', 'Michael', ''], ['Das', 'Abhishek', ''], ['Vedantam', 'Ramakrishna', ''], ['Parikh', 'Devi', ''], ['Batra', 'Dhruv', ''], ['McCaig', 'Graeme', ''], ['DiPaola', 'Steve', ''], ['Gabora', 'Liane', ''], ['Liu', 'Min', ''], ['Shi', 'Yifei', ''], ['Zheng', 'Lintao', ''], ['Xu', 'Kai', ''], ['Huang', 'Hui', ''], ['Manocha', 'Dinesh', ''], ['Laga', 'Hamid', ''], ['Xie', 'Qian', ''], ['Jermyn', 'Ian H.', ''], ['Srivastava', 'Anuj', ''], ['Aksoy', 'Eren Erdal', ''], ['Orhan', 'Adil', ''], ['Woergoetter', 'Florentin', ''], ['Gewali', 'Utsav B.', ''], ['Monteiro', 'Sildomar T.', ''], ['Gewali', 'Utsav B.', ''], ['Monteiro', 'Sildomar T.', ''], ['Tang', 'Da', ''], ['Jebara', 'Tony', ''], ['McClure', 'Patrick', ''], ['Kriegeskorte', 'Nikolaus', ''], ['Mastriani', 'Mario', ''], ['Shah', 'Abhay', ''], ['Abramoff', 'Michael D.', ''], ['Wu', 'Xiaodong', ''], ['Zhang', 'Li', ''], ['Xiang', 'Tao', ''], ['Gong', 'Shaogang', ''], ['Iscen', 'Ahmet', ''], ['Tolias', 'Giorgos', ''], ['Avrithis', 'Yannis', ''], ['Furon', 'Teddy', ''], ['Chum', 'Ondrej', ''], ['Johnson', 'Jeremiah', ''], ['Emeršič', 'Žiga', ''], ['Štruc', 'Vitomir', ''], ['Peer', 'Peter', ''], ['Rozumnyi', 'Denys', ''], ['Kotera', 'Jan', ''], ['Sroubek', 'Filip', ''], ['Novotny', 'Lukas', ''], ['Matas', 'Jiri', ''], ['Lukežič', 'Alan', ''], ['Vojíř', 'Tomáš', ''], ['Čehovin', 'Luka', ''], ['Matas', 'Jiří', ''], ['Kristan', 'Matej', ''], ['Lu', 'Yuzhen', ''], ['Berenbaum', 'David', ''], ['Deighan', 'Dwyer', ''], ['Marlow', 'Thomas', ''], ['Lee', 'Ashley', ''], ['Frickel', 'Scott', ''], ['Howison', 'Mark', ''], ['Wu', 'Bichen', ''], ['Wan', 'Alvin', ''], ['Iandola', 'Forrest', ''], ['Jin', 'Peter H.', ''], ['Keutzer', 'Kurt', ''], ['Liu', 'Yun', ''], ['Cheng', 'Ming-Ming', ''], ['Hu', 'Xiaowei', ''], ['Wang', 'Kai', ''], ['Bai', 'Xiang', ''], ['Khoreva', 'Anna', ''], ['Perazzi', 'Federico', ''], ['Benenson', 'Rodrigo', ''], ['Schiele', 'Bernt', ''], 
['Sorkine-Hornung', 'Alexander', ''], ['Wijmans', 'Erik', ''], ['Furukawa', 'Yasutaka', ''], ['Le', 'Hieu', ''], ['Yu', 'Chen-Ping', ''], ['Zelinsky', 'Gregory', ''], ['Samaras', 'Dimitris', ''], ['Dong', 'Qiulei', ''], ['Hu', 'Zhanyi', ''], ['Averbuch-Elor', 'Hadar', ''], ['Bar', 'Nadav', ''], ['Cohen-Or', 'Daniel', ''], ['Albarqouni', 'Shadi', ''], ['Fotouhi', 'Javad', ''], ['Navab', 'Nassir', ''], ['Rahimpour', 'Alireza', ''], ['Taalimi', 'Ali', ''], ['Qi', 'Hairong', ''], ['Cai', 'Deng', ''], ['Zhuang', 'Xiahai', ''], ['Connie', 'Tee', ''], ['Al-Shabi', 'Mundher', ''], ['Goh', 'Michael', ''], ['Borsoi', 'Ricardo A.', ''], ['Aya', 'Julio C. C.', ''], ['Costa', 'Guilherme H.', ''], ['Bermudez', 'José C. M.', ''], ['Barron', 'Jonathan T.', ''], ['Zhang', 'He', ''], ['Sindagi', 'Vishwanath', ''], ['Patel', 'Vishal M.', ''], ['Kortylewski', 'Adam', ''], ['Wieczorek', 'Aleksander', ''], ['Wieser', 'Mario', ''], ['Blumer', 'Clemens', ''], ['Parbhoo', 'Sonali', ''], ['Morel-Forster', 'Andreas', ''], ['Roth', 'Volker', ''], ['Vetter', 'Thomas', ''], ['Qi', 'Guo-Jun', ''], ['Dutta', 'Anjan', ''], ['Sahbi', 'Hichem', ''], ['Emeršič', 'Žiga', ''], ['Gabriel', 'Luka Lan', ''], ['Štruc', 'Vitomir', ''], ['Peer', 'Peter', ''], ['Rafegas', 'Ivet', ''], ['Vanrell', 'Maria', ''], ['Alexandre', 'Luis A.', ''], ['Arias', 'Guillem', ''], ['Minaee', 'Shervin', ''], ['Abdolrashidi', 'Amirali', ''], ['Wang', 'Yao', ''], ['Zuo', 'Xinxin', ''], ['Wang', 'Sen', ''], ['Zheng', 'Jiangbin', ''], ['Yang', 'Ruigang', ''], ['Rahmani', 'Mostafa', ''], ['Atia', 'George', ''], ['Guo', 'Hengkai', ''], ['Wang', 'Guijin', ''], ['Chen', 'Xinghao', ''], ['Zhang', 'Cairong', ''], ['Qiao', 'Fei', ''], ['Yang', 'Huazhong', ''], ['Takahashi', 'Ryo', ''], ['Matsubara', 'Takashi', ''], ['Uehara', 'Kuniaki', ''], ['Gupta', 'Saurabh', ''], ['Tolani', 'Varun', ''], ['Davidson', 'James', ''], ['Levine', 'Sergey', ''], ['Sukthankar', 'Rahul', ''], ['Malik', 'Jitendra', ''], ['Yao', 'Hantao', ''], ['Dai', 'Feng', 
''], ['Zhang', 'Dongming', ''], ['Ma', 'Yike', ''], ['Zhang', 'Shiliang', ''], ['Zhang', 'Yongdong', ''], ['Tian', 'Qi', ''], ['Litjens', 'Geert', ''], ['Kooi', 'Thijs', ''], ['Bejnordi', 'Babak Ehteshami', ''], ['Setio', 'Arnaud Arindra Adiyoso', ''], ['Ciompi', 'Francesco', ''], ['Ghafoorian', 'Mohsen', ''], ['van der Laak', 'Jeroen A. W. M.', ''], ['van Ginneken', 'Bram', ''], ['Sánchez', 'Clara I.', ''], ['Zhang', 'Wei', ''], ['Hu', 'Shengnan', ''], ['Liu', 'Kan', ''], ['Zha', 'Zhengjun', ''], ['Sochor', 'Jakub', ''], ['Juránek', 'Roman', ''], ['Špaňhel', 'Jakub', ''], ['Maršík', 'Lukáš', ''], ['Široký', 'Adam', ''], ['Herout', 'Adam', ''], ['Zemčík', 'Pavel', ''], ['Mueggler', 'Elias', ''], ['Gallego', 'Guillermo', ''], ['Rebecq', 'Henri', ''], ['Scaramuzza', 'Davide', ''], ['Inoue', 'Hiroshi', ''], ['Mahbod', 'Amirreza', ''], ['Schaefer', 'Gerald', ''], ['Wang', 'Chunliang', ''], ['Ecker', 'Rupert', ''], ['Ellinger', 'Isabella', ''], ['Sochor', 'Jakub', ''], ['Špaňhel', 'Jakub', ''], ['Herout', 'Adam', ''], ['Xu', 'Sheng', ''], ['Wang', 'Ruisheng', ''], ['Zheng', 'Han', ''], ['Antonello', 'Morris', ''], ['Carraro', 'Marco', ''], ['Pierobon', 'Marco', ''], ['Menegatti', 'Emanuele', ''], ['Kawahara', 'Jeremy', ''], ['Hamarneh', 'Ghassan', ''], ['Li', 'Kun', ''], ['Yang', 'Jingyu', ''], ['Lai', 'Yu-Kun', ''], ['Guo', 'Daoliang', ''], ['Volkhonskiy', 'Denis', ''], ['Nazarov', 'Ivan', ''], ['Burnaev', 'Evgeny', ''], ['Baur', 'Christoph', ''], ['Albarqouni', 'Shadi', ''], ['Navab', 'Nassir', ''], ['Kim', 'Youngsung', ''], ['Yoo', 'ByungIn', ''], ['Kwak', 'Youngjun', ''], ['Choi', 'Changkyu', ''], ['Kim', 'Junmo', ''], ['Wu', 'Huikai', ''], ['Zheng', 'Shuai', ''], ['Zhang', 'Junge', ''], ['Huang', 'Kaiqi', ''], ['Lin', 'Yutian', ''], ['Zheng', 'Liang', ''], ['Zheng', 'Zhedong', ''], ['Wu', 'Yu', ''], ['Hu', 'Zhilan', ''], ['Yan', 'Chenggang', ''], ['Yang', 'Yi', ''], ['Zhao', 'Long', ''], ['Han', 'Fangda', ''], ['Peng', 'Xi', ''], ['Zhang', 'Xun', ''], ['Kapadia', 
'Mubbasir', ''], ['Pavlovic', 'Vladimir', ''], ['Metaxas', 'Dimitris N.', ''], ['Lou', 'Jing', ''], ['Wang', 'Huan', ''], ['Chen', 'Longtao', ''], ['Xu', 'Fenglei', ''], ['Xia', 'Qingyuan', ''], ['Zhu', 'Wei', ''], ['Ren', 'Mingwu', ''], ['Khoreva', 'Anna', ''], ['Benenson', 'Rodrigo', ''], ['Ilg', 'Eddy', ''], ['Brox', 'Thomas', ''], ['Schiele', 'Bernt', ''], ['Wang', 'Zhiguang', ''], ['Yang', 'Jianbo', ''], ['Cannings', 'Timothy I.', ''], ['Berrett', 'Thomas B.', ''], ['Samworth', 'Richard J.', ''], ['Avola', 'Danilo', ''], ['Foresti', 'Gian Luca', ''], ['Martinel', 'Niki', ''], ['Pannone', 'Daniele', ''], ['Piciarelli', 'Claudio', ''], ['Shrikumar', 'Avanti', ''], ['Greenside', 'Peyton', ''], ['Kundaje', 'Anshul', ''], ['Wu', 'Zuxuan', ''], ['Davis', 'Larry S.', ''], ['Sigal', 'Leonid', ''], ['Meinhardt', 'Tim', ''], ['Moeller', 'Michael', ''], ['Hazirbas', 'Caner', ''], ['Cremers', 'Daniel', ''], ['Elgendy', 'Omar A.', ''], ['Chan', 'Stanley H.', ''], ['Arnold', 'Lukas On', '', 'for the SoLid collaboration'], ['Janai', 'Joel', ''], ['Güney', 'Fatma', ''], ['Behl', 'Aseem', ''], ['Geiger', 'Andreas', ''], ['Deniz', 'Cem M.', ''], ['Xiang', 'Siyuan', ''], ['Hallyburton', 'Spencer', ''], ['Welbeck', 'Arakua', ''], ['Babb', 'James S.', ''], ['Honig', 'Stephen', ''], ['Cho', 'Kyunghyun', ''], ['Chang', 'Gregory', ''], ['Carvalho', 'João', ''], ['Marques', 'Manuel', ''], ['Costeira', 'João P.', ''], ['Xu', 'Minmin', ''], ['Xu', 'Siyu', ''], ['Zhu', 'Jihua', ''], ['Li', 'Yaochen', ''], ['Wang', 'Jun', ''], ['Lu', 'Huimin', ''], ['Brogan', 'Joel', ''], ['Bestagini', 'Paolo', ''], ['Bharati', 'Aparna', ''], ['Pinto', 'Allan', ''], ['Moreira', 'Daniel', ''], ['Bowyer', 'Kevin', ''], ['Flynn', 'Patrick', ''], ['Rocha', 'Anderson', ''], ['Scheirer', 'Walter', ''], ['Pandey', 'Gaurav', ''], ['Dukkipati', 'Ambedkar', ''], ['Wang', 'Xiaosong', ''], ['Peng', 'Yifan', ''], ['Lu', 'Le', ''], ['Lu', 'Zhiyong', ''], ['Bagheri', 'Mohammadhadi', ''], ['Summers', 'Ronald M.', ''], 
['Borkar', 'Tejas', ''], ['Karam', 'Lina', ''], ['Harangi', 'Balazs', ''], ['Bae', 'Sung-Ho', ''], ['Elgharib', 'Mohamed', ''], ['Hefeeda', 'Mohamed', ''], ['Matusik', 'Wojciech', ''], ['Zhang', 'Jing', ''], ['Li', 'Wanqing', ''], ['Ogunbona', 'Philip', ''], ['Xu', 'Dong', ''], ['Lu', 'Yao', ''], ['Yang', 'Zhirong', ''], ['Kannala', 'Juho', ''], ['Kaski', 'Samuel', ''], ['Wang', 'Zhengyang', ''], ['Yuan', 'Hao', ''], ['Ji', 'Shuiwang', ''], ['Dong', 'Xingping', ''], ['Shen', 'Jianbing', ''], ['Wu', 'Dongming', ''], ['Guo', 'Kan', ''], ['Jin', 'Xiaogang', ''], ['Porikli', 'Fatih', ''], ['Krishna', 'Onkar', ''], ['Aizawa', 'Kiyoharu', ''], ['Helo', 'Andrea', ''], ['Pia', 'Rama', ''], ['Goldman', 'Eran', ''], ['Goldberger', 'Jacob', ''], ['Dong', 'Xin', ''], ['Chen', 'Shangyu', ''], ['Pan', 'Sinno Jialin', ''], ['Khalili', 'A. M.', ''], ['Kiran', 'B Ravi', ''], ['Das', 'Arindam', ''], ['Yogamani', 'Senthil', ''], ['Herring', 'James', ''], ['Nagy', 'James', ''], ['Ruthotto', 'Lars', ''], ['Deza', 'Arturo', ''], ['Jonnalagadda', 'Aditya', ''], ['Eckstein', 'Miguel', ''], ['Veshki', 'Farshad G.', ''], ['Vorobyov', 'Sergiy A.', ''], ['Baisa', 'Nathanael L.', ''], ['Bhowmik', 'Deepayan', ''], ['Wallace', 'Andrew', ''], ['Baisa', 'Nathanael L.', ''], ['Wallace', 'Andrew', ''], ['Soleymani', 'Roghayeh', ''], ['Granger', 'Eric', ''], ['Fumera', 'Giorgio', ''], ['Tsakiris', 'Manolis C.', ''], ['Vidal', 'Rene', ''], ['Si-Yao', 'Li', ''], ['Ren', 'Dongwei', ''], ['Yin', 'Qian', ''], ['Wu', 'Jiqing', ''], ['Huang', 'Zhiwu', ''], ['Acharya', 'Dinesh', ''], ['Li', 'Wen', ''], ['Thoma', 'Janine', ''], ['Paudel', 'Danda Pani', ''], ['Van Gool', 'Luc', ''], ['Kragh', 'Mikkel', ''], ['Underwood', 'James', ''], ['Chen', 'Chong', ''], ['Öktem', 'Ozan', ''], ['Sun', 'Xu', ''], ['Ren', 'Xuancheng', ''], ['Ma', 'Shuming', ''], ['Wang', 'Houfeng', ''], ['Joshi', 'Sharad', ''], ['Khanna', 'Nitin', ''], ['Shao', 'Ruifeng', ''], ['Xu', 'Ning', ''], ['Geng', 'Xin', ''], ['Nagar', 'Rajendra', 
''], ['Raman', 'Shanmuganathan', ''], ['Wang', 'Chaoyue', ''], ['Xu', 'Chang', ''], ['Wang', 'Chaohui', ''], ['Tao', 'Dacheng', ''], ['Zheng', 'Zhedong', ''], ['Zheng', 'Liang', ''], ['Yang', 'Yi', ''], ['Yao', 'Hantao', ''], ['Zhang', 'Shiliang', ''], ['Zhang', 'Yongdong', ''], ['Li', 'Jintao', ''], ['Tian', 'Qi', ''], ['Jund', 'Philipp', ''], ['Eitel', 'Andreas', ''], ['Abdo', 'Nichola', ''], ['Burgard', 'Wolfram', ''], ['Liao', 'Jun', ''], ['Jiang', 'Yutong', ''], ['Bian', 'Zichao', ''], ['Mahrou', 'Bahareh', ''], ['Nambiar', 'Aparna', ''], ['Magsam', 'Alexander W.', ''], ['Guo', 'Kaikai', ''], ['Cho', 'Yong Ku', ''], ['Zheng', 'Guoan', ''], ['Zeng', 'Zhiqiang', ''], ['Zhang', 'Jian', ''], ['Wang', 'Xiaodong', ''], ['Chen', 'Yuming', ''], ['Zhu', 'Chaoyang', ''], ['Yang', 'Weixin', ''], ['Lyons', 'Terry', ''], ['Ni', 'Hao', ''], ['Schmid', 'Cordelia', ''], ['Jin', 'Lianwen', ''], ['Vongkulbhisal', 'Jayakorn', ''], ['De la Torre', 'Fernando', ''], ['Costeira', 'João P.', ''], ['Guo', 'Tian', ''], ['Ji', 'Pan', ''], ['Reid', 'Ian', ''], ['Garg', 'Ravi', ''], ['Li', 'Hongdong', ''], ['Salzmann', 'Mathieu', ''], ['Aksoy', 'Yağız', ''], ['Aydın', 'Tunç Ozan', ''], ['Pollefeys', 'Marc', ''], ['Noyel', 'Guillaume', '', 'IPRI, SIGPH@iPRI'], ['Mees', 'Oier', ''], ['Eitel', 'Andreas', ''], ['Burgard', 'Wolfram', ''], ['Bojanowski', 'Piotr', ''], ['Joulin', 'Armand', ''], ['Lopez-Paz', 'David', ''], ['Szlam', 'Arthur', ''], ['Wojna', 'Zbigniew', ''], ['Ferrari', 'Vittorio', ''], ['Guadarrama', 'Sergio', ''], ['Silberman', 'Nathan', ''], ['Chen', 'Liang-Chieh', ''], ['Fathi', 'Alireza', ''], ['Uijlings', 'Jasper', ''], ['Benligiray', 'Burak', ''], ['Topal', 'Cihan', ''], ['Akinlar', 'Cuneyt', ''], ['Yu', 'Fisher', ''], ['Wang', 'Dequan', ''], ['Shelhamer', 'Evan', ''], ['Darrell', 'Trevor', ''], ['Zhao', 'Ningning', ''], ["O'Connor", 'Daniel', ''], ['Basarab', 'Adrian', ''], ['Ruan', 'Dan', ''], ['Hu', 'Peng', ''], ['Sheng', 'Ke', ''], ['Tong', 'Xin-Yi', ''], ['Xia', 
'Gui-Song', ''], ['Hu', 'Fan', ''], ['Zhong', 'Yanfei', ''], ['Datcu', 'Mihai', ''], ['Zhang', 'Liangpei', ''], ['Rahimpour', 'Alireza', ''], ['Liu', 'Liu', ''], ['Taalimi', 'Ali', ''], ['Song', 'Yang', ''], ['Qi', 'Hairong', ''], ['Pontes', 'Jhony K.', ''], ['Kong', 'Chen', ''], ['Eriksson', 'Anders', ''], ['Fookes', 'Clinton', ''], ['Sridharan', 'Sridha', ''], ['Lucey', 'Simon', ''], ['Guo', 'Chunchao', ''], ['Lai', 'Jianhuang', ''], ['Xie', 'Xiaohua', ''], ['Prakash', 'Jaya', ''], ['Mandal', 'Subhamoy', ''], ['Razansky', 'Daniel', ''], ['Ntziachristos', 'Vasilis', ''], ['Xiao', 'Chang', ''], ['Zhang', 'Cheng', ''], ['Zheng', 'Changxi', ''], ['Phung', 'Manh Duong', ''], ['Hoang', 'Van Truong', ''], ['Dinh', 'Tran Hiep', ''], ['Ha', 'Quang', ''], ['Bali', 'Alexandre', ''], ['Ghiasi-Shirazi', 'Kamaledin', ''], ['Zhang', 'Chengyue', ''], ['Li', 'Zhiwei', ''], ['Cheng', 'Qing', ''], ['Li', 'Xinghua', ''], ['Shen', 'Huanfeng', ''], ['Baskin', 'Chaim', ''], ['Liss', 'Natan', ''], ['Zheltonozhskii', 'Evgenii', ''], ['Bronshtein', 'Alex M.', ''], ['Mendelson', 'Avi', ''], ['Peretroukhin', 'Valentin', ''], ['Clement', 'Lee', ''], ['Giamou', 'Matthew', ''], ['Kelly', 'Jonathan', ''], ['Zhang', 'He', ''], ['Sindagi', 'Vishwanath', ''], ['Patel', 'Vishal M.', ''], ['Lee', 'Minhyeok', ''], ['Seok', 'Junhee', ''], ['Park', 'Hyung Suk', ''], ['Lee', 'Sung Min', ''], ['Kim', 'Hwa Pyung', ''], ['Seo', 'Jin Keun', ''], ['Tixier', 'Antoine Jean-Pierre', ''], ['Nikolentzos', 'Giannis', ''], ['Meladianos', 'Polykarpos', ''], ['Vazirgiannis', 'Michalis', ''], ['Zeng', 'Yu', ''], ['Lu', 'Huchuan', ''], ['Borji', 'Ali', ''], ['Cho', 'Donghyeon', ''], ['Park', 'Jinsun', ''], ['Oh', 'Tae-Hyun', ''], ['Tai', 'Yu-Wing', ''], ['Kweon', 'In So', ''], ['Komorowski', 'Michal', ''], ['Trzcinski', 'Tomasz', ''], ['Pourkamali-Anaraki', 'Farhad', ''], ['Becker', 'Stephen', ''], ['Chen', 'Xinghao', ''], ['Wang', 'Guijin', ''], ['Guo', 'Hengkai', ''], ['Zhang', 'Cairong', ''], ['Yu', 'Zhou', ''], 
['Yu', 'Jun', ''], ['Xiang', 'Chenchao', ''], ['Fan', 'Jianping', ''], ['Tao', 'Dacheng', ''], ['Zhang', 'Quanshi', ''], ['Wu', 'Ying Nian', ''], ['Zhang', 'Hao', ''], ['Zhu', 'Song-Chun', ''], ['Laloy', 'Eric', ''], ['Hérault', 'Romain', ''], ['Jacques', 'Diederik', ''], ['Linde', 'Niklas', ''], ['Lobos', 'Rodrigo A.', ''], ['Kim', 'Tae Hyung', ''], ['Hoge', 'W. Scott', ''], ['Haldar', 'Justin P.', ''], ['Mokari', 'Mozhgan', ''], ['Mohammadzade', 'Hoda', ''], ['Ghojogh', 'Benyamin', ''], ['Yi', 'Xin', ''], ['Babyn', 'Paul', ''], ['Yao', 'Yazhou', ''], ['Zhang', 'Jian', ''], ['Shen', 'Fumin', ''], ['Liu', 'Li', ''], ['Zhu', 'Fan', ''], ['Zhang', 'Dongxiang', ''], ['Shen', 'Heng-Tao', ''], ['Bas', 'Anil', ''], ['Smith', 'William A. P.', ''], ['Emeršič', 'Žiga', ''], ['Štepec', 'Dejan', ''], ['Štruc', 'Vitomir', ''], ['Peer', 'Peter', ''], ['George', 'Anjith', ''], ['Ahmad', 'Adil', ''], ['Omar', 'Elshibani', ''], ['Boult', 'Terrance E.', ''], ['Safdari', 'Reza', ''], ['Zhou', 'Yuxiang', ''], ['Zafeiriou', 'Stefanos', ''], ['Yaman', 'Dogucan', ''], ['Eyiokur', 'Fevziye I.', ''], ['Ekenel', 'Hazim K.', ''], ['Sakaridis', 'Christos', ''], ['Dai', 'Dengxin', ''], ['Van Gool', 'Luc', ''], ['Nguyen', 'Anh', ''], ['Do', 'Thanh-Toan', ''], ['Caldwell', 'Darwin G.', ''], ['Tsagarakis', 'Nikos G.', ''], ['Moolekamp', 'Fred', ''], ['Melchior', 'Peter', ''], ['Shen', 'Li', ''], ['Margolies', 'Laurie R.', ''], ['Rothstein', 'Joseph H.', ''], ['Fluder', 'Eugene', ''], ['McBride', 'Russell B.', ''], ['Sieh', 'Weiva', ''], ['Datta', 'Shounak', ''], ['Nag', 'Sayak', ''], ['Das', 'Swagatam', ''], ['Helber', 'Patrick', ''], ['Bischke', 'Benjamin', ''], ['Dengel', 'Andreas', ''], ['Borth', 'Damian', ''], ['He', 'Xiangteng', ''], ['Peng', 'Yuxin', ''], ['Cangea', 'Cătălina', ''], ['Veličković', 'Petar', ''], ['Liò', 'Pietro', ''], ['Wu', 'Cinna', ''], ['Tygert', 'Mark', ''], ['LeCun', 'Yann', ''], ['Garcia', 'Noa', ''], ['Vogiatzis', 'George', ''], ['Hu', 'Jie', ''], ['Shen', 'Li', ''], 
['Albanie', 'Samuel', ''], ['Sun', 'Gang', ''], ['Wu', 'Enhua', ''], ['Anwar', 'Syed Muhammad', ''], ['Majid', 'Muhammad', ''], ['Qayyum', 'Adnan', ''], ['Awais', 'Muhammad', ''], ['Alnowami', 'Majdi', ''], ['Khan', 'Muhammad Khurram', ''], ['Wang', 'Qian', ''], ['Chen', 'Ke', ''], ['Lesort', 'Timothée', ''], ['Seurin', 'Mathieu', ''], ['Li', 'Xinrui', ''], ['Díaz-Rodríguez', 'Natalia', ''], ['Filliat', 'David', ''], ['Jiang', 'Lai', ''], ['Xu', 'Mai', ''], ['Wang', 'Zulin', ''], ['Rangesh', 'Akshay', ''], ['Yuen', 'Kevan', ''], ['Satzoda', 'Ravi Kumar', ''], ['Rajaram', 'Rakesh Nattoji', ''], ['Gunaratne', 'Pujitha', ''], ['Trivedi', 'Mohan M.', ''], ['Fong', 'Chamberlain', ''], ['Jha', 'Ranjeet Ranjan', ''], ['Thapar', 'Daksh', ''], ['Patil', 'Shreyas Malakarjun', ''], ['Nigam', 'Aditya', ''], ['Bhunia', 'Ankan Kumar', ''], ['Alaei', 'Alireza', ''], ['Roy', 'Partha Pratim', ''], ['Corbière', 'Charles', ''], ['Ben-Younes', 'Hedi', ''], ['Ramé', 'Alexandre', ''], ['Ollion', 'Charles', ''], ['Dubey', 'Shiv Ram', ''], ['Lerman', 'Gilad', ''], ['Shi', 'Yunpeng', ''], ['Zhang', 'Teng', ''], ['Pasquale', 'Giulia', ''], ['Ciliberto', 'Carlo', ''], ['Odone', 'Francesca', ''], ['Rosasco', 'Lorenzo', ''], ['Natale', 'Lorenzo', ''], ['Gong', 'Sixue', ''], ['Boddeti', 'Vishnu Naresh', ''], ['Jain', 'Anil K.', ''], ['Sanzari', 'Marta', ''], ['Ntouskos', 'Valsamis', ''], ['Pirri', 'Fiora', ''], ['Duran', 'Joan', ''], ['Buades', 'Antoni', ''], ['Di', 'Xing', ''], ['Sindagi', 'Vishwanath A.', ''], ['Patel', 'Vishal M.', ''], ['Xu', 'Mai', ''], ['Li', 'Tianyi', ''], ['Wang', 'Zulin', ''], ['Deng', 'Xin', ''], ['Yang', 'Ren', ''], ['Guan', 'Zhenyu', ''], ['Dar', 'Salman Ul Hassan', ''], ['Özbey', 'Muzaffer', ''], ['Çatlı', 'Ahmet Burak', ''], ['Çukur', 'Tolga', ''], ['Vidal', 'Rosaura G.', ''], ['Banerjee', 'Sreya', ''], ['Grm', 'Klemen', ''], ['Struc', 'Vitomir', ''], ['Scheirer', 'Walter J.', ''], ['Shi', 'Bowen', ''], ['Livescu', 'Karen', ''], ['Shin', 'Seung Yeon', ''], ['Lee', 
'Soochahn', ''], ['Yun', 'Il Dong', ''], ['Kim', 'Sun Mi', ''], ['Lee', 'Kyoung Mu', ''], ['Li', 'Yijun', ''], ['Huang', 'Jia-Bin', ''], ['Ahuja', 'Narendra', ''], ['Yang', 'Ming-Hsuan', ''], ['Tu', 'Peihan', ''], ['Tu', 'Peihan', ''], ['Bacchuwar', 'Ketan', '', 'GE Healthcare, LIGM'], ['Cousty', 'Jean', '', 'LIGM'], ['Vaillant', 'Régis', '', 'GE Healthcare'], ['Najman', 'Laurent', '', 'LIGM'], ['Thapar', 'Daksh', ''], ['Aggarwal', 'Divyansh', ''], ['Agarwal', 'Punjal', ''], ['Nigam', 'Aditya', ''], ['Jiang', 'Zutao', ''], ['Zhu', 'Jihua', ''], ['Evangelidis', 'Georgios D.', ''], ['Zhang', 'Changqing', ''], ['Pang', 'Shanmin', ''], ['Li', 'Yaochen', ''], ['Dolz', 'Jose', ''], ['Ayed', 'Ismail Ben', ''], ['Yuan', 'Jing', ''], ['Desrosiers', 'Christian', ''], ['Zhou', 'Linjun', ''], ['Cui', 'Peng', ''], ['Yang', 'Shiqiang', ''], ['Zhu', 'Wenwu', ''], ['Tian', 'Qi', ''], ...]
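The flattening step `sum(data2['authors_parsed'], [])` works, but it builds a new list at every addition, which gets slow as the number of papers grows. A common alternative is `itertools.chain.from_iterable`. The sketch below uses a small hand-written `papers` list as a stand-in for the real column; it is an illustration, not the tutorial's code.

```python
from itertools import chain

# Toy stand-in for data2['authors_parsed']: one list of parsed entries per paper
papers = [
    [['Pal', 'Mahesh', '']],
    [['Aholt', 'Chris', ''], ['Sturmfels', 'Bernd', ''], ['Thomas', 'Rekha', '']],
]

# sum() re-copies the accumulated list for every paper
flat_sum = sum(papers, [])
# chain.from_iterable visits every entry exactly once
flat_chain = list(chain.from_iterable(papers))

print(flat_chain == flat_sum)
print(len(flat_chain))
```

On the real data the equivalent call would be `list(chain.from_iterable(data2['authors_parsed']))`.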
# Join the name parts of each author into a single string
authors_names = [' '.join(x) for x in all_authors]
authors_names = pd.DataFrame(authors_names)
authors_names
| | 0 |
---|---|
0 | Pal Mahesh |
1 | Mokhov Serguei A. for the MARF R&D Group |
2 | Sinclair Stephen for the MARF R&D Group |
3 | Clément Ian for the MARF R&D Group |
4 | Nicolacopoulos Dimitrios for the MARF R&D Group |
... | ... |
49139 | Ti Yen-Wu |
49140 | Chen Dian |
49141 | Zhou Brady |
49142 | Koltun Vladlen |
49143 | Krähenbühl Philipp |
49144 rows × 1 columns
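As the table shows, joining every element of a parsed entry also pulls in trailing affiliation text such as "for the MARF R&D Group". One possible refinement, sketched below, keeps only the first two elements. It assumes each entry is ordered `[last, first, suffix, ...]`, which is what the sample rows suggest; `display_name` and `sample` are hypothetical names used for illustration.

```python
from collections import Counter

sample = [
    ['Pal', 'Mahesh', ''],
    ['Mokhov', 'Serguei A.', '', 'for the MARF R&D Group'],
    ['Ward', 'Rabab K.', ''],
    ['Ward', 'Rabab K', ''],
]


def display_name(parsed):
    """Build 'First Last' from a parsed entry, ignoring the suffix and any affiliation."""
    last, first = parsed[0], parsed[1]
    return f"{first} {last}".strip()


# Counter plays the same role as value_counts() on the DataFrame column
counts = Counter(display_name(a) for a in sample)
print(counts.most_common(2))
```

Spelling variants such as 'Rabab K.' and 'Rabab K' are still counted as different people here, so any frequency ranking built this way is approximate.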
# Plot a horizontal bar chart of the author frequencies
import matplotlib.pyplot as plt
plt.figure(figsize=(10, 6))
authors_names[0].value_counts().head(10).plot(kind='barh')
<AxesSubplot:>
# Adjust the figure settings
import matplotlib.pyplot as plt
plt.figure(figsize=(10, 6))
authors_names[0].value_counts().head(10).plot(kind='barh')
names = authors_names[0].value_counts().index.values[:10]
_ = plt.yticks(range(0, len(names)), names)
plt.ylabel('Author')
plt.xlabel('Count')
Text(0.5, 0, 'Count')
# Next, count the surnames, that is, the first element of each author entry in the authors_parsed field:
authors_lastnames = [x[0] for x in all_authors]
authors_lastnames = pd.DataFrame(authors_lastnames)
plt.figure(figsize=(10, 6))
authors_lastnames[0].value_counts().head(10).plot(kind = 'barh')
names = authors_lastnames[0].value_counts().index.values[:10]
_ = plt.yticks(range(0, len(names)), names)
plt.ylabel('Author')
plt.xlabel('Count')
# In the resulting plot, the most frequent surnames are all Chinese surnames.
Text(0.5, 0, 'Count')
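The surname count can also be done without a DataFrame. The following sketch uses `collections.Counter` on a few hand-picked entries (the `sample` list is illustrative, not the full dataset) and reports each surname's share of all author entries, which makes it easier to judge how dominant the top surnames are.

```python
from collections import Counter

sample = [
    ['Li', 'Junhua', ''],
    ['Zhang', 'Liqing', ''],
    ['Li', 'Chao', ''],
    ['Reid', 'Ian', ''],
]

# The surname is the first element of each parsed entry
surname_counts = Counter(entry[0] for entry in sample)
total = sum(surname_counts.values())

for surname, n in surname_counts.most_common(2):
    print(f"{surname}: {n} ({n / total:.0%})")
```

Note that these are counts of author entries, not of distinct people: one prolific author contributes to the surname's total once per paper.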