
PandasAI: Letting AI Do the Data Analysis


Installation

pip install pandasai

!pip install --upgrade pandas pandasai
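
The `PandasAI` class used in the rest of this walkthrough comes from the early releases of the library. Later releases appear to have reorganised the public API (an assumption worth verifying against the release notes), so it can help to check which version is installed before running the examples. A minimal sketch using only the standard library; the helper name `installed_version` is hypothetical and not part of pandasai:

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package: str):
    """Return the installed version string of `package`, or None if it is missing."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


# Hypothetical helper, not part of pandasai: report what is installed.
for name in ("pandas", "pandasai"):
    print(name, installed_version(name))
```

If `pandasai` prints as `None`, the install step above did not run in the same environment as the interpreter.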

Import dependencies

import pandas as pd
from pandasai import PandasAI
from pandasai.llm.openai import OpenAI

Create a DataFrame with pandas

df = pd.DataFrame({
    "country": ["United States", "United Kingdom", "France", "Germany", "Italy", "Spain", "Canada", "Australia", "Japan", "China"],
    "gdp": [21400000, 2940000, 2830000, 3870000, 2160000, 1350000, 1780000, 1320000, 516000, 14000000],
    "happiness_index": [7.3, 7.2, 6.5, 7.0, 6.0, 6.3, 7.3, 7.3, 5.9, 5.0]
})

Instantiate the LLM

OPENAI_API_KEY = "YOUR API KEY"
llm = OpenAI(api_token=OPENAI_API_KEY)
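
Hard-coding the key in a script makes it easy to leak. One common alternative, sketched here as an assumption rather than anything PandasAI requires, is to read it from an environment variable. The helper name `load_api_key` and the variable name `OPENAI_API_KEY` are illustrative choices:

```python
import os


def load_api_key(env_var: str = "OPENAI_API_KEY") -> str:
    """Read the API key from an environment variable instead of hard-coding it."""
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(f"Set the {env_var} environment variable first")
    return key


# Usage with the constructor shown above (requires pandasai to be installed):
# llm = OpenAI(api_token=load_api_key())
```

The helper fails early with a clear message when the variable is unset, instead of sending an empty token to the API.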

Instantiate PandasAI with the chosen large language model and run a query

pandas_ai = PandasAI(llm, conversational=False)
pandas_ai.run(df, prompt='What are the 5 happiest countries')
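
Because the answer comes from model-generated code, it is worth cross-checking against plain pandas. The sketch below computes the five highest `happiness_index` rows directly. It is a hand-written equivalent for verification, not the code PandasAI itself produces:

```python
import pandas as pd

df = pd.DataFrame({
    "country": ["United States", "United Kingdom", "France", "Germany", "Italy",
                "Spain", "Canada", "Australia", "Japan", "China"],
    "happiness_index": [7.3, 7.2, 6.5, 7.0, 6.0, 6.3, 7.3, 7.3, 5.9, 5.0],
})

# Five rows with the highest happiness_index, highest first.
top5 = df.nlargest(5, "happiness_index")[["country", "happiness_index"]]
print(top5)
```

With this data the five countries are the United States, Canada, Australia, the United Kingdom and Germany; the first three tie at 7.3.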

Plotting a chart

pandas_ai.run(df, "Plot the histogram of countries showing for each the gdp, using different colors for each bar")
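
The prompt above effectively asks for a bar chart with one bar per country. For comparison, a hand-written matplotlib version might look like the sketch below. It assumes matplotlib is installed, and the `tab10` palette and the `Agg` backend are illustrative choices; what PandasAI actually generates may differ:

```python
import matplotlib

matplotlib.use("Agg")  # assumption: headless backend; omit this line in a notebook
import matplotlib.pyplot as plt
import pandas as pd

df = pd.DataFrame({
    "country": ["United States", "United Kingdom", "France", "Germany", "Italy",
                "Spain", "Canada", "Australia", "Japan", "China"],
    "gdp": [21400000, 2940000, 2830000, 3870000, 2160000,
            1350000, 1780000, 1320000, 516000, 14000000],
})

# One distinct colour per bar, cycling through the tab10 palette.
palette = plt.get_cmap("tab10").colors
colors = [palette[i % len(palette)] for i in range(len(df))]

fig, ax = plt.subplots(figsize=(10, 4))
ax.bar(df["country"], df["gdp"], color=colors)
ax.set_xlabel("country")
ax.set_ylabel("gdp")
ax.tick_params(axis="x", rotation=45)
fig.tight_layout()
```

Call `fig.savefig("gdp.png")` in a script, or drop the `matplotlib.use("Agg")` line and call `plt.show()` in a notebook, to view the result.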

 

Asking questions across multiple DataFrames

employees_data = {
    'EmployeeID': [1, 2, 3, 4, 5],
    'Name': ['John', 'Emma', 'Liam', 'Olivia', 'William'],
    'Department': ['HR', 'Sales', 'IT', 'Marketing', 'Finance']
}
salaries_data = {
    'EmployeeID': [1, 2, 3, 4, 5],
    'Salary': [5000, 6000, 4500, 7000, 5500]
}
employees_df = pd.DataFrame(employees_data)
salaries_df = pd.DataFrame(salaries_data)
pandas_ai([employees_df, salaries_df], "Who gets paid the most?")
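
To answer this question, the two frames have to be joined on their shared `EmployeeID` column. The plain-pandas sketch below shows that join and the lookup of the highest salary. It is a manual equivalent for checking the result, not PandasAI's internal logic:

```python
import pandas as pd

employees_df = pd.DataFrame({
    "EmployeeID": [1, 2, 3, 4, 5],
    "Name": ["John", "Emma", "Liam", "Olivia", "William"],
    "Department": ["HR", "Sales", "IT", "Marketing", "Finance"],
})
salaries_df = pd.DataFrame({
    "EmployeeID": [1, 2, 3, 4, 5],
    "Salary": [5000, 6000, 4500, 7000, 5500],
})

# Join the two tables on the shared key, then pick the row with the largest salary.
merged = employees_df.merge(salaries_df, on="EmployeeID", how="inner")
top = merged.loc[merged["Salary"].idxmax()]
print(top["Name"], top["Salary"])
```

With this data the highest-paid employee is Olivia, with a salary of 7000.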