II. Using Neo4j (Building a Knowledge Graph of Character Relationships in The Legend of the Condor Heroes)

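This article summarizes how to deploy Neo4j and how to use its client interface. The environment used here is Linux; Windows users can download the corresponding installer from the official website. The client interface works the same way on every operating system.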

Contents

1. Deploying Neo4j

2. Using the Neo4j client


1. Deploying Neo4j

  # 1. Download the package
  wget https://labfile.oss.aliyuncs.com/courses/1354/neo4j-community-3.5.1-unix.tar.gz
  # 2. Extract it
  tar -xzvf neo4j-community-3.5.1-unix.tar.gz
  # View the Neo4j directory layout
  tree neo4j-community-3.5.1
  neo4j-community-3.5.1
  ├── LICENSE.txt
  ├── LICENSES.txt
  ├── NOTICE.txt
  ├── README.txt
  ├── UPGRADE.txt
  ├── bin
  │   ├── cypher-shell
  │   ├── neo4j
  │   ├── neo4j-admin
  │   ├── neo4j-import
  │   └── tools
  │       ├── 2.x-config-migrator.jar
  │       └── cypher-shell-all.jar
  ├── conf
  │   └── neo4j.conf
  ├── data
  │   └── databases
  ├── import
  ├── lib
  │   ├── FastInfoset-1.2.13.jar
  │   ├── ...
  │   └── txw2-2.3.0.1.jar
  ├── logs
  ├── plugins
  │   └── README.txt
  └── run

Main directories:

  • bin: the Neo4j executables
  • conf: the configuration files that control how Neo4j starts
  • data: the core database files
  • plugins: Neo4j plugins
  • logs: the logs Neo4j writes while running
  • import: CSV files to be imported into the database

  # 3. Start Neo4j (in this environment the neo4j executable is under /usr/bin)
  cd /usr/bin
  sudo ./neo4j start

Neo4j takes a while to start. To follow its progress, check the neo4j.log file in the /var/log/neo4j directory: once the last line reads Remote interface available at http://localhost:7474/, Neo4j has fully started and its HTTP port is 7474.
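The log check described above can be scripted. The following is a minimal sketch, assuming the log path given in the text; the function names neo4j_started and wait_for_neo4j, and the default of 60 attempts, are illustrative choices and not part of Neo4j.

```shell
# Succeed when the last line of the log is the startup message quoted above.
# The default path is the one mentioned in the text; pass another path if needed.
neo4j_started() {
    log_file="${1:-/var/log/neo4j/neo4j.log}"
    [ -r "$log_file" ] &&
        tail -n 1 "$log_file" | grep -q "Remote interface available at"
}

# Poll the log once per second until the message appears or the attempts run out.
# Returns 0 when Neo4j has started, 1 otherwise.
wait_for_neo4j() {
    log_file="$1"
    attempts="${2:-60}"
    while [ "$attempts" -gt 0 ]; do
        if neo4j_started "$log_file"; then
            return 0
        fi
        attempts=$((attempts - 1))
        sleep 1
    done
    return 1
}
```

A call such as wait_for_neo4j /var/log/neo4j/neo4j.log 30 could follow the start command, so that later steps only run once the HTTP interface is reported as available.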

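Open a browser and enter http://127.0.0.1:7474/browser/ in the address bar to access the Neo4j database. On the first visit the default username is neo4j and the default password is neo4j. After you enter the password and click Connect, you are prompted to change the initial password. Once that is done, the Neo4j client interface appears.

2. Using the Neo4j client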

  # Import the data
  sudo neo4j stop
  sudo wget https://labfile.oss.aliyuncs.com/courses/1354/graph.db.dump
  sudo neo4j-admin load --from=graph.db.dump --database=graph.db --force
  sudo neo4j start

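Once Neo4j has started again, refresh the browser to see the knowledge graph of the Condor Trilogy.

The status bar is on the left. Its Database Information panel shows information about the database:

  • Node Labels shows the node information: there are 64 nodes in total, in four categories.
  • Relationship Types shows the relationship information: there are 115 relationships in total, in eight categories.
  • Property Keys shows the properties that the nodes and relationships carry.
  • Connected as shows the connection information.
  • Database shows the database information.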

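Clicking 角色 (character) under Node Labels brings up the query result for that label in the view window on the right. The top of the result window shows the query that the click was converted into: MATCH (n:`角色`) RETURN n LIMIT 25, which returns at most 25 nodes labelled 角色.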
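The same query can also be built outside the browser. The sketch below is an illustration under assumptions: label_query is a hypothetical helper name, and the label is assumed not to contain a backtick itself. It only prints the Cypher text; the comment shows how it might be passed to the cypher-shell program found in the bin directory.

```shell
# Print the query the browser generates when a label is clicked.
# $1 is the label, $2 the row limit (25 by default, as in the browser).
# Backticks quote the label so that non-ASCII names such as 角色 are accepted.
label_query() {
    printf 'MATCH (n:`%s`) RETURN n LIMIT %d\n' "$1" "${2:-25}"
}

# Possible use with cypher-shell (NEO4J_PASSWORD is an assumed environment variable):
#   cypher-shell -u neo4j -p "$NEO4J_PASSWORD" "$(label_query 角色 10)"
```

Keeping the query text in one function makes it easy to change the label or the limit without retyping the statement.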

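Graph shows the query result as a graph.

Table shows the query result as a table.

Text shows the query result as plain text.

Code shows a JSON-like result that includes the version information, the server address, the query statement, the JSON content of the request, and the JSON content of the response.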

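In the Graph tab, clicking the 杨康 (Yang Kang) node displays that node's properties at the bottom of the window.

At the same time, three buttons appear around the node for rearranging the graph:

  • The lock button at the upper left unlocks the node, releasing a node whose position has been fixed so that the layout can move it again.
  • The tree button at the bottom displays all nodes connected to this node.
  • The eye button at the upper right hides the node; after it is clicked, the node and the edges connected to it disappear.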

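The top of the Graph tab lists the categories of nodes and relationships. Clicking a category lets you set its color, size, and displayed caption.

This article has covered the basic operation of Neo4j. In real deployments the configuration becomes more involved: changing the access ports, setting directory paths, tuning performance, configuring logging, and so on. These settings are shown in the annotated neo4j.conf below.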

  #*****************************************************************
  # Neo4j configuration
  #
  # For more details and a complete list of settings, please see
  # https://neo4j.com/docs/operations-manual/current/reference/configuration-settings/
  #*****************************************************************
  # To store the database files in a custom location, change both dbms.active_database and dbms.directories.data.
  # The data is then stored under ${dbms.directories.data}/databases/${dbms.active_database}.
  # The name of the database to mount. By default this is the ${NEO4J_HOME}/data/databases/graph.db directory.
  #dbms.active_database=graph.db
  # Paths of directories in the installation. By default they are relative to $NEO4J_HOME.
  # Data directory
  #dbms.directories.data=data
  # Plugin directory
  #dbms.directories.plugins=plugins
  # Certificate directory
  #dbms.directories.certificates=certificates
  # Log directory
  #dbms.directories.logs=logs
  # Directory for jar libraries
  #dbms.directories.lib=lib
  # Runtime directory
  #dbms.directories.run=run
  # This setting constrains all `LOAD CSV` import files to be under the `import` directory. Remove or comment it out to allow files to be loaded from anywhere in the filesystem; this introduces possible security problems. See the `LOAD CSV` section of the manual for details.
  # In other words, by default CSV files can only be loaded from ${NEO4J_HOME}/import. Commenting out the line below allows absolute paths in LOAD CSV, which may be unsafe.
  dbms.directories.import=import
  # Whether requests to Neo4j are authenticated.
  # To disable authentication, uncomment this line. Connecting to the database then no longer requires a password.
  #dbms.security.auth_enabled=false
  # Enable this to be able to upgrade a store from an older version.
  dbms.allow_format_migration=true
  # Java Heap Size: by default the Java heap size is dynamically calculated based on available system resources.
  # Uncomment these lines to set specific initial and maximum heap size.
  #dbms.memory.heap.initial_size=512m
  #dbms.memory.heap.max_size=512m
  # The amount of memory to use for mapping the store files, in bytes (or kilobytes with the 'k' suffix, megabytes with 'm' and gigabytes with 'g').
  # If Neo4j is running on a dedicated server, then it is generally recommended to leave about 2-4 gigabytes for the operating system, give the JVM enough heap to hold all your transaction state and query context, and then leave the rest for the page cache.
  # The default page cache memory assumes the machine is dedicated to running Neo4j, and is heuristically set to 50% of RAM minus the max Java heap size.
  #dbms.memory.pagecache.size=10g
  ### Network connector configuration
  # With default configuration Neo4j only accepts local connections (localhost).
  # To accept non-local connections, uncomment this line.
  # (It is shown uncommented here, so the server can be reached by IP address.)
  dbms.connectors.default_listen_address=0.0.0.0
  # You can also choose a specific network interface, and configure a non-default port for each connector, by setting their individual listen_address.
  # The address at which this server can be reached by its clients. This may be the server's IP address or DNS name, or it may be the address of a reverse proxy which sits in front of the server. This setting may be overridden for individual connectors below.
  #dbms.connectors.default_advertised_address=localhost
  # You can also choose a specific advertised hostname or IP address, and configure an advertised port for each connector, by setting their individual advertised_address.
  # Bolt connector
  dbms.connector.bolt.enabled=true
  dbms.connector.bolt.tls_level=OPTIONAL
  dbms.connector.bolt.listen_address=:7687
  # HTTP Connector. There must be exactly one HTTP connector.
  dbms.connector.http.enabled=true
  dbms.connector.http.listen_address=:7474
  # HTTPS Connector. There can be zero or one HTTPS connectors.
  dbms.connector.https.enabled=true
  dbms.connector.https.listen_address=:7473
  # Number of Neo4j worker threads.
  #dbms.threads.worker_count=
  # Logging configuration
  # To enable HTTP logging, uncomment this line.
  dbms.logs.http.enabled=true
  # Number of HTTP logs to keep.
  #dbms.logs.http.rotation.keep_number=5
  # Size of each HTTP log that is kept.
  dbms.logs.http.rotation.size=20m
  # To enable GC Logging, uncomment this line.
  #dbms.logs.gc.enabled=true
  # GC Logging Options see http://docs.oracle.com/cd/E19957-01/819-0084-10/pt_tuningjava.html#wp57013 for more information.
  #dbms.logs.gc.options=-XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:+PrintGCApplicationStoppedTime -XX:+PrintPromotionFailure -XX:+PrintTenuringDistribution
  # Number of GC logs to keep.
  #dbms.logs.gc.rotation.keep_number=5
  # Size of each GC log that is kept.
  #dbms.logs.gc.rotation.size=20m
  # Size threshold for rotation of the debug log, that is, the size at which the log file is split. If set to zero then no rotation will occur. Accepts a binary suffix "k", "m" or "g".
  #dbms.logs.debug.rotation.size=20m
  # Maximum number of history files for the internal log.
  #dbms.logs.debug.rotation.keep_number=7
  ### Miscellaneous configuration
  # Enable this to specify a parser other than the default one.
  #cypher.default_language_version=3.0
  # Determines if Cypher will allow using file URLs when loading data using `LOAD CSV`. Setting this value to `false` will cause Neo4j to fail `LOAD CSV` clauses that load data from the file system.
  dbms.security.allow_csv_import_from_file_urls=true
  # Retention policy for transaction logs needed to perform recovery and backups.
  #dbms.tx_log.rotation.retention_policy=7 days
  # Enable a remote shell server which Neo4j Shell clients can log in to.
  dbms.shell.enabled=true
  # The network interface IP the shell will listen on (use 0.0.0.0 for all interfaces).
  dbms.shell.host=127.0.0.1
  # The port the shell will listen on, default is 1337.
  dbms.shell.port=1337
  # Only allow read operations from this Neo4j instance. This mode still requires write access to the directory for lock purposes.
  #dbms.read_only=false
  # Comma separated list of JAX-RS packages containing JAX-RS resources, one package name for each mountpoint. The listed package names will be loaded under the mountpoints specified. Uncomment this line to mount the org.neo4j.examples.server.unmanaged.HelloWorldResource.java from neo4j-server-examples under /examples/unmanaged, resulting in a final URL of http://localhost:7474/examples/unmanaged/helloworld/{nodeId}
  #dbms.unmanaged_extension_classes=org.neo4j.examples.server.unmanaged=/examples/unmanaged
  # JVM Parameters
  # G1GC generally strikes a good balance between throughput and tail latency, without too much tuning.
  dbms.jvm.additional=-XX:+UseG1GC
  # Have common exceptions keep producing stack traces, so they can be debugged regardless of how often logs are rotated.
  dbms.jvm.additional=-XX:-OmitStackTraceInFastThrow
  # Make sure that `initmemory` is not only allocated, but committed to the process, before starting the database. This reduces memory fragmentation, increasing the effectiveness of transparent huge pages. It also reduces the possibility of seeing performance drop due to heap-growing GC events, where a decrease in available page cache leads to an increase in mean IO response time. Try reducing the heap memory, if this flag degrades performance.
  dbms.jvm.additional=-XX:+AlwaysPreTouch
  # Trust that non-static final fields are really final. This allows more optimizations and improves overall performance. NOTE: Disable this if you use embedded mode, or have extensions or dependencies that may use reflection or serialization to change the value of final fields!
  dbms.jvm.additional=-XX:+UnlockExperimentalVMOptions
  dbms.jvm.additional=-XX:+TrustFinalNonStaticFields
  # Disable explicit garbage collection, which is occasionally invoked by the JDK itself.
  dbms.jvm.additional=-XX:+DisableExplicitGC
  # Remote JMX monitoring, uncomment and adjust the following lines as needed. Absolute paths to jmx.access and jmx.password files are required.
  # Also make sure to update the jmx.access and jmx.password files with appropriate permission roles and passwords, the shipped configuration contains only a read only role called 'monitor' with password 'Neo4j'.
  # For more details, see: http://download.oracle.com/javase/8/docs/technotes/guides/management/agent.html On Unix based systems the jmx.password file needs to be owned by the user that will run the server, and have permissions set to 0600.
  # For details on setting these file permissions on Windows see: http://docs.oracle.com/javase/8/docs/technotes/guides/management/security-windows.html
  #dbms.jvm.additional=-Dcom.sun.management.jmxremote.port=3637
  #dbms.jvm.additional=-Dcom.sun.management.jmxremote.authenticate=true
  #dbms.jvm.additional=-Dcom.sun.management.jmxremote.ssl=false
  #dbms.jvm.additional=-Dcom.sun.management.jmxremote.password.file=/absolute/path/to/conf/jmx.password
  #dbms.jvm.additional=-Dcom.sun.management.jmxremote.access.file=/absolute/path/to/conf/jmx.access
  # Some systems cannot discover host name automatically, and need this line configured:
  #dbms.jvm.additional=-Djava.rmi.server.hostname=$THE_NEO4J_SERVER_HOSTNAME
  # Expand Diffie Hellman (DH) key size from default 1024 to 2048 for DH-RSA cipher suites used in server TLS handshakes.
  # This is to protect the server from any potential passive eavesdropping.
  dbms.jvm.additional=-Djdk.tls.ephemeralDHKeySize=2048
  ### Wrapper Windows NT/2000/XP Service Properties
  # WARNING - Do not modify any of these properties when an application using this configuration file has been installed as a service.
  # Please uninstall the service before modifying this section. The service can then be reinstalled.
  # Name of the service
  dbms.windows_service_name=neo4j
  ### Other Neo4j system properties
  dbms.jvm.additional=-Dunsupported.dbms.udc.source=zip
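
Because the file mixes active settings with many commented-out ones, it can help to list only the values that are actually in effect. The following is a minimal sketch, not a Neo4j tool: conf_get is a hypothetical helper name, and it assumes that each setting starts at the beginning of its line in the form key=value.

```shell
# Print every active (uncommented) value of one setting, one value per line.
# $1 is the path to neo4j.conf, $2 is the exact setting name.
# Lines starting with "#" never match because the key must begin the line.
conf_get() {
    awk -v key="$2" '
        index($0, key "=") == 1 { print substr($0, length(key) + 2) }
    ' "$1"
}

# Example calls (the conf path is the one from the directory tree above):
#   conf_get conf/neo4j.conf dbms.connector.http.listen_address
#   conf_get conf/neo4j.conf dbms.jvm.additional
```

A setting such as dbms.jvm.additional appears several times in the file, which is why the helper prints all matching lines rather than only the first one.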