合 StarRocks安装常见错误整理
- Error: Could not find or load main class com.starrocks.StarRocksFE
- java.net.ConnectException: Connection refused (Connection refused)
- 如何在配置文件 fe.conf 中 priority_networks 参数下配置固定 IP?
- 安装 BE 节点后启动失败,并返回错误 "StarRocks BE http service did not start correctly, exiting"。我该如何解决?
- 在部署企业版 StarRocks 的过程当中,配置节点时报错:“Failed to Distribute files to node”。我该如何解决?
- StarRocks 是否支持动态修改 FE、BE 配置项?
- 为 BE 节点增加磁盘空间后,数据存储无法均衡负载且报错 “Failed to get scan range, no queryable replica found in tablet: xxxxx”。我该如何解决?
- 重启集群时,FE 启动失败并报错 “Fe type:unknown ,is ready :false”。我该如何解决?
- 安装集群时报错 “failed to get service info err”。我该如何解决?
- BE 启动失败并报错 “Fail to get master client from cache. host= port=0 code=THRIFT_RPC_ERROR“。我该如何解决?
- 通过 StarRocks Manager 升级集群时报错 “Failed to transport upgrade files to agent host. src:…”。我该如何解决?
- 新扩容节点的 FE 状态正常,但是在 StarRocks Manager 的 诊断页面下,该 FE 节点日志展示报错 “Failed to search log“。我该如何解决?
- 启动 FE 失败并报错 “exceeds max permissable delta:5000ms”。我该如何解决?
- 如果 BE 节点有多块磁盘做存储,如何设置 storage_root_path 配置项?
- 添加新的 FE 节点至集群后报错 “invalid cluster id: xxxxxxxx”。我该如何解决?
- 当前 FE 节点已经启动,且状态为 transfer:follower,但是调用 show frontends 命令返回 isAlive 状态为 false。我该如何解决?
- 查询报错 “could not initialize class com.starrocks.rpc.BackendServiceProxy”。我该如何解决?
Error: Could not find or load main class com.starrocks.StarRocksFE
请使用下载地址:https://www.mirrorship.cn/zh-CN/download/community进行下载二进制包,大小为2.2G
不要使用github上的地址进行下载,github上的是源码包,编译才能使用,大小为50MB。
二进制安装包已经包括了文件starrocks-fe.jar:
1 2 | [root@starrocks253 soft]# find / -name starrocks-fe.jar /usr/local/StarRocks-2.5.3/fe/lib/starrocks-fe.jar |
java.net.ConnectException: Connection refused (Connection refused)
添加BE节点后,不能启动,查询“show backends;”,列ErrMsg显示:java.net.ConnectException: Connection refused (Connection refused)
1 2 3 4 5 6 7 8 | mysql> show backends; +-----------+--------------+---------------+--------+----------+----------+---------------------+---------------------+-------+----------------------+-----------------------+-----------+------------------+---------------+---------------+---------+----------------+--------------------------------------------------------------------+---------------+--------------------------------------------------------+-------------------+-------------+----------+-------------------+------------+------------+ | BackendId | IP | HeartbeatPort | BePort | HttpPort | BrpcPort | LastStartTime | LastHeartbeat | Alive | SystemDecommissioned | ClusterDecommissioned | TabletNum | DataUsedCapacity | AvailCapacity | TotalCapacity | UsedPct | MaxDiskUsedPct | ErrMsg | Version | Status | DataTotalCapacity | DataUsedPct | CpuCores | NumRunningQueries | MemUsedPct | CpuUsedPct | +-----------+--------------+---------------+--------+----------+----------+---------------------+---------------------+-------+----------------------+-----------------------+-----------+------------------+---------------+---------------+---------+----------------+--------------------------------------------------------------------+---------------+--------------------------------------------------------+-------------------+-------------+----------+-------------------+------------+------------+ | 10272 | 192.18.0.137 | 9050 | -1 | -1 | -1 | NULL | NULL | false | false | false | 0 | 0.000 | 1.000 B | 0.000 | 0.00 % | 0.00 % | java.net.ConnectException: Connection refused (Connection refused) | | {"lastSuccessReportTabletsTime":"N/A"} | 0.000 | 0.00 % | 0 | 0 | 0.00 % | 0.0 % | | 10007 | 192.18.0.138 | 9050 | 9060 | 8040 | 8060 | 2023-03-29 15:37:36 | 2023-03-29 15:52:52 | true | false | false | 39 | 2.595 KB | 285.514 GB | 494.971 GB | 42.32 % | 42.32 % | | 2.5.3-46bf084 | {"lastSuccessReportTabletsTime":"2023-03-29 15:52:36"} | 285.514 GB | 0.00 % | 4 | 0 | 1.03 % | 0.2 % | +-----------+--------------+---------------+--------+----------+----------+---------------------+---------------------+-------+----------------------+-----------------------+-----------+------------------+---------------+---------------+---------+----------------+--------------------------------------------------------------------+---------------+--------------------------------------------------------+-------------------+-------------+----------+-------------------+------------+------------+ 2 rows in set (0.04 sec) |
查看日志:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 | I0329 15:51:01.494788 31548 backend_options.cpp:77] localhost 192.18.0.137 I0329 15:51:01.496893 31548 exec_env.cpp:421] Set storage page cache size 2698684563 I0329 15:51:01.497128 31558 daemon.cpp:188] Current memory statistics: process(28953072), query_pool(0), load(0), metadata(0), compaction(0), schema_change(0), column_pool(0), page_cache(0), update(0), chunk_allocator(0), clone(0), consistency(0) I0329 15:51:01.498107 31560 data_dir.cpp:113] path: /usr/local/starrocks/be/storage, hash: 8442776031408076163 I0329 15:51:01.526329 31627 data_dir.cpp:237] start to load tablets from /usr/local/starrocks/be/storage I0329 15:51:01.526365 31627 data_dir.cpp:243] begin loading rowset from meta I0329 15:51:01.527894 31627 data_dir.cpp:261] load rowset from meta finished, data dir: /usr/local/starrocks/be/storage I0329 15:51:01.527917 31627 data_dir.cpp:266] begin loading tablet from meta I0329 15:51:01.528043 31627 data_dir.cpp:302] load tablet from meta finished, loaded tablet: 0, error tablet: 0, path: /usr/local/starrocks/be/storage I0329 15:51:01.537509 31693 fragment_mgr.cpp:516] FragmentMgr cancel worker start working. I0329 15:51:01.546819 31548 exec_env.cpp:173] [PIPELINE] Exec thread pool: thread_num=4 W0329 15:51:01.770228 31548 stack_util.cpp:128] 2023-03-29 15:51:01.770140, query_id=00000000-0000-0000-0000-000000000000, fragment_instance_id=00000000-0000-0000-0000-000000000000 throws exception: std::system_error, trace: @ 0x2a33ce0 std::__throw_system_error() @ 0x7cabc59 std::thread::_M_start_thread() @ 0x46f1534 starrocks::RuntimeFilterWorker::RuntimeFilterWorker() @ 0x464de29 starrocks::ExecEnv::_init() @ 0x464ed42 starrocks::ExecEnv::init() @ 0x2a36ea1 main @ 0x7f0cb60d6555 __libc_start_main @ 0x2b5172f (unknown) @ (nil) (unknown) |
不能启动的原因是:可用内存不足,请保证可用内存有10G以上。
查询资源:
1 2 3 4 | [root@PT-Test-12 storage]# free -h total used free shared buff/cache available Mem: 15G 5.7G 801M 802M 9.0G 8.7G Swap: 4.0G 503M 3.5G |
杀掉部分占用内存的资源后:
1 2 3 4 | [root@PT-Test-12 ~]# free -h total used free shared buff/cache available Mem: 15G 839M 5.6G 802M 9.1G 13G Swap: 4.0G 31M 4.0G |