[Question] Doris FE not ready after rebooting FE/BE  #290

@ming12713

Description

Search before asking

  • I had searched in the issues and found no similar issues.

Version

2.11

What's Wrong?

2024-11-12 08:05:14,341 INFO (stateListener|83) [DatabaseTransactionMgr.replayUpsertTransactionState():2158] replay a COMMITTED transaction TransactionState. transaction id: 3917247, label: vtc_source_nome__KC_ods_vtc_source_nome__KC_2__KC_loshu_ods_vtc_source_nome__KC_0__KC_495084__KC_1730487306367, db id: 11154, table id list: 508351, callback id: -1, coordinator: BE: 10.42.1.19, transaction status: COMMITTED, error replicas num: 0, replica ids: , prepare time: 1730487306375, commit time: 1730487307994, finish time: -1, reason: 
2024-11-12 08:05:14,341 INFO (stateListener|83) [DatabaseTransactionMgr.replayUpsertTransactionState():2158] replay a COMMITTED transaction TransactionState. transaction id: 3917245, label: vtc_source_nome__KC_ods_vtc_source_nome__KC_1__KC_loshu_ods_vtc_source_nome__KC_0__KC_494027__KC_1730487305225, db id: 11154, table id list: 508351, callback id: -1, coordinator: BE: 10.42.1.19, transaction status: COMMITTED, error replicas num: 0, replica ids: , prepare time: 1730487305236, commit time: 1730487308002, finish time: -1, reason: 
2024-11-12 08:05:14,341 INFO (stateListener|83) [OlapTable.updateVisibleVersionAndTime():2591] updateVisibleVersionAndTime, tableName: ods_vtc_source_nome, visibleVersion, 344672, visibleVersionTime: 1730487308007
2024-11-12 08:05:14,341 INFO (stateListener|83) [DatabaseTransactionMgr.replayUpsertTransactionState():2158] replay a VISIBLE transaction TransactionState. transaction id: 3917247, label: vtc_source_nome__KC_ods_vtc_source_nome__KC_2__KC_loshu_ods_vtc_source_nome__KC_0__KC_495084__KC_1730487306367, db id: 11154, table id list: 508351, callback id: -1, coordinator: BE: 10.42.1.19, transaction status: VISIBLE, error replicas num: 0, replica ids: , prepare time: 1730487306375, commit time: 1730487307994, finish time: 1730487308007, reason: 
2024-11-12 08:05:14,342 INFO (stateListener|83) [OlapTable.updateVisibleVersionAndTime():2591] updateVisibleVersionAndTime, tableName: ods_vtc_source_nome, visibleVersion, 344673, visibleVersionTime: 1730487308018
2024-11-12 08:05:14,342 INFO (stateListener|83) [DatabaseTransactionMgr.replayUpsertTransactionState():2158] replay a VISIBLE transaction TransactionState. transaction id: 3917245, label: vtc_source_nome__KC_ods_vtc_source_nome__KC_1__KC_loshu_ods_vtc_source_nome__KC_0__KC_494027__KC_1730487305225, db id: 11154, table id list: 508351, callback id: -1, coordinator: BE: 10.42.1.19, transaction status: VISIBLE, error replicas num: 0, replica ids: , prepare time: 1730487305236, commit time: 1730487308002, finish time: 1730487308018, reason: 
2024-11-12 08:05:14,342 INFO (stateListener|83) [DatabaseTransactionMgr.replayUpsertTransactionState():2158] replay a COMMITTED transaction TransactionState. transaction id: 3917244, label: nome_raw_data__KC_ods_vtc_nome_raw_data__KC_2__KC_loshu_ods_vtc_nome_raw_data__KC_0__KC_1611187__KC_1730487305168, db id: 11154, table id list: 74966, callback id: -1, coordinator: BE: 10.42.1.19, transaction status: COMMITTED, error replicas num: 0, replica ids: , prepare time: 1730487305177, commit time: 1730487308558, finish time: -1, reason: 
2024-11-12 08:05:14,342 INFO (stateListener|83) [DatabaseTransactionMgr.replayUpsertTransactionState():2158] replay a COMMITTED transaction TransactionState. transaction id: 3917246, label: nome_raw_data__KC_ods_vtc_nome_raw_data__KC_1__KC_loshu_ods_vtc_nome_raw_data__KC_0__KC_1612525__KC_1730487305321, db id: 11154, table id list: 74966, callback id: -1, coordinator: BE: 10.42.1.19, transaction status: COMMITTED, error replicas num: 0, replica ids: , prepare time: 1730487305400, commit time: 1730487308568, finish time: -1, reason: 
/opt/apache-doris/fe/bin/start_fe.sh: line 265:   162 Killed                  ${LIMIT:+${LIMIT}} "${JAVA}" ${final_java_opt:+${final_java_opt}} -XX:-OmitStackTraceInFastThrow -XX:OnOutOfMemoryError="kill -9 %p" ${coverage_opt:+${coverage_opt}} org.apache.doris.DorisFE ${HELPER:+${HELPER}} ${OPT_VERSION:+${OPT_VERSION}} "${METADATA_FAILURE_RECOVERY}" "$@" < /dev/null

Doris was installed via the Operator with 1 FE node and 1 BE node. After restarting both the FE and the BE, the FE fails to start normally and reports the error above. The BE IP 10.42.1.19 shown in the log is the previous BE pod IP, not the Service (SVC) IP. The FE is configured to use Service-based discovery, and the BE pod IP is now 10.42.1.6.
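To check whether the replayed entries are historical records rather than current state, the epoch-millisecond fields in the log can be decoded. The following is a minimal sketch, not part of Doris: `parse_replay_line` is a hypothetical helper, `LINE` is a shortened copy of the first replayed line above, and treating `-1` as "not set" is an assumption based on how the COMMITTED and VISIBLE lines differ.

```python
import re
from datetime import datetime, timezone

# Shortened copy of one replayed line from the FE log above.
LINE = (
    "replay a COMMITTED transaction TransactionState. transaction id: 3917247, "
    "coordinator: BE: 10.42.1.19, transaction status: COMMITTED, "
    "prepare time: 1730487306375, commit time: 1730487307994, finish time: -1"
)


def parse_replay_line(line):
    """Extract the coordinator IP and the epoch-millisecond timestamps."""
    coordinator = re.search(r"coordinator: BE: ([\d.]+)", line).group(1)
    times = {}
    for name in ("prepare", "commit", "finish"):
        ms = int(re.search(rf"{name} time: (-?\d+)", line).group(1))
        # Assumption: a negative value means the field has not been set.
        times[name] = (
            None if ms < 0 else datetime.fromtimestamp(ms / 1000, tz=timezone.utc)
        )
    return coordinator, times


coordinator, times = parse_replay_line(LINE)
print(coordinator, times["prepare"].strftime("%Y-%m-%d %H:%M:%S UTC"))
```

For this line the prepare time decodes to 2024-11-01 18:55:06 UTC, well before the 2024-11-12 restart, which would suggest that 10.42.1.19 is the coordinator address recorded when the transaction was originally written.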

Pod network CIDR: 10.42.1.x/16

Service (SVC) network CIDR: 10.43.48.x

What You Expected?

The FE should start normally after both the FE and BE are restarted.

How to Reproduce?

No response

Anything Else?

No response

Are you willing to submit PR?

  • Yes I am willing to submit a PR!
