<error> <unique>0x00000000005760B0</unique> <tid>1</tid> <kind>Leak_DefinitelyLost</kind> <xwhat> <text>32 bytes in 1 blocks are lost in loss record 1167725 of 1387445 (#5726384)</text> <leakedbytes>32</leakedbytes> <leakedblocks>1</leakedblocks> </xwhat> <stack> <frame> <fn>malloc</fn> </frame> <frame> <ip>0x00007FFA1663F24B</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\libstdc++-6.dll</obj> <fn>_Znwy</fn> </frame> <frame> <ip>0x000000006891BFF4</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Core.dll</obj> <fn>_ZN12QEasingCurveC1ENS_4TypeE</fn> </frame> <frame> <ip>0x000000006888B336</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Core.dll</obj> <fn>_ZN17QVariantAnimation12valueChangedERK8QVariant</fn> </frame> <frame> <ip>0x000000006888FF53</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Core.dll</obj> <fn>_ZN18QPropertyAnimationC1EP7QObjectRK10QByteArrayS1_</fn> </frame> <frame> <ip>0x000000000ACF8558</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuWidgets.dll</obj> <fn>_ZN14TjuWidgetTitle11setFloatingEb</fn> </frame> <frame> <ip>0x000000000AD44218</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuWidgets.dll</obj> <fn>_ZN14TjuWidgetTitle11setFloatingEb</fn> </frame> <frame> <ip>0x000000000AE7B470</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuWidgets.dll</obj> <fn>_ZN11TjuECEngine21TjuS57MilitaryManagerC1EP7QWidget</fn> </frame> <frame> <ip>0x000000000042EE9B</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuMain.exe</obj> </frame> <frame> <ip>0x00000000004165CE</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuMain.exe</obj> </frame> <frame> <ip>0x00000000008BF0DF</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuMain.exe</obj> </frame> <frame> <ip>0x00000000004013C6</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuMain.exe</obj> </frame> <frame> <ip>0x00000000004014CA</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuMain.exe</obj> </frame> </stack> </error>

时间: 2025-07-04 17:17:35 浏览: 5
### 内存泄漏分析:QVariantAnimation 和 QPropertyAnimation 引发的 Qt 应用问题 在基于 Qt 的应用程序中,`QVariantAnimation` 及其派生类(如 `QPropertyAnimation`)常用于实现控件属性的平滑过渡动画。然而,若未正确管理这些动画对象的生命周期,可能导致内存泄漏,特别是在结合自定义库(如 `TjuWidgets.dll`)进行复杂 UI 交互时。此类问题通常表现为动画对象未被销毁、信号连接未释放或资源未回收等情况[^1]。 #### 动画对象未被显式删除 Qt 的动画系统允许通过 `start()` 启动动画,并在动画完成后自动调用 `deleteLater()` 以确保异步销毁。但在某些场景下,例如: ```cpp QPropertyAnimation *animation = new QPropertyAnimation(widget, "geometry"); animation->setDuration(1000); animation->setStartValue(QRect(0, 0, 100, 30)); animation->setEndValue(QRect(250, 250, 100, 30)); animation->start(); ``` 若动画对象未设置父对象且未手动调用 `delete` 或未连接到适当的信号(如 `finished()`)以触发清理逻辑,则可能在动画结束后仍驻留在内存中。建议使用以下方式确保对象最终被释放: ```cpp connect(animation, &QAbstractAnimation::finished, animation, &QObject::deleteLater); ``` #### 自定义控件与动画资源管理不当 在 `TjuWidgets.dll` 中,若 `TjuWidgetTitle` 或其他组件内部创建了动画实例但未遵循上述清理机制,可能导致动画对象累积。此外,若动画作用于某个未正确设置父子关系的对象,则其生命周期无法被自动管理,从而引发内存泄漏。应确保所有动态分配的动画对象具有合适的父级上下文: ```cpp QPropertyAnimation *animation = new QPropertyAnimation(widget, "geometry", this); ``` 其中 `this` 表示当前控件作为父对象,当控件被销毁时,其子对象(包括动画)也将自动删除。 #### 信号连接导致 QObject 持有引用 Qt 的信号与槽机制会维护连接对象的引用计数。如果 `QPropertyAnimation` 实例与某个 `QObject` 建立连接但未在动画结束时断开,则接收方可能因引用未释放而无法被销毁。例如: ```cpp connect(animation, &QPropertyAnimation::valueChanged, this, &MyClass::onAnimationValueChanged); ``` 若 `this` 所属对象在动画运行期间被销毁而未调用 `disconnect()`,则可能造成内存泄漏。推荐使用 `Qt::QueuedConnection` 或绑定至 `finished` 信号执行清理操作。 #### 使用 Valgrind 或 Visual Leak Detector 进行检测 为精确定位由 `Qt5Core.dll` 中 `QVariantAnimation` 或 `QPropertyAnimation` 引发的内存泄漏,可借助以下工具: - **Valgrind (Linux)** 提供详细的堆内存分配记录,支持追踪未释放的动画对象及其调用栈信息。 - **Visual Leak Detector (Windows)** 集成于 Visual Studio,能捕获 C++ 内存泄漏并展示完整的调用路径,适用于排查 `TjuWidgets.dll` 中未释放的动画资源。 - **Qt Creator 内置对象浏览器** 可查看当前所有 `QObject` 实例及其父子关系,辅助识别未被销毁的动画对象。 #### 示例:安全使用 QPropertyAnimation 并避免泄漏 ```cpp void MyClass::startAnimation(QWidget *widget) { QPropertyAnimation *animation = new QPropertyAnimation(widget, "geometry", this); animation->setDuration(1000); animation->setStartValue(QRect(0, 0, 100, 30)); animation->setEndValue(QRect(250, 250, 100, 30)); connect(animation, &QPropertyAnimation::finished, [this, animation]() { // 确保动画完成后自动销毁 animation->deleteLater(); }); animation->start(); } ``` 在此示例中,动画对象设置了父对象并绑定了 `finished` 信号以触发 `deleteLater()`,从而确保动画完成后的资源释放。
阅读全文

相关推荐

<error> <unique>0x0000000000394304</unique> <tid>1</tid> <kind>Leak_DefinitelyLost</kind> <xwhat> <text>48 bytes in 1 blocks are lost in loss record 998356 of 1387445 (#3752708)</text> <leakedbytes>48</leakedbytes> <leakedblocks>1</leakedblocks> </xwhat> <stack> <frame> <fn>malloc</fn> </frame> <frame> <ip>0x00007FFA1663F24B</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\libstdc++-6.dll</obj> <fn>_Znwy</fn> </frame> <frame> <ip>0x0000000068AC0098</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Core.dll</obj> <fn>_ZN14QObjectPrivate13addConnectionEiPNS_10ConnectionE</fn> </frame> <frame> <ip>0x0000000068AC1418</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Core.dll</obj> <fn>_ZNK7QObject17isSignalConnectedERK11QMetaMethod</fn> </frame> <frame> <ip>0x0000000068AC8523</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Core.dll</obj> <fn>_ZN7QObject7connectEPKS_PKcS1_S3_N2Qt14ConnectionTypeE</fn> </frame> <frame> <ip>0x000000000ACF866F</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuWidgets.dll</obj> <fn>_ZN14TjuWidgetTitle11setFloatingEb</fn> </frame> <frame> <ip>0x000000000AD50895</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuWidgets.dll</obj> <fn>_ZN14TjuWidgetTitle11setFloatingEb</fn> </frame> <frame> <ip>0x000000000AE7956E</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuWidgets.dll</obj> <fn>_ZN11TjuECEngine21TjuS57MilitaryManagerC1EP7QWidget</fn> </frame> <frame> <ip>0x000000000042EE9B</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuMain.exe</obj> </frame> <frame> <ip>0x00000000004165CE</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuMain.exe</obj> </frame> <frame> <ip>0x00000000008BF0DF</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuMain.exe</obj> </frame> <frame> <ip>0x00000000004013C6</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuMain.exe</obj> </frame> <frame> <ip>0x00000000004014CA</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuMain.exe</obj> </frame> </stack> </error>

<error> <unique>0x0000000000711054</unique> <tid>1</tid> <kind>Leak_DefinitelyLost</kind> <xwhat> <text>15625920 bytes in 97662 blocks are lost in loss record 2 of 1387445 (#7409748)</text> <leakedbytes>160</leakedbytes> <leakedblocks>97662</leakedblocks> </xwhat> <stack> <frame> <fn>malloc</fn> </frame> <frame> <ip>0x00000000688E5B37</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Core.dll</obj> <fn>_ZN9QHashData12allocateNodeEi</fn> </frame> <frame> <ip>0x0000000002A5EF5B</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Widgets.dll</obj> <fn>_ZN12QPixmapStyle11qt_metacastEPKc</fn> </frame> <frame> <ip>0x000000000279F5D1</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Widgets.dll</obj> <fn>_ZN11QProxyStyle11qt_metacastEPKc</fn> </frame> <frame> <ip>0x00000000027A3687</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Widgets.dll</obj> <fn>_ZN11QProxyStyle11qt_metacastEPKc</fn> </frame> <frame> <ip>0x00000000027A3B12</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Widgets.dll</obj> <fn>_ZN11QProxyStyle11qt_metacastEPKc</fn> </frame> <frame> <ip>0x00000000027453CE</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Widgets.dll</obj> <fn>_ZN7QWidget5eventEP6QEvent</fn> </frame> <frame> <ip>0x0000000002707A0F</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Widgets.dll</obj> <fn>_ZN19QApplicationPrivate13notify_helperEP7QObjectP6QEvent</fn> </frame> <frame> <ip>0x000000000270E722</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Widgets.dll</obj> <fn>_ZN12QApplication6notifyEP7QObjectP6QEvent</fn> </frame> <frame> <ip>0x0000000068A95677</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Core.dll</obj> <fn>_ZN16QCoreApplication9sendEventEP7QObjectP6QEvent</fn> </frame> <frame> <ip>0x0000000002740952</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Widgets.dll</obj> <fn>_ZNK7QWidget14ensurePolishedEv</fn> </frame> <frame> <ip>0x0000000002745477</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Widgets.dll</obj> <fn>_ZN7QWidget5eventEP6QEvent</fn> </frame> <frame> <ip>0x0000000002707A0F</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Widgets.dll</obj> <fn>_ZN19QApplicationPrivate13notify_helperEP7QObjectP6QEvent</fn> </frame> <frame> <ip>0x000000000270E722</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Widgets.dll</obj> <fn>_ZN12QApplication6notifyEP7QObjectP6QEvent</fn> </frame> <frame> <ip>0x0000000068A94FF7</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Core.dll</obj> <fn>_ZN16QCoreApplication15notifyInternal2EP7QObjectP6QEvent</fn> </frame> <frame> <ip>0x0000000068A9B056</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Core.dll</obj> <fn>_ZN23QCoreApplicationPrivate16sendPostedEventsEP7QObjectiP11QThreadData</fn> </frame> <frame> <ip>0x000000006A8FFBFD</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\platforms\qwindows.dll</obj> <fn>qt_plugin_instance</fn> </frame> <frame> <ip>0x0000000068AF10D1</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Core.dll</obj> <fn>_ZN21QEventDispatcherWin3213processEventsE6QFlagsIN10QEventLoop17ProcessEventsFlagEE</fn> </frame> <frame> <ip>0x000000006A8FFBE4</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\platforms\qwindows.dll</obj> <fn>qt_plugin_instance</fn> </frame> <frame> <ip>0x000000000287FEF3</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\Qt5Widgets.dll</obj> <fn>_ZN13QSplashScreen11showMessageERK7QStringiRK6QColor</fn> </frame> <frame> <ip>0x000000000ACA17EF</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuWidgets.dll</obj> <fn>_ZN11TjuECEngine15TjuSplashScreen19showProgressMessageEiRK7QString</fn> </frame> <frame> <ip>0x00000000641428E8</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuChartControl.dll</obj> <fn>_ZN11TjuECEngine15TjuChartControlC1EP7QWidget</fn> </frame> <frame> <ip>0x0000000000431BAA</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuMain.exe</obj> </frame> <frame> <ip>0x000000000042F142</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuMain.exe</obj> </frame> <frame> <ip>0x00000000004165CE</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuMain.exe</obj> </frame> <frame> <ip>0x00000000008BF0DF</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuMain.exe</obj> </frame> <frame> <ip>0x00000000004013C6</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuMain.exe</obj> </frame> <frame> <ip>0x00000000004014CA</ip> <obj>E:\q5142\qt5pro\TjuECS_QT5\bin\win\TjuMain.exe</obj> </frame> </stack> </error>

#include <vector> #include <iostream> #include <iomanip> #include <fstream> #include <sstream> #include <string> #include <string.h> #include <stdlib.h> #include <stdio.h> #include <ctype.h> #include <algorithm> #include <fcntl.h> #include <map> #include <math.h> #define chromoLL 44000000 #define maxN 20000000//28000 #define chromoL 900000 #define sWinN 1000 //300 //1000000 //500 //249 //1000000 #define maxLimit 300 #define chromoNo 10 #define maxLarge 56000 //60000 ,number of proteins #define maxNumFea 30000 //600000 #define Half 8000 #define trainPairs 14 #define M0 4 //4 #define M1 20 //10 positions #define M2 10 //10 intervals, jumping distance //#define PNclass "+1" using namespace std; //int pairRedund[maxLarge][2]; //int pairNoR[maxLarge][2]; //unsigned m,way,actualway; string feaArray[maxNumFea]; bool flagDomainA[maxLarge]; bool flagFeature[maxNumFea]; bool flagFeature2[maxNumFea]; long int totalF=0; long int totalF_ys=0; long int totalLine=0,totalE1=0,totalE2=0; long double threshV=0.01, eValueOri=1.0, nLE_eValueOri; vector <long int>arrayMethy[chromoNo]; vector <long int>arrayMethy_2[chromoNo]; int sP,eP,chrNo; int numP1,numP2,numP3; string annoStr,acceStr,sPstr,ePstr,geneTypeStr,geneOrientStr,chrNoStr; int arrayRow1[chromoLL]; int arrayRow2[chromoLL]; vector<int>posV; vector<long double>logPV; vector<int>meUnV; vector<string> vectorQ; ofstream coutE2("check_____formatError.txt"); ofstream coutE3("check_____startLargerEnd.txt"); ofstream coutE4("check_____repeatedGeneTypes.txt"); ofstream cout00X("irregular_ACCE_positions.txt"); ofstream coutSingleACCE("singleACCE.txt"); //ofstream fileOutE("Evalue larger than threshV.txt"); //ofstream fileOutE2("Evalue is NA.txt"); //ofstream coutTest("test signT.txt"); //the sum of lines of all files for one species //string pfamFeature[maxNumFea]; //string proteinPfam[maxN][3]; int TOUPPER(int c) { return toupper(c); } struct feaNode { int No; long double negaLogE; int realN; int biN; //}in

info "(gdb)Auto-loading safe path" warning: Unable to find libthread_db matching inferior's thread library, thread debugging will not be available. ai_detection start [New LWP 8741] [New LWP 8745] [New LWP 8746] [New LWP 8747] [New LWP 8748] [New LWP 8749] [New LWP 8750] [New LWP 8751] ptr = 0x5555630338 [2025-03-28 17:59:10.890] [info] [8411] quick_log init success! name:quick_log, v1.0.5, buildtime:2025-03-28T11:18:12+08:00, githash:0e6b5c1[tom_wang]<[email protected]> [2025-03-28 17:59:10.890] [info] [8411] ai_server.cpp:77: AiServer [2025-03-28 17:59:10.893] [trace] [8411] ai_detection.cpp:245: clear_dt_ =5 [2025-03-28 17:59:10.893] [trace] [8411] ai_detection.cpp:262: clear_cloud_obstacle_ size: 201 [2025-03-28 17:59:10.893] [trace] [8411] perception_worker.cpp:27: perception lib init entry [2025-03-28 17:59:10.893] [info] [8411] perception_worker.cpp:30: point cloud init [New LWP 8758] [2025-03-28 17:59:10.894] [info] [8758] thread_worker_base.cpp:100: worker point clound start! [2025-03-28 17:59:10.894] [trace] [8758] thread_worker_base.cpp:105: work_thread point clound wait call Thread 1 "ai_detection" received signal SIGSEGV, Segmentation fault. 0x0000007fbdc43e28 in ?? () from /app/lib/libc.so.6 (gdb) where #0 0x0000007fbdc43e28 in ?? () from /app/lib/libc.so.6 #1 0x0000007fbdf336d0 in std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >::_M_assign(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) () from /usr/lib/libstdc++.so.6 #2 0x0000007fbf4d1d14 in perception_lib::init(perception_lib::Perception_worker_param&) () from /app/lib/libperception_lib.so #3 0x00000055555f80b0 in ai_detection::AiDetection::on_configure() () #4 0x00000055555fac0c in ai_detection::AiDetection::AiDetection(std::shared_ptr<rclcpp::Node>) () #5 0x00000055555c827c in ai_detection::AiServer::on_configure() () #6 0x00000055555c9f1c in ai_detection::AiServer::AiServer(std::shared_ptr<rclcpp::Node>) () #7 0x0报错

assert failed: esp_startup_start_app app_startup.c:86 (res == pdTRUE) Backtrace: 0x40375aed:0x3fceb110 0x4037a509:0x3fceb130 0x40380375:0x3fceb150 0x420149e2:0x3fceb270 0x420015a2:0x3fceb2a0 0x403756e5:0x3fceb2d0 0x403ccba4:0x3fceb340 0x403ccfe9:0x3fceb380 0x403c89b5:0x3fceb4b0 0x40045c01:0x3fceb570 0x40043ab6:0x3fceb6f0 0x40034c45:0x3fceb710 --- 0x40375aed: panic_abort at /home/ubuntu/esp32/esp-idf/components/esp_system/panic.c:454 0x4037a509: esp_system_abort at /home/ubuntu/esp32/esp-idf/components/esp_system/port/esp_system_chip.c:92 0x40380375: __assert_func at /home/ubuntu/esp32/esp-idf/components/newlib/src/assert.c:80 0x420149e2: esp_startup_start_app at /home/ubuntu/esp32/esp-idf/components/freertos/app_startup.c:86 (discriminator 1) 0x420015a2: start_cpu0_default at /home/ubuntu/esp32/esp-idf/components/esp_system/startup.c:216 0x403756e5: call_start_cpu0 at /home/ubuntu/esp32/esp-idf/components/esp_system/port/cpu_start.c:853 (discriminator 1) 0x403ccba4: set_cache_and_start_app at /home/ubuntu/esp32/esp-idf/components/bootloader_support/src/bootloader_utility.c:1158 (inlined by) unpack_load_app at /home/ubuntu/esp32/esp-idf/components/bootloader_support/src/bootloader_utility.c:893 (inlined by) load_image at /home/ubuntu/esp32/esp-idf/components/bootloader_support/src/bootloader_utility.c:803 0x403ccfe9: bootloader_utility_load_boot_image at /home/ubuntu/esp32/esp-idf/components/bootloader_support/src/bootloader_utility.c:605 0x403c89b5: call_start_cpu0 at /home/ubuntu/esp32/esp-idf/components/bootloader/subproject/main/bootloader_start.c:62 0x40045c01: ets_run_flash_bootloader in ROM 0x40043ab6: main in ROM 0x40034c45: .stack_ok in ROM ELF file SHA256: 68dc9896a Rebooting... ���ESP-ROM:esp32s3-20210327 Build:Mar 27 2021 rst:0xc (RTC_SW_CPU_RST),boot:0x2a (SPI_FAST_FLASH_BOOT) Saved PC:0x40375a9d --- 0x40375a9d: esp_restart_noos at /home/ubuntu/esp32/esp-idf/components/esp_system/port/soc/esp32s3/system_internal.c:160 SPIWP:0xee mode:DIO, clock div:1 load:0x3fce2820,len:0x1718 load:0x403c8700,len:0xebc --- 0x403c8700: _stext at ??:? load:0x403cb700,len:0x31a0 entry 0x403c894c --- 0x403c894c: call_start_cpu0 at /home/ubuntu/esp32/esp-idf/components/bootloader/subproject/main/bootloader_start.c:25

/home/wiseatc/.local/lib/python3.11/site-packages/jieba/_compat.py:18: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. import pkg_resources W0703 16:13:22.516433 3913223 torch/distributed/run.py:766] W0703 16:13:22.516433 3913223 torch/distributed/run.py:766] ***************************************** W0703 16:13:22.516433 3913223 torch/distributed/run.py:766] Setting OMP_NUM_THREADS environment variable for each process to be 1 in default, to avoid your system being overloaded, please further tune the variable for optimal performance in your application as needed. W0703 16:13:22.516433 3913223 torch/distributed/run.py:766] ***************************************** /home/wiseatc/.local/lib/python3.11/site-packages/jieba/_compat.py:18: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. import pkg_resources /home/wiseatc/.local/lib/python3.11/site-packages/jieba/_compat.py:18: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. import pkg_resources /home/wiseatc/.local/lib/python3.11/site-packages/jieba/_compat.py:18: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. import pkg_resources /home/wiseatc/.local/lib/python3.11/site-packages/jieba/_compat.py:18: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. import pkg_resources [rank0]: Traceback (most recent call last): [rank0]: File "/usr/local/lib/python3.11/dist-packages/transformers/utils/import_utils.py", line 1863, in _get_module [rank0]: return importlib.import_module("." + module_name, self.__name__) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "/usr/lib/python3.11/importlib/__init__.py", line 126, in import_module [rank0]: return _bootstrap._gcd_import(name[level:], package, level) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "<frozen importlib._bootstrap>", line 1206, in _gcd_import [rank0]: File "<frozen importlib._bootstrap>", line 1178, in _find_and_load [rank0]: File "<frozen importlib._bootstrap>", line 1149, in _find_and_load_unlocked [rank0]: File "<frozen importlib._bootstrap>", line 690, in _load_unlocked [rank0]: File "<frozen importlib._bootstrap_external>", line 940, in exec_module [rank0]: File "<frozen importlib._bootstrap>", line 241, in _call_with_frames_removed [rank0]: File "/usr/local/lib/python3.11/dist-packages/transformers/models/llama/tokenization_llama_fast.py", line 29, in <module> [rank0]: from .tokenization_llama import LlamaTokenizer [rank0]: File "/usr/local/lib/python3.11/dist-packages/transformers/models/llama/tokenization_llama.py", line 27, in <module> [rank0]: import sentencepiece as spm [rank0]: File "/usr/local/lib/python3.11/dist-packages/sentencepiece/__init__.py", line 10, in <module> [rank0]: from . import _sentencepiece [rank0]: ImportError: cannot import name '_sentencepiece' from partially initialized module 'sentencepiece' (most likely due to a circular import) (/usr/local/lib/python3.11/dist-packages/sentencepiece/__init__.py) [rank0]: The above exception was the direct cause of the following exception: [rank0]: Traceback (most recent call last): [rank0]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/model/loader.py", line 82, in load_tokenizer [rank0]: tokenizer = AutoTokenizer.from_pretrained( [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "/usr/local/lib/python3.11/dist-packages/transformers/models/auto/tokenization_auto.py", line 912, in from_pretrained [rank0]: tokenizer_class_from_name(config_tokenizer_class) is not None [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "/usr/local/lib/python3.11/dist-packages/transformers/models/auto/tokenization_auto.py", line 611, in tokenizer_class_from_name [rank0]: return getattr(module, class_name) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "/usr/local/lib/python3.11/dist-packages/transformers/utils/import_utils.py", line 1851, in __getattr__ [rank0]: module = self._get_module(self._class_to_module[name]) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "/usr/local/lib/python3.11/dist-packages/transformers/utils/import_utils.py", line 1865, in _get_module [rank0]: raise RuntimeError( [rank0]: RuntimeError: Failed to import transformers.models.llama.tokenization_llama_fast because of the following error (look up to see its traceback): [rank0]: cannot import name '_sentencepiece' from partially initialized module 'sentencepiece' (most likely due to a circular import) (/usr/local/lib/python3.11/dist-packages/sentencepiece/__init__.py) [rank0]: The above exception was the direct cause of the following exception: [rank0]: Traceback (most recent call last): [rank0]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/launcher.py", line 23, in <module> [rank0]: launch() [rank0]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/launcher.py", line 19, in launch [rank0]: run_exp() [rank0]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/train/tuner.py", line 110, in run_exp [rank0]: _training_function(config={"args": args, "callbacks": callbacks}) [rank0]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/train/tuner.py", line 72, in _training_function [rank0]: run_sft(model_args, data_args, training_args, finetuning_args, generating_args, callbacks) [rank0]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/train/sft/workflow.py", line 48, in run_sft [rank0]: tokenizer_module = load_tokenizer(model_args) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/model/loader.py", line 97, in load_tokenizer [rank0]: raise OSError("Failed to load tokenizer.") from e [rank0]: OSError: Failed to load tokenizer. [rank3]: Traceback (most recent call last): [rank3]: File "/usr/local/lib/python3.11/dist-packages/transformers/utils/import_utils.py", line 1863, in _get_module [rank3]: return importlib.import_module("." + module_name, self.__name__) [rank3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank3]: File "/usr/lib/python3.11/importlib/__init__.py", line 126, in import_module [rank3]: return _bootstrap._gcd_import(name[level:], package, level) [rank3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank3]: File "<frozen importlib._bootstrap>", line 1206, in _gcd_import [rank3]: File "<frozen importlib._bootstrap>", line 1178, in _find_and_load [rank3]: File "<frozen importlib._bootstrap>", line 1149, in _find_and_load_unlocked [rank3]: File "<frozen importlib._bootstrap>", line 690, in _load_unlocked [rank3]: File "<frozen importlib._bootstrap_external>", line 940, in exec_module [rank3]: File "<frozen importlib._bootstrap>", line 241, in _call_with_frames_removed [rank3]: File "/usr/local/lib/python3.11/dist-packages/transformers/models/llama/tokenization_llama_fast.py", line 29, in <module> [rank3]: from .tokenization_llama import LlamaTokenizer [rank3]: File "/usr/local/lib/python3.11/dist-packages/transformers/models/llama/tokenization_llama.py", line 27, in <module> [rank3]: import sentencepiece as spm [rank3]: File "/usr/local/lib/python3.11/dist-packages/sentencepiece/__init__.py", line 10, in <module> [rank3]: from . import _sentencepiece [rank3]: ImportError: cannot import name '_sentencepiece' from partially initialized module 'sentencepiece' (most likely due to a circular import) (/usr/local/lib/python3.11/dist-packages/sentencepiece/__init__.py) [rank3]: The above exception was the direct cause of the following exception: [rank3]: Traceback (most recent call last): [rank3]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/model/loader.py", line 82, in load_tokenizer [rank3]: tokenizer = AutoTokenizer.from_pretrained( [rank3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank3]: File "/usr/local/lib/python3.11/dist-packages/transformers/models/auto/tokenization_auto.py", line 912, in from_pretrained [rank3]: tokenizer_class_from_name(config_tokenizer_class) is not None [rank3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank3]: File "/usr/local/lib/python3.11/dist-packages/transformers/models/auto/tokenization_auto.py", line 611, in tokenizer_class_from_name [rank3]: return getattr(module, class_name) [rank3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank3]: File "/usr/local/lib/python3.11/dist-packages/transformers/utils/import_utils.py", line 1851, in __getattr__ [rank3]: module = self._get_module(self._class_to_module[name]) [rank3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank3]: File "/usr/local/lib/python3.11/dist-packages/transformers/utils/import_utils.py", line 1865, in _get_module [rank3]: raise RuntimeError( [rank3]: RuntimeError: Failed to import transformers.models.llama.tokenization_llama_fast because of the following error (look up to see its traceback): [rank3]: cannot import name '_sentencepiece' from partially initialized module 'sentencepiece' (most likely due to a circular import) (/usr/local/lib/python3.11/dist-packages/sentencepiece/__init__.py) [rank3]: The above exception was the direct cause of the following exception: [rank3]: Traceback (most recent call last): [rank3]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/launcher.py", line 23, in <module> [rank3]: launch() [rank3]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/launcher.py", line 19, in launch [rank3]: run_exp() [rank3]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/train/tuner.py", line 110, in run_exp [rank3]: _training_function(config={"args": args, "callbacks": callbacks}) [rank3]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/train/tuner.py", line 72, in _training_function [rank3]: run_sft(model_args, data_args, training_args, finetuning_args, generating_args, callbacks) [rank3]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/train/sft/workflow.py", line 48, in run_sft [rank3]: tokenizer_module = load_tokenizer(model_args) [rank3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank3]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/model/loader.py", line 97, in load_tokenizer [rank3]: raise OSError("Failed to load tokenizer.") from e [rank3]: OSError: Failed to load tokenizer. [rank1]: Traceback (most recent call last): [rank1]: File "/usr/local/lib/python3.11/dist-packages/transformers/utils/import_utils.py", line 1863, in _get_module [rank1]: return importlib.import_module("." + module_name, self.__name__) [rank1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank1]: File "/usr/lib/python3.11/importlib/__init__.py", line 126, in import_module [rank1]: return _bootstrap._gcd_import(name[level:], package, level) [rank1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank1]: File "<frozen importlib._bootstrap>", line 1206, in _gcd_import [rank1]: File "<frozen importlib._bootstrap>", line 1178, in _find_and_load [rank1]: File "<frozen importlib._bootstrap>", line 1149, in _find_and_load_unlocked [rank1]: File "<frozen importlib._bootstrap>", line 690, in _load_unlocked [rank1]: File "<frozen importlib._bootstrap_external>", line 940, in exec_module [rank1]: File "<frozen importlib._bootstrap>", line 241, in _call_with_frames_removed [rank1]: File "/usr/local/lib/python3.11/dist-packages/transformers/models/llama/tokenization_llama_fast.py", line 29, in <module> [rank1]: from .tokenization_llama import LlamaTokenizer [rank1]: File "/usr/local/lib/python3.11/dist-packages/transformers/models/llama/tokenization_llama.py", line 27, in <module> [rank1]: import sentencepiece as spm [rank1]: File "/usr/local/lib/python3.11/dist-packages/sentencepiece/__init__.py", line 10, in <module> [rank1]: from . import _sentencepiece [rank1]: ImportError: cannot import name '_sentencepiece' from partially initialized module 'sentencepiece' (most likely due to a circular import) (/usr/local/lib/python3.11/dist-packages/sentencepiece/__init__.py) [rank1]: The above exception was the direct cause of the following exception: [rank1]: Traceback (most recent call last): [rank1]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/model/loader.py", line 82, in load_tokenizer [rank1]: tokenizer = AutoTokenizer.from_pretrained( [rank1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank1]: File "/usr/local/lib/python3.11/dist-packages/transformers/models/auto/tokenization_auto.py", line 912, in from_pretrained [rank1]: tokenizer_class_from_name(config_tokenizer_class) is not None [rank1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank1]: File "/usr/local/lib/python3.11/dist-packages/transformers/models/auto/tokenization_auto.py", line 611, in tokenizer_class_from_name [rank1]: return getattr(module, class_name) [rank1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank1]: File "/usr/local/lib/python3.11/dist-packages/transformers/utils/import_utils.py", line 1851, in __getattr__ [rank1]: module = self._get_module(self._class_to_module[name]) [rank1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank1]: File "/usr/local/lib/python3.11/dist-packages/transformers/utils/import_utils.py", line 1865, in _get_module [rank1]: raise RuntimeError( [rank1]: RuntimeError: Failed to import transformers.models.llama.tokenization_llama_fast because of the following error (look up to see its traceback): [rank1]: cannot import name '_sentencepiece' from partially initialized module 'sentencepiece' (most likely due to a circular import) (/usr/local/lib/python3.11/dist-packages/sentencepiece/__init__.py) [rank1]: The above exception was the direct cause of the following exception: [rank1]: Traceback (most recent call last): [rank1]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/launcher.py", line 23, in <module> [rank1]: launch() [rank1]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/launcher.py", line 19, in launch [rank1]: run_exp() [rank1]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/train/tuner.py", line 110, in run_exp [rank1]: _training_function(config={"args": args, "callbacks": callbacks}) [rank1]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/train/tuner.py", line 72, in _training_function [rank1]: run_sft(model_args, data_args, training_args, finetuning_args, generating_args, callbacks) [rank1]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/train/sft/workflow.py", line 48, in run_sft [rank1]: tokenizer_module = load_tokenizer(model_args) [rank1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank1]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/model/loader.py", line 97, in load_tokenizer [rank1]: raise OSError("Failed to load tokenizer.") from e [rank1]: OSError: Failed to load tokenizer. [rank2]: Traceback (most recent call last): [rank2]: File "/usr/local/lib/python3.11/dist-packages/transformers/utils/import_utils.py", line 1863, in _get_module [rank2]: return importlib.import_module("." + module_name, self.__name__) [rank2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank2]: File "/usr/lib/python3.11/importlib/__init__.py", line 126, in import_module [rank2]: return _bootstrap._gcd_import(name[level:], package, level) [rank2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank2]: File "<frozen importlib._bootstrap>", line 1206, in _gcd_import [rank2]: File "<frozen importlib._bootstrap>", line 1178, in _find_and_load [rank2]: File "<frozen importlib._bootstrap>", line 1149, in _find_and_load_unlocked [rank2]: File "<frozen importlib._bootstrap>", line 690, in _load_unlocked [rank2]: File "<frozen importlib._bootstrap_external>", line 940, in exec_module [rank2]: File "<frozen importlib._bootstrap>", line 241, in _call_with_frames_removed [rank2]: File "/usr/local/lib/python3.11/dist-packages/transformers/models/llama/tokenization_llama_fast.py", line 29, in <module> [rank2]: from .tokenization_llama import LlamaTokenizer [rank2]: File "/usr/local/lib/python3.11/dist-packages/transformers/models/llama/tokenization_llama.py", line 27, in <module> [rank2]: import sentencepiece as spm [rank2]: File "/usr/local/lib/python3.11/dist-packages/sentencepiece/__init__.py", line 10, in <module> [rank2]: from . import _sentencepiece [rank2]: ImportError: cannot import name '_sentencepiece' from partially initialized module 'sentencepiece' (most likely due to a circular import) (/usr/local/lib/python3.11/dist-packages/sentencepiece/__init__.py) [rank2]: The above exception was the direct cause of the following exception: [rank2]: Traceback (most recent call last): [rank2]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/model/loader.py", line 82, in load_tokenizer [rank2]: tokenizer = AutoTokenizer.from_pretrained( [rank2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank2]: File "/usr/local/lib/python3.11/dist-packages/transformers/models/auto/tokenization_auto.py", line 912, in from_pretrained [rank2]: tokenizer_class_from_name(config_tokenizer_class) is not None [rank2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank2]: File "/usr/local/lib/python3.11/dist-packages/transformers/models/auto/tokenization_auto.py", line 611, in tokenizer_class_from_name [rank2]: return getattr(module, class_name) [rank2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank2]: File "/usr/local/lib/python3.11/dist-packages/transformers/utils/import_utils.py", line 1851, in __getattr__ [rank2]: module = self._get_module(self._class_to_module[name]) [rank2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank2]: File "/usr/local/lib/python3.11/dist-packages/transformers/utils/import_utils.py", line 1865, in _get_module [rank2]: raise RuntimeError( [rank2]: RuntimeError: Failed to import transformers.models.llama.tokenization_llama_fast because of the following error (look up to see its traceback): [rank2]: cannot import name '_sentencepiece' from partially initialized module 'sentencepiece' (most likely due to a circular import) (/usr/local/lib/python3.11/dist-packages/sentencepiece/__init__.py) [rank2]: The above exception was the direct cause of the following exception: [rank2]: Traceback (most recent call last): [rank2]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/launcher.py", line 23, in <module> [rank2]: launch() [rank2]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/launcher.py", line 19, in launch [rank2]: run_exp() [rank2]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/train/tuner.py", line 110, in run_exp [rank2]: _training_function(config={"args": args, "callbacks": callbacks}) [rank2]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/train/tuner.py", line 72, in _training_function [rank2]: run_sft(model_args, data_args, training_args, finetuning_args, generating_args, callbacks) [rank2]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/train/sft/workflow.py", line 48, in run_sft [rank2]: tokenizer_module = load_tokenizer(model_args) [rank2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank2]: File "/home/wiseatc/LLaMA-Factory/src/llamafactory/model/loader.py", line 97, in load_tokenizer [rank2]: raise OSError("Failed to load tokenizer.") from e [rank2]: OSError: Failed to load tokenizer. [rank0]:[W703 16:13:30.861219244 ProcessGroupNCCL.cpp:1479] Warning: WARNING: destroy_process_group() was not called before program exit, which can leak resources. For more info, please see https://pytorch.org/docs/stable/distributed.html#shutdown (function operator()) W0703 16:13:31.449512 3913223 torch/distributed/elastic/multiprocessing/api.py:900] Sending process 3913282 closing signal SIGTERM W0703 16:13:31.450263 3913223 torch/distributed/elastic/multiprocessing/api.py:900] Sending process 3913283 closing signal SIGTERM W0703 16:13:31.450724 3913223 torch/distributed/elastic/multiprocessing/api.py:900] Sending process 3913284 closing signal SIGTERM E0703 16:13:31.765744 3913223 torch/distributed/elastic/multiprocessing/api.py:874] failed (exitcode: 1) local_rank: 0 (pid: 3913281) of binary: /usr/bin/python3.11 Traceback (most recent call last): File "/usr/local/bin/torchrun", line 8, in <module> sys.exit(main()) ^^^^^^ File "/usr/local/lib/python3.11/dist-packages/torch/distributed/elastic/multiprocessing/errors/__init__.py", line 355, in wrapper return f(*args, **kwargs) ^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/dist-packages/torch/distributed/run.py", line 892, in main run(args) File "/usr/local/lib/python3.11/dist-packages/torch/distributed/run.py", line 883, in run elastic_launch( File "/usr/local/lib/python3.11/dist-packages/torch/distributed/launcher/api.py", line 139, in __call__ return launch_agent(self._config, self._entrypoint, list(args)) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/dist-packages/torch/distributed/launcher/api.py", line 270, in launch_agent raise ChildFailedError( torch.distributed.elastic.multiprocessing.errors.ChildFailedError: ============================================================ /home/wiseatc/LLaMA-Factory/src/llamafactory/launcher.py FAILED ------------------------------------------------------------ Failures: <NO_OTHER_FAILURES> ------------------------------------------------------------ Root Cause (first observed failure): [0]: time : 2025-07-03_16:13:31 host : wiseatc-Super-Server rank : 0 (local_rank: 0) exitcode : 1 (pid: 3913281) error_file: <N/A> traceback : To enable traceback see: https://pytorch.org/docs/stable/elastic/errors.html ============================================================ Traceback (most recent call last): File "/home/wiseatc/.local/bin/llamafactory-cli", line 8, in <module> sys.exit(main()) ^^^^^^ File "/home/wiseatc/LLaMA-Factory/src/llamafactory/cli.py", line 130, in main process = subprocess.run( ^^^^^^^^^^^^^^^ File "/usr/lib/python3.11/subprocess.py", line 569, in run raise CalledProcessError(retcode, process.args, subprocess.CalledProcessError: Command '['torchrun', '--nnodes', '1', '--node_rank', '0', '--nproc_per_node', '4', '--master_addr', '127.0.0.1', '--master_port', '38589', '/home/wiseatc/LLaMA-Factory/src/llamafactory/launcher.py', 'saves/DeepSeek-R1-1.5B-Distill/lora/train_2025-07-03-16-00-01/training_args.yaml']' returned non-zero exit status 1.

alibi@alibi-virtual-machine:~/桌面$ ./shell 段错误 (核心已转储) alibi@alibi-virtual-machine:~/桌面$ gdb ./shell GNU gdb (Ubuntu 9.2-0ubuntu1~20.04.2) 9.2 Copyright (C) 2020 Free Software Foundation, Inc. License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html> This is free software: you are free to change and redistribute it. There is NO WARRANTY, to the extent permitted by law. Type "show copying" and "show warranty" for details. This GDB was configured as "x86_64-linux-gnu". Type "show configuration" for configuration details. For bug reporting instructions, please see: <http://www.gnu.org/software/gdb/bugs/>. Find the GDB manual and other documentation resources online at: <http://www.gnu.org/software/gdb/documentation/>. For help, type "help". Type "apropos word" to search for commands related to "word"... Reading symbols from ./shell... (No debugging symbols found in ./shell) gdb-peda$ run Starting program: /home/alibi/桌面/shell Program received signal SIGSEGV, Segmentation fault. [----------------------------------registers-----------------------------------] EAX: 0xfffffffe EBX: 0xffffd23c --> 0x0 ECX: 0xffffd234 ("/bin//sh") EDX: 0xb ('\x0b') ESI: 0x0 EDI: 0x0 EBP: 0x0 ESP: 0xffffd234 ("/bin//sh") EIP: 0x8049017 --> 0x0 EFLAGS: 0x10246 (carry PARITY adjust ZERO sign trap INTERRUPT direction overflow) [-------------------------------------code-------------------------------------] 0x8049011 <_start+17>: mov al,0xb 0x8049013 <_start+19>: mov edx,eax 0x8049015 <_start+21>: int 0x80 => 0x8049017: add BYTE PTR [eax],al 0x8049019: add BYTE PTR [eax],al 0x804901b: add BYTE PTR [eax],al 0x804901d: add BYTE PTR [eax],al 0x804901f: add BYTE PTR [eax],al [------------------------------------stack-------------------------------------] 0000| 0xffffd234 ("/bin//sh") 0004| 0xffffd238 ("//sh") 0008| 0xffffd23c --> 0x0 0012| 0xffffd240 --> 0x1 0016| 0xffffd244 --> 0xffffd3fe ("/home/alibi/桌面/shell") 0020| 0xf

最新推荐

recommend-type

2005年上半年网络工程师上午试卷(解析)--网络工程师试题及答案.doc

2005年上半年网络工程师上午试卷(解析)--网络工程师试题及答案.doc
recommend-type

构建基于ajax, jsp, Hibernate的博客网站源码解析

根据提供的文件信息,本篇内容将专注于解释和阐述ajax、jsp、Hibernate以及构建博客网站的相关知识点。 ### AJAX AJAX(Asynchronous JavaScript and XML)是一种用于创建快速动态网页的技术,它允许网页在不重新加载整个页面的情况下,与服务器交换数据并更新部分网页内容。AJAX的核心是JavaScript中的XMLHttpRequest对象,通过这个对象,JavaScript可以异步地向服务器请求数据。此外,现代AJAX开发中,常常用到jQuery中的$.ajax()方法,因为其简化了AJAX请求的处理过程。 AJAX的特点主要包括: - 异步性:用户操作与数据传输是异步进行的,不会影响用户体验。 - 局部更新:只更新需要更新的内容,而不是整个页面,提高了数据交互效率。 - 前后端分离:AJAX技术允许前后端分离开发,让前端开发者专注于界面和用户体验,后端开发者专注于业务逻辑和数据处理。 ### JSP JSP(Java Server Pages)是一种动态网页技术标准,它允许开发者将Java代码嵌入到HTML页面中,从而实现动态内容的生成。JSP页面在服务器端执行,并将生成的HTML发送到客户端浏览器。JSP是Java EE(Java Platform, Enterprise Edition)的一部分。 JSP的基本工作原理: - 当客户端首次请求JSP页面时,服务器会将JSP文件转换为Servlet。 - 服务器上的JSP容器(如Apache Tomcat)负责编译并执行转换后的Servlet。 - Servlet生成HTML内容,并发送给客户端浏览器。 JSP页面中常见的元素包括: - 指令(Directives):如page、include、taglib等。 - 脚本元素:脚本声明(Script declarations)、脚本表达式(Scriptlet)和脚本片段(Expression)。 - 标准动作:如jsp:useBean、jsp:setProperty、jsp:getProperty等。 - 注释:在客户端浏览器中不可见的注释。 ### Hibernate Hibernate是一个开源的对象关系映射(ORM)框架,它提供了从Java对象到数据库表的映射,简化了数据库编程。通过Hibernate,开发者可以将Java对象持久化到数据库中,并从数据库中检索它们,而无需直接编写SQL语句或掌握复杂的JDBC编程。 Hibernate的主要优点包括: - ORM映射:将对象模型映射到关系型数据库的表结构。 - 缓存机制:提供了二级缓存,优化数据访问性能。 - 数据查询:提供HQL(Hibernate Query Language)和Criteria API等查询方式。 - 延迟加载:可以配置对象或对象集合的延迟加载,以提高性能。 ### 博客网站开发 构建一个博客网站涉及到前端页面设计、后端逻辑处理、数据库设计等多个方面。使用ajax、jsp、Hibernate技术栈,开发者可以更高效地构建功能完备的博客系统。 #### 前端页面设计 前端主要通过HTML、CSS和JavaScript来实现,其中ajax技术可以用来异步获取文章内容、用户评论等,无需刷新页面即可更新内容。 #### 后端逻辑处理 JSP可以在服务器端动态生成HTML内容,根据用户请求和数据库中的数据渲染页面。Hibernate作为ORM框架,可以处理Java对象与数据库表之间的映射,并提供数据库的CRUD(创建、读取、更新、删除)操作。 #### 数据库设计 博客网站的数据库设计通常包含多个表,如用户表(存储用户信息)、文章表(存储文章信息)、评论表(存储用户评论信息)等。使用Hibernate框架可以简化数据库操作,同时确保数据的一致性和安全性。 #### 安全性和性能优化 安全性是构建网站时需要考虑的重要方面,包括但不限于SQL注入防护、XSS攻击防护、会话管理等。性能优化方面,可以利用Hibernate的缓存机制,以及对JSP页面和ajax请求进行适当的缓存处理。 ### 结论 ajax、jsp、Hibernate技术结合可以构建出高效、动态、易于维护的博客网站。在开发过程中,应当关注前后端分离、用户体验优化、系统性能和安全性等关键要素,确保博客网站的稳定和长期可用性。通过本篇文章,读者应该已经对这些技术有了初步了解,并能够结合文件提供的源码开始进行相关开发实践。
recommend-type

【Unity Sunny Land关卡设计高级指南】:打造完美关卡的8大技巧

# 1. Unity关卡设计概述 ## 1.1 关卡设计的核心要素 关卡设计作为游戏开发中不可或缺的一环,承担着构建游戏世界、呈现故事背景、引导玩家体验和激发玩家兴趣的责任。一个优秀的关卡不仅需要有吸引人的视觉效果和有趣的挑战,还需平衡玩家体验,提供连贯的游戏故事。 ## 1.2 关卡设计师的角色 在Unity引擎的环境下,关卡设计师需要具备对场景布局、角色互动、剧情
recommend-type

C++ 模版

<think>我们正在讨论C++模板的使用和常见问题。根据用户要求,所有行内数学表达式使用$...$,独立公式使用$$...$$,用中文回答,并生成相关问题。同时,回答要结构清晰,帮助用户逐步解决问题。 首先,回顾用户的问题:用户想了解C++模板的使用和常见问题。我们将从以下几个方面展开: 1. 模板的基本概念和用途 2. 函数模板和类模板 3. 模板特化和偏特化 4. 模板元编程简介 5. 常见问题及解决方案 注意:在回答中,我们会引用站内提供的引用内容(引用[1]和引用[2])作为参考,但主要围绕模板展开。 ### 1. 模板的基本概念和用途 C++模板是一种支持泛型编程的特性,允许
recommend-type

C#随机数摇奖系统功能及隐藏开关揭秘

### C#摇奖系统知识点梳理 #### 1. C#语言基础 C#(发音为“看井”)是由微软开发的一种面向对象的、类型安全的编程语言。它是.NET框架的核心语言之一,广泛用于开发Windows应用程序、ASP.NET网站、Web服务等。C#提供丰富的数据类型、控制结构和异常处理机制,这使得它在构建复杂应用程序时具有很强的表达能力。 #### 2. 随机数的生成 在编程中,随机数生成是常见的需求之一,尤其在需要模拟抽奖、游戏等场景时。C#提供了System.Random类来生成随机数。Random类的实例可以生成一个伪随机数序列,这些数在统计学上被认为是随机的,但它们是由确定的算法生成,因此每次运行程序时产生的随机数序列相同,除非改变种子值。 ```csharp using System; class Program { static void Main() { Random rand = new Random(); for(int i = 0; i < 10; i++) { Console.WriteLine(rand.Next(1, 101)); // 生成1到100之间的随机数 } } } ``` #### 3. 摇奖系统设计 摇奖系统通常需要以下功能: - 用户界面:显示摇奖结果的界面。 - 随机数生成:用于确定摇奖结果的随机数。 - 动画效果:模拟摇奖的视觉效果。 - 奖项管理:定义摇奖中可能获得的奖品。 - 规则设置:定义摇奖规则,比如中奖概率等。 在C#中,可以使用Windows Forms或WPF技术构建用户界面,并集成上述功能以创建一个完整的摇奖系统。 #### 4. 暗藏的开关(隐藏控制) 标题中提到的“暗藏的开关”通常是指在程序中实现的一个不易被察觉的控制逻辑,用于在特定条件下改变程序的行为。在摇奖系统中,这样的开关可能用于控制中奖的概率、启动或停止摇奖、强制显示特定的结果等。 #### 5. 测试 对于摇奖系统来说,测试是一个非常重要的环节。测试可以确保程序按照预期工作,随机数生成器的随机性符合要求,用户界面友好,以及隐藏的控制逻辑不会被轻易发现或利用。测试可能包括单元测试、集成测试、压力测试等多个方面。 #### 6. System.Random类的局限性 System.Random虽然方便使用,但也有其局限性。其生成的随机数序列具有一定的周期性,并且如果使用不当(例如使用相同的种子创建多个实例),可能会导致生成相同的随机数序列。在安全性要求较高的场合,如密码学应用,推荐使用更加安全的随机数生成方式,比如RNGCryptoServiceProvider。 #### 7. Windows Forms技术 Windows Forms是.NET框架中用于创建图形用户界面应用程序的库。它提供了一套丰富的控件,如按钮、文本框、标签等,以及它们的事件处理机制,允许开发者设计出视觉效果良好且功能丰富的桌面应用程序。 #### 8. WPF技术 WPF(Windows Presentation Foundation)是.NET框架中用于构建桌面应用程序用户界面的另一种技术。与Windows Forms相比,WPF提供了更现代化的控件集,支持更复杂的布局和样式,以及3D图形和动画效果。WPF的XAML标记语言允许开发者以声明性的方式设计用户界面,与C#代码分离,易于维护和更新。 #### 9. 压缩包子文件TransBallDemo分析 从文件名“TransBallDemo”可以推测,这可能是一个C#的示例程序或者演示程序,其中“TransBall”可能表示旋转的球体,暗示该程序包含了动画效果,可能是用来模拟转动的球体(如转盘或摇奖球)。该文件可能是用来展示如何实现一个带有视觉动画效果的摇奖系统的C#程序。 总结以上内容,我们可以得出构建一个C#摇奖系统需要深入理解C#语言及其随机数生成机制,设计用户界面,集成动画效果,确保隐藏控制逻辑的安全性,以及全面测试系统以保证其正确性和公平性。通过掌握Windows Forms或WPF技术,可以进一步增强系统的视觉和交互体验。
recommend-type

【数据驱动的力量】:管道缺陷判别方法论与实践经验

# 摘要 数据驱动技术在管道缺陷检测领域展现出强大的力量,本文首先概述了数据驱动的力量和管道缺陷判别的基础理论。接着,重点探讨了管道缺陷的类型与特征、数据采集与预处理、数据驱动模型的选择与构建。在实践技巧章节,本文详述了实战数据集的准备与处理、缺陷识别算法的应用、以及性能评估与模型优化。此外,高级应用章节深入讨论了实时数据处理分析、多模态数据融合技术、以及深度学习在缺
recommend-type

FatalError: `Segmentation fault` is detected by the operating system. [TimeInfo: *** Aborted at 1752568501 (unix time) try "date -d @1752568501" if you are using GNU date ***] [SignalInfo: *** SIGSEGV (@0x0) received by PID 3203 (TID 0x7f8a04143340) from PID 0 ***],训练模型中出现上述错误,怎么解决

<think>我们面对的是一个模型训练过程中出现的Segmentation fault错误。根据引用[1],Segmentation Fault通常是由非法内存访问引起的,常见原因包括空指针解引用、数组越界、使用未初始化指针等。在深度学习训练场景中,还可能涉及显存溢出、CUDA内核错误等。 引用[2]展示了一个具体案例:在PaddlePaddle框架中遇到Segmentation fault,并提示了C++ Traceback。这通常表明底层C++代码出现了问题。而引用[3]则提到Python环境下的Segmentation fault,可能涉及Python扩展模块的错误。 解决步骤: 1
recommend-type

EditPlus中实现COBOL语言语法高亮的设置

标题中的“editplus”指的是一个轻量级的代码编辑器,特别受到程序员和软件开发者的欢迎,因为它支持多种编程语言。标题中的“mfcobol”指的是一种特定的编程语言,即“Micro Focus COBOL”。COBOL语言全称为“Common Business-Oriented Language”,是一种高级编程语言,主要用于商业、金融和行政管理领域的数据处理。它最初开发于1959年,是历史上最早的高级编程语言之一。 描述中的“cobol语言颜色显示”指的是在EditPlus这款编辑器中为COBOL代码提供语法高亮功能。语法高亮是一种编辑器功能,它可以将代码中的不同部分(如关键字、变量、字符串、注释等)用不同的颜色和样式显示,以便于编程者阅读和理解代码结构,提高代码的可读性和编辑的效率。在EditPlus中,要实现这一功能通常需要用户安装相应的语言语法文件。 标签“cobol”是与描述中提到的COBOL语言直接相关的一个词汇,它是对描述中提到的功能或者内容的分类或者指代。标签在互联网内容管理系统中用来帮助组织内容和便于检索。 在提供的“压缩包子文件的文件名称列表”中只有一个文件名:“Java.stx”。这个文件名可能是指一个语法高亮的模板文件(Syntax Template eXtension),通常以“.stx”为文件扩展名。这样的文件包含了特定语言语法高亮的规则定义,可用于EditPlus等支持自定义语法高亮的编辑器中。不过,Java.stx文件是为Java语言设计的语法高亮文件,与COBOL语言颜色显示并不直接相关。这可能意味着在文件列表中实际上缺少了为COBOL语言定义的相应.stx文件。对于EditPlus编辑器,要实现COBOL语言的颜色显示,需要的是一个COBOL.stx文件,或者需要在EditPlus中进行相应的语法高亮设置以支持COBOL。 为了在EditPlus中使用COBOL语法高亮,用户通常需要做以下几步操作: 1. 确保已经安装了支持COBOL的EditPlus版本。 2. 从Micro Focus或者第三方资源下载COBOL的语法高亮文件(COBOL.stx)。 3. 打开EditPlus,进入到“工具”菜单中的“配置用户工具”选项。 4. 在用户工具配置中,选择“语法高亮”选项卡,然后选择“添加”来载入下载的COBOL.stx文件。 5. 根据需要选择其他语法高亮的选项,比如是否开启自动完成、代码折叠等。 6. 确认并保存设置。 完成上述步骤后,在EditPlus中打开COBOL代码文件时,应该就能看到语法高亮显示了。语法高亮不仅仅是颜色的区分,它还可以包括字体加粗、斜体、下划线等样式,以及在某些情况下,语法错误的高亮显示。这对于提高编码效率和准确性有着重要意义。
recommend-type

影子系统(windows)问题排查:常见故障诊断与修复

# 摘要 本文旨在深入探讨影子系统的概念、工作原理以及故障诊断基础。首先,介绍影子系统的定义及其运作机制,并分析其故障诊断的理论基础,包括系统故障的分类和特征。接着,详细探讨各种故障诊断工具和方法,并提供实际操作中的故障排查步骤。文中还深入分析了影子系统常见故障案例,涵盖系统启动问题、软件兼容性和网络连通性问题,并提供相应的诊断与解决方案。高级故障诊断与修复
recommend-type

nt!DbgBreakPointWithStatus: fffff805`7affd0b0 cc int 3 kd> g KDTARGET: Refreshing KD connection *** Fatal System Error: 0x0000001a (0x0000000000061941,0xFFFFF8057B20E1C0,0x0000000000000019,0xFFFFFC89CACA7190) Break instruction exception - code 80000003 (first chance) A fatal system error has occurred. Debugger entered on first try; Bugcheck callbacks have not been invoked. A fatal system error has occurred. For analysis of this file, run !analyze -v nt!DbgBreakPointWithStatus: fffff805`7affd0b0 cc int 3 kd> !analyze -v Connected to Windows 10 19041 x64 target at (Tue Jul 15 23:02:04.588 2025 (UTC + 8:00)), ptr64 TRUE Loading Kernel Symbols ............. Press ctrl-c (cdb, kd, ntsd) or ctrl-break (windbg) to abort symbol loads that take too long. Run !sym noisy before .reload to track down problems loading symbols. .................................................. ................................................................ ..................................................... Loading User Symbols Loading unloaded module list ...... ERROR: FindPlugIns 8007007b ******************************************************************************* * * * Bugcheck Analysis * * * ******************************************************************************* MEMORY_MANAGEMENT (1a) # Any other values for parameter 1 must be individually examined. Arguments: Arg1: 0000000000061941, The subtype of the BugCheck. Arg2: fffff8057b20e1c0 Arg3: 0000000000000019 Arg4: fffffc89caca7190 Debugging Details: ------------------ Press ctrl-c (cdb, kd, ntsd) or ctrl-break (windbg) to abort symbol loads that take too long. Run !sym noisy before .reload to track down problems loading symbols. 蓝屏了,但原帖子代码能正常执行,帖子作者说:我win10是专门针对2mb大页,win7是专门针对4kb小页的,蓝屏的原因我只能想到是因为没有做大页和小页的处理了。你做了2mb和4kb之类的处理吗,还有2mb的大页的物理地址要2mb对齐,不然电脑会卡死,我win10蓝屏的原因就是因为没有物理地址2mb对齐,我这代码是专门针对我那个虚拟机的环境,其他环境蓝屏你要是懂分页的话自己就会处理了。最重要的细节就是物理地址要对齐,2mb就2mb的对齐,1gb就1gb的对齐。

<think>我们之前讨论的代码中已经包含了对大页(2MB)和小页(4KB)的处理。但是,根据蓝屏信息(MEMORY_MANAGEMENT, 0x1a)和参数,问题可能出在内存对齐或页表项设置上。 原帖作者提到:2MB大页的物理地址必须2MB对齐,否则会导致系统不稳定甚至蓝屏。在之前的代码中,我们确实处理了大页的情况,但在分配2MB内存时,我们使用了`MmAllocateContiguousMemorySpecifyCache`,这个函数分配的内存不一定满足2MB对齐(尽管它分配的是连续内存)。因此,我们需要确保分配的内存地址是2MB对齐的。 解决方案: 1. 在分配2MB内存时,要