ELOG Midas

Back Midas Rome Roody Rootana

Midas DAQ System, Page 131 of 152

Not logged in

Find | Login | Help

Full | Summary | Threaded | Show attachments

3027 Entries

Goto page Previous 1, 2, 3 ... 130, 131, 132 ... 150, 151, 152 Next

ID	Date	Author	Topic	Subject
2847	16 Sep 2024	Marius Koeppel	Bug Report	Crash using ODB watch
This is not the case here. Note that the error message: "Callback received for a midas::odb object which went out of scope" is not called! The segmentation fault happens later line 96. > The answer is in the error message: „Object went out of scope“. When your frontent_init() exits, the odb objects are destroyed. When you get a callback, it‘s linked to the > destroyed object. This is like if you have a local string and pass a reference to that string in the return of the function. > > Use a global object (bad) or use „new“ (potential memory leak). I would use a global structure which holds all odb objects. > > Stefan > > > > > last week I was running MIDAS with the commit 3ad98c5. Today I updated MIDAS and now all my watch functions are crashing. Attached I have a minimal example frontend of the problem. > > > > In our software we have two functions one which sets up the ODB values of the frontend and another one which sets up all watch functions. So overall we connect two time to the ODB during fronend_init one time to create the values and one time to create the watch. In the example code a simple version of this setup is shown: > > > > INT frontend_init() { > > > > cm_msg(MINFO, "frontend_init() setup", "Test FE"); > > > > odb settings = { > > {"Test", 123}, > > {"sub", {}} > > }; > > settings.connect_and_fix_structure("/Equipment/Test FE/Settings"); > > // settings.watch(watch); <-- this works without segmentation fault > > > > odb new_settings("/Equipment/Test FE/Settings"); > > new_settings.watch(watch); // <-- here I am getting a segmentation fault > > > > return CM_SUCCESS; > > } > > > > When I directly set the watch everything runs fine however, when I create a new ODB object and use this one to set a watch I am getting the following segmentation fault: > > > > Process 18474 stopped > > * thread #1, queue = 'com.apple.main-thread', stop reason = EXC_BAD_ACCESS (code=1, address=0x34) > > frame #0: 0x000000010004fa38 test_fe`midas::odb::watch_callback(hDB=<unavailable>, hKey=<unavailable>, index=0, info=0x00006000002001c0) at odbxx.cxx:96:25 [opt] > > 93 if (po->m_data == nullptr) > > 94 mthrow("Callback received for a midas::odb object which went out of scope"); > > 95 midas::odb poh = search_hkey(po, hKey); > > -> 96 poh->m_last_index = index; > > 97 po->m_watch_callback(poh); > > 98 poh->m_last_index = -1; > > 99 } > > > > Best, > > Marius
2848	16 Sep 2024	Stefan Ritt	Bug Report	Crash using ODB watch
Well, the object went out of scope. For my code it‘s hard to realize this, so the error reporting is poor. Also the first object should have the same problem. Just by accident that it does not crash. Stefan > This is not the case here. Note that the error message: "Callback received for a midas::odb object which went out of scope" is not called! The segmentation fault happens later line 96. > > > The answer is in the error message: „Object went out of scope“. When your frontent_init() exits, the odb objects are destroyed. When you get a callback, it‘s linked to the > > destroyed object. This is like if you have a local string and pass a reference to that string in the return of the function. > > > > Use a global object (bad) or use „new“ (potential memory leak). I would use a global structure which holds all odb objects. > > > > Stefan > > > > > > > > last week I was running MIDAS with the commit 3ad98c5. Today I updated MIDAS and now all my watch functions are crashing. Attached I have a minimal example frontend of the problem. > > > > > > In our software we have two functions one which sets up the ODB values of the frontend and another one which sets up all watch functions. So overall we connect two time to the ODB during fronend_init one time to create the values and one time to create the watch. In the example code a simple version of this setup is shown: > > > > > > INT frontend_init() { > > > > > > cm_msg(MINFO, "frontend_init() setup", "Test FE"); > > > > > > odb settings = { > > > {"Test", 123}, > > > {"sub", {}} > > > }; > > > settings.connect_and_fix_structure("/Equipment/Test FE/Settings"); > > > // settings.watch(watch); <-- this works without segmentation fault > > > > > > odb new_settings("/Equipment/Test FE/Settings"); > > > new_settings.watch(watch); // <-- here I am getting a segmentation fault > > > > > > return CM_SUCCESS; > > > } > > > > > > When I directly set the watch everything runs fine however, when I create a new ODB object and use this one to set a watch I am getting the following segmentation fault: > > > > > > Process 18474 stopped > > > * thread #1, queue = 'com.apple.main-thread', stop reason = EXC_BAD_ACCESS (code=1, address=0x34) > > > frame #0: 0x000000010004fa38 test_fe`midas::odb::watch_callback(hDB=<unavailable>, hKey=<unavailable>, index=0, info=0x00006000002001c0) at odbxx.cxx:96:25 [opt] > > > 93 if (po->m_data == nullptr) > > > 94 mthrow("Callback received for a midas::odb object which went out of scope"); > > > 95 midas::odb poh = search_hkey(po, hKey); > > > -> 96 poh->m_last_index = index; > > > 97 po->m_watch_callback(poh); > > > 98 poh->m_last_index = -1; > > > 99 } > > > > > > Best, > > > Marius
2849	16 Sep 2024	Marius Koeppel	Bug Report	Crash using ODB watch
Okay, but this is then a big issue IMO. For Mu3e we do this in every frontend and I also checked again all of these watches are broken at the moment (with commit 3ad98c5 they worked). In the old style we did for example (see https://bitbucket.org/tmidas/midas/src/develop/examples/crfe/crfe.cxx): INT frontend_init() { HNDLE hKey; // create Settings structure in ODB db_create_record(hDB, 0, "Equipment/Clock Reset/Settings", strcomb1(cr_settings_str).c_str()); db_find_key(hDB, 0, "/Equipment/Clock Reset", &hKey); assert(hKey); db_watch(hDB, hKey, cr_settings_changed, NULL); /* * Set our transition sequence. The default is 500. Setting it * to 600 means we are called AFTER most other clients. / cm_set_transition_sequence(TR_START, 600); return CM_SUCCESS; } I thought this will be the same (under the hood) in the current odbxx way via: odb settings("Equipment/Clock Reset/Settings"); settings.watch(cr_settings_changed); Best, Marius > Well, the object went* out of scope. For my code it‘s hard to realize this, so the error reporting is poor. Also the first object should have the same > problem. Just by accident that it does not crash. > > Stefan > > > This is not the case here. Note that the error message: "Callback received for a midas::odb object which went out of scope" is not called! The segmentation fault happens later line 96. > > > > > The answer is in the error message: „Object went out of scope“. When your frontent_init() exits, the odb objects are destroyed. When you get a callback, it‘s linked to the > > > destroyed object. This is like if you have a local string and pass a reference to that string in the return of the function. > > > > > > Use a global object (bad) or use „new“ (potential memory leak). I would use a global structure which holds all odb objects. > > > > > > Stefan > > > > > > > > > > > last week I was running MIDAS with the commit 3ad98c5. Today I updated MIDAS and now all my watch functions are crashing. Attached I have a minimal example frontend of the problem. > > > > > > > > In our software we have two functions one which sets up the ODB values of the frontend and another one which sets up all watch functions. So overall we connect two time to the ODB during fronend_init one time to create the values and one time to create the watch. In the example code a simple version of this setup is shown: > > > > > > > > INT frontend_init() { > > > > > > > > cm_msg(MINFO, "frontend_init() setup", "Test FE"); > > > > > > > > odb settings = { > > > > {"Test", 123}, > > > > {"sub", {}} > > > > }; > > > > settings.connect_and_fix_structure("/Equipment/Test FE/Settings"); > > > > // settings.watch(watch); <-- this works without segmentation fault > > > > > > > > odb new_settings("/Equipment/Test FE/Settings"); > > > > new_settings.watch(watch); // <-- here I am getting a segmentation fault > > > > > > > > return CM_SUCCESS; > > > > } > > > > > > > > When I directly set the watch everything runs fine however, when I create a new ODB object and use this one to set a watch I am getting the following segmentation fault: > > > > > > > > Process 18474 stopped > > > > * thread #1, queue = 'com.apple.main-thread', stop reason = EXC_BAD_ACCESS (code=1, address=0x34) > > > > frame #0: 0x000000010004fa38 test_fe`midas::odb::watch_callback(hDB=<unavailable>, hKey=<unavailable>, index=0, info=0x00006000002001c0) at odbxx.cxx:96:25 [opt] > > > > 93 if (po->m_data == nullptr) > > > > 94 mthrow("Callback received for a midas::odb object which went out of scope"); > > > > 95 midas::odb poh = search_hkey(po, hKey); > > > > -> 96 poh->m_last_index = index; > > > > 97 po->m_watch_callback(poh); > > > > 98 poh->m_last_index = -1; > > > > 99 } > > > > > > > > Best, > > > > Marius
2850	16 Sep 2024	Mark Grimes	Bug Report	Crash using ODB watch
Hi, Maybe I've misunderstood the code, but odb::watch() creates a deep copy of itself to set the watch to. The comment where this happens specifies that this is in case the current one goes out of scope. See https://bitbucket.org/tmidas/midas/src/2878647fb73648474b35223ce53a125180f751b3/src/odbxx.cxx#lines-1393:1395 So as far as I can tell allowing the current odb instance to go out of scope is supported. Thanks, Mark. > Okay, but this is then a big issue IMO. For Mu3e we do this in every frontend and I also checked again all of these watches are broken at the moment (with commit 3ad98c5 they worked). > > In the old style we did for example (see https://bitbucket.org/tmidas/midas/src/develop/examples/crfe/crfe.cxx): > > INT frontend_init() > { > HNDLE hKey; > > // create Settings structure in ODB > db_create_record(hDB, 0, "Equipment/Clock Reset/Settings", strcomb1(cr_settings_str).c_str()); > db_find_key(hDB, 0, "/Equipment/Clock Reset", &hKey); > assert(hKey); > > db_watch(hDB, hKey, cr_settings_changed, NULL); > > /* > * Set our transition sequence. The default is 500. Setting it > * to 600 means we are called AFTER most other clients. > / > cm_set_transition_sequence(TR_START, 600); > > return CM_SUCCESS; > } > > I thought this will be the same (under the hood) in the current odbxx way via: > > odb settings("Equipment/Clock Reset/Settings"); > settings.watch(cr_settings_changed); > > Best, > Marius > > > > Well, the object went* out of scope. For my code it‘s hard to realize this, so the error reporting is poor. Also the first object should have the same > > problem. Just by accident that it does not crash. > > > > Stefan > > > > > This is not the case here. Note that the error message: "Callback received for a midas::odb object which went out of scope" is not called! The segmentation fault happens later line 96. > > > > > > > The answer is in the error message: „Object went out of scope“. When your frontent_init() exits, the odb objects are destroyed. When you get a callback, it‘s linked to the > > > > destroyed object. This is like if you have a local string and pass a reference to that string in the return of the function. > > > > > > > > Use a global object (bad) or use „new“ (potential memory leak). I would use a global structure which holds all odb objects. > > > > > > > > Stefan > > > > > > > > > > > > > > last week I was running MIDAS with the commit 3ad98c5. Today I updated MIDAS and now all my watch functions are crashing. Attached I have a minimal example frontend of the problem. > > > > > > > > > > In our software we have two functions one which sets up the ODB values of the frontend and another one which sets up all watch functions. So overall we connect two time to the ODB during fronend_init one time to create the values and one time to create the watch. In the example code a simple version of this setup is shown: > > > > > > > > > > INT frontend_init() { > > > > > > > > > > cm_msg(MINFO, "frontend_init() setup", "Test FE"); > > > > > > > > > > odb settings = { > > > > > {"Test", 123}, > > > > > {"sub", {}} > > > > > }; > > > > > settings.connect_and_fix_structure("/Equipment/Test FE/Settings"); > > > > > // settings.watch(watch); <-- this works without segmentation fault > > > > > > > > > > odb new_settings("/Equipment/Test FE/Settings"); > > > > > new_settings.watch(watch); // <-- here I am getting a segmentation fault > > > > > > > > > > return CM_SUCCESS; > > > > > } > > > > > > > > > > When I directly set the watch everything runs fine however, when I create a new ODB object and use this one to set a watch I am getting the following segmentation fault: > > > > > > > > > > Process 18474 stopped > > > > > * thread #1, queue = 'com.apple.main-thread', stop reason = EXC_BAD_ACCESS (code=1, address=0x34) > > > > > frame #0: 0x000000010004fa38 test_fe`midas::odb::watch_callback(hDB=<unavailable>, hKey=<unavailable>, index=0, info=0x00006000002001c0) at odbxx.cxx:96:25 [opt] > > > > > 93 if (po->m_data == nullptr) > > > > > 94 mthrow("Callback received for a midas::odb object which went out of scope"); > > > > > 95 midas::odb poh = search_hkey(po, hKey); > > > > > -> 96 poh->m_last_index = index; > > > > > 97 po->m_watch_callback(poh); > > > > > 98 poh->m_last_index = -1; > > > > > 99 } > > > > > > > > > > Best, > > > > > Marius
2851	17 Sep 2024	Konstantin Olchanski	Bug Report	Crash using ODB watch
> { > odb new_settings("/Equipment/Test FE/Settings"); > new_settings.watch(watch); // <-- here I am getting a segmentation fault > } this code has a bug. "watch" is attached to object "new_settings" that is deleted after the closing curly bracket. I would say Stefan's odb API should not allow you to write code like this. an API defect. K.O.
2852	18 Sep 2024	Marius Koeppel	Bug Report	Crash using ODB watch
I created a PR to fix this issue https://bitbucket.org/tmidas/midas/pull-requests/42. The crash happened since the change in commit 3ad98c5 always got the ODB via XML. However, the creation from XML should only be used when a user wants to read fast (and when we are on a remote machine) so I added the flag use_from_xml to explicitly specify this. > > { > > odb new_settings("/Equipment/Test FE/Settings"); > > new_settings.watch(watch); // <-- here I am getting a segmentation fault > > } > > this code has a bug. "watch" is attached to object "new_settings" that is deleted > after the closing curly bracket. > I would say Stefan's odb API should not allow you to write code like this. an API defect. As pointed out in the thread this feature is explicitly supported by odbxx.cxx: void odb::watch(std::function<void(midas::odb &)> f) { if (m_hKey == 0 \|\| m_hKey == -1) mthrow("watch() called for ODB key \"" + m_name + "\" which is not connected to ODB"); // create a deep copy of current object in case it // goes out of scope midas::odb* ow = new midas::odb(*this); ow->m_watch_callback = f; db_watch(s_hDB, m_hKey, midas::odb::watch_callback, ow); // put object into watchlist g_watchlist.push_back(ow); } Also in the old way (see for example https://bitbucket.org/tmidas/midas/src/191d13f98626fae533cbca17b00df7ee361edf16/examples/crfe/crfe.cxx#lines-126) it was possible to create a watch in a scope without the user taking care that the "object" does not go out of scope. I think this feature should be supported by the framework. Best, Marius
2853	20 Sep 2024	Stefan Ritt	Bug Report	Crash using ODB watch
The problem has been fixed in the current version. Here is my analysis: - the midas::odb object can go out of scope in the function, since the odb::watch() function creates a deep copy of the object. This does not cause a memory leak if one call odb::unwatch_all() at the end of a program. - The creation from XML had a flaw where the ODB key handle ("hKey") is not initialized since it is not passed by the db_copy_xml() function. I added code to db_copy_xml() to also fetch the key handle in the XML file, which now fixes the issue. Please note that you have to update both the server and client side of midas to get this functionality if you are using it by a remote client. - I saw the flag MK added on his pull request to the constructor of odb::odb(). This is a way to fight the symptoms (by creating an object the "old" way if not otherwise needed, but how we have the cause cured. Nevertheless I added that parameter, but set to to true by default: odb::odb(const std::string &str, bool init_via_xml = true); since this should be fully working now and should always be faster than the old method. I only keep it for debugging should we observe another flaw in odb_from_xml(). Best regards, Stefan
2856	22 Sep 2024	Tam Kai Chung	Bug Report	Can we convert the .mid file into .root file
Dear experts, I am a new user of MIDAS. I have just created some banks by a frontend.cxx code. Now, I would like to do some analysis from the data. I have an analyzer.cxx code (A very simple one without complicated routine). I try to link the analyzer.o with rmana.o and libmidas.a to create analyzer.exe I am not sure whether I can do the analysis offline in the follow way: analyzer.exe -i run00001.mid -o run00001.root When I run this command, I get the following error: Error in <TClass::LoadClassInfo>: no interpreter information for class TSocket is available even though it has a TClass initialization routine. I am using root 6.30 Any suggestion about this issue? Thank you. Best, Terry
2858	24 Sep 2024	Konstantin Olchanski	Bug Report	Can we convert the .mid file into .root file
"Can we convert the .mid file into .root file". yes, you can, but the operation is under-defined. it's like asking "can I convert these stones into houses". the answer is "yes", but it involves more than running a universal conversion program. For this reason, I recommend against converting midas files "to root". for some types of midas data such a conversion makes no sense (i.e. alpha-g streamed udp packets with chopped compressed waveforms). I recommend that you analyze you data in the midas analyzer. You can start with manalyzer_example_root.cxx, it shows how to create a ROOT histogram, how to access midas event bank data and call the TH1 "Fill" method. Instead of filling histograms in the analyzer, you can create a ROOT TTree and fill it with data from midas data banks, effectively you will create your own custom converter from midas to root. The key thing is that it has to be a custom converter, because only you know the meaning of midas bank data and how it should be best stored in a root tree. K.O.
2861	07 Oct 2024	Amy Roberts	Bug Report	Difficulty running MIDAS on Rocky 9.4
We're trying to install the SuperCDMS version of MIDAS on a Rocky 9.4 Virtual Machine and are getting a persistent error when we run mserver. As far as I know there are minimal changes between this and the MIDAS branch, but Ben Smith may have more to say on this. [lekhraj@sdfcdmsdaq online]$ mserver mserver started interactively [mserver,INFO] Client 'ODBEdit' on buffer 'SYSMSG' removed by bm_open_buffer because process pid 481051 does not exist mserver will listen on TCP port 1175 [mserver,ERROR] [odb.cxx:2498:db_lock_database,ERROR] cannot lock ODB semaphore, timeout 10000 ms, exiting... [mserver,ERROR] [midas.cxx:2205:cm_check_connect,ERROR] cm_disconnect_experiment not called at end of program db_lock_database: Detected recursive call to db_{lock,unlock}_database() while already inside db_{lock,unlock}_database(). Maybe this is a call from a signal handler. Cannot continue, aborting... Aborted (core dumped) We thought perhaps we had a corrupted ODB file, so we removed the ODB file and tried to create a new one (sized correctly for our experiment): [lekhraj@sdfcdmsdaq online]$ odbedit -s 50000000 [ODBEdit,ERROR] [odb.cxx:2052:db_open_database,ERROR] Removed ODB client 'mserver', index 0 because process pid 481326 does not exists [ODBEdit,INFO] Removed open record flag from "/Experiment/Security/RPC hosts/Allowed hosts" [ODBEdit,INFO] Removed exclusive access mode from "/Experiment/Security/RPC hosts/Allowed hosts" [ODBEdit,INFO] Corrected 1 ODB entries [ODBEdit,INFO] Deleted entry '/System/Clients/481326' for client 'mserver' because it is not connected to ODB [ODBEdit,INFO] Client 'mserver' on buffer 'SYSMSG' removed by bm_open_buffer because process pid 481326 does not exist [local:test:S]/>Bus error (core dumped)
2862	07 Oct 2024	Ben Smith	Bug Report	Difficulty running MIDAS on Rocky 9.4
> We're trying to install the SuperCDMS version of MIDAS on a Rocky 9.4 Virtual > Machine and are getting a persistent error when we run mserver. As far as I > know there are minimal changes between this and the MIDAS branch, but Ben Smith > may have more to say on this. For reference, "the SuperCDMS version of MIDAS" is just a fork that no longer has any meaningful differences vs the main MIDAS repo, but we only pull updates infrequently after testing a bunch. We last pulled from the develop branch in November 2023. But that should be irrelevant here as semaphore code hasn't been touched for a very long time. We're running Alma 9.4 on a machine at TRIUMF and the same version of midas works fine there (Amy, you may already have access to scdms-zeus). I believe Alma and Rocky should be basically identical for this. So the questions are: * Have you tried other midas programs, or only mserver? E.g. did odbedit and mhttpd work? * If other programs work, have you been running them all as the same user? In particular, if you ran one program as root and another as an unprivileged user, then you will likely get odd permissions issues. * What do you see if you run `ls -l /dev/shm` and `ls -l ~/packages/SuperCDMS_DAQ/MidasDAQ/online/.SHM`? (Or wherever your online dir is for the 2nd one). Did you follow the full instructions for recovering from a corrupt ODB? https://daq00.triumf.ca/MidasWiki/index.php/FAQ#How_to_recover_from_a_corrupted_ODB In particular the bit about running odbinit with the --cleanup flag?
2863	07 Oct 2024	Konstantin Olchanski	Bug Report	Difficulty running MIDAS on Rocky 9.4
> We're trying to install the SuperCDMS version of MIDAS on a Rocky 9.4 Virtual > Machine and are getting a persistent error when we run mserver. > > [mserver,ERROR] [odb.cxx:2498:db_lock_database,ERROR] cannot lock ODB semaphore, > timeout 10000 ms, exiting... > db_lock_database: Detected recursive call to db_{lock,unlock}_database() while > already inside db_{lock,unlock}_database(). Maybe this is a call from a signal > handler. Cannot continue, aborting... > Aborted (core dumped) This is super very bad. Since you have a core dump, please post the stack trace here (or email it to me). I probably cannot debug your private version of midas and I will recommend that you install and run vanilla midas mserver (just while we debug this problem). Let's look at the core dump stack trace first, but likely we see a problem with System-V semaphores and hopefully it is not some breakage due to Red Hat bogosity or due to something specific to running on a virtual machine. If indeed this is Linux-kernel level breakage of System-V semaphores, solution would be to start using Posix semaphores, something I wanted to do for a long time. We already switched MIDAS shared memory from System-V to Posix shared memory. If we are lucky it is just one more crasher bug in ODB. Let's see that core dump stack trace. K.O.
2864	08 Oct 2024	Mark Grimes	Bug Report	Difficulty running MIDAS on Rocky 9.4
We run Midas with no problems on Rocky 9.4, although not in a virtual machine. We're very close to the head of `develop`. I'm fairly sure I've seen an error like this before. I didn't pay it much attention because it was transitory and I was doing something weird at the time - probably stepping through with a debugger and hit a timeout. It was definitely about an ODB semaphore but I can't recall if it was about a recursive call. Basically I think Rocky 9.4 is a red herring. Do you have another crashed copy of mserver/mhttpd running somewhere and stuck in limbo? If it's a virtual machine are you sharing the shared memory location with the host, and running another midas on there? > > We're trying to install the SuperCDMS version of MIDAS on a Rocky 9.4 Virtual > > Machine and are getting a persistent error when we run mserver. > > > > [mserver,ERROR] [odb.cxx:2498:db_lock_database,ERROR] cannot lock ODB semaphore, > > timeout 10000 ms, exiting... > > db_lock_database: Detected recursive call to db_{lock,unlock}_database() while > > already inside db_{lock,unlock}_database(). Maybe this is a call from a signal > > handler. Cannot continue, aborting... > > Aborted (core dumped) > > This is super very bad. Since you have a core dump, please post the stack trace here (or email it to me). > > I probably cannot debug your private version of midas and I will recommend that you install and run vanilla midas > mserver (just while we debug this problem). > > Let's look at the core dump stack trace first, but likely we see a problem with System-V semaphores and hopefully it > is not some breakage due to Red Hat bogosity or due to something specific to running on a virtual machine. > > If indeed this is Linux-kernel level breakage of System-V semaphores, solution would be to start using Posix > semaphores, something I wanted to do for a long time. We already switched MIDAS shared memory from System-V to Posix > shared memory. > > If we are lucky it is just one more crasher bug in ODB. Let's see that core dump stack trace. > > K.O.
2865	08 Oct 2024	Amy Roberts	Bug Report	Difficulty running MIDAS on Rocky 9.4
> > We're trying to install the SuperCDMS version of MIDAS on a Rocky 9.4 Virtual > > Machine and are getting a persistent error when we run mserver. As far as I > > know there are minimal changes between this and the MIDAS branch, but Ben Smith > > may have more to say on this. > > For reference, "the SuperCDMS version of MIDAS" is just a fork that no longer has any meaningful differences vs the main MIDAS repo, but we only pull updates infrequently after testing a bunch. We last pulled from the develop branch in November 2023. But that should be irrelevant here as semaphore code hasn't been touched for a very long time. > > We're running Alma 9.4 on a machine at TRIUMF and the same version of midas works fine there (Amy, you may already have access to scdms-zeus). I believe Alma and Rocky should be basically identical for this. > > So the questions are: > * Have you tried other midas programs, or only mserver? E.g. did odbedit and mhttpd work? > * If other programs work, have you been running them all as the same user? In particular, if you ran one program as root and another as an unprivileged user, then you will likely get odd permissions issues. > * What do you see if you run `ls -l /dev/shm` and `ls -l ~/packages/SuperCDMS_DAQ/MidasDAQ/online/.SHM`? (Or wherever your online dir is for the 2nd one). > Did you follow the full instructions for recovering from a corrupt ODB? https://daq00.triumf.ca/MidasWiki/index.php/FAQ#How_to_recover_from_a_corrupted_ODB In particular the bit about running odbinit with the --cleanup flag? Here's what happens when I try to run odbedit: [lekhraj@sdfcdmsdaq setup]$ odbedit [ODBEdit,ERROR] [odb.cxx:2052:db_open_database,ERROR] Removed ODB client 'ODBEdit', index 0 because process pid 481823 does not exists [ODBEdit,INFO] Removed open record flag from "/Experiment/Security/RPC hosts/Allowed hosts" [ODBEdit,INFO] Removed exclusive access mode from "/Experiment/Security/RPC hosts/Allowed hosts" [ODBEdit,INFO] Corrected 1 ODB entries [ODBEdit,INFO] Deleted entry '/System/Clients/481823' for client 'ODBEdit' because it is not connected to ODB [ODBEdit,INFO] Client 'ODBEdit' on buffer 'SYSMSG' removed by bm_open_buffer because process pid 481823 does not exist [ODBEdit,ERROR] [odb.cxx:2498:db_lock_database,ERROR] cannot lock ODB semaphore, timeout 10000 ms, exiting... [ODBEdit,ERROR] [midas.cxx:2205:cm_check_connect,ERROR] cm_disconnect_experiment not called at end of program db_lock_database: Detected recursive call to db_{lock,unlock}_database() while already inside db_{lock,unlock}_database(). Maybe this is a call from a signal handler. Cannot continue, aborting... Aborted (core dumped) [lekhraj@sdfcdmsdaq setup]$ And mhttpd: [lekhraj@sdfcdmsdaq setup]$ mhttpd [mhttpd,ERROR] [odb.cxx:2052:db_open_database,ERROR] Removed ODB client 'ODBEdit', index 0 because process pid 601054 does not exists [mhttpd,INFO] Removed open record flag from "/Experiment/Security/RPC hosts/Allowed hosts" [mhttpd,INFO] Removed exclusive access mode from "/Experiment/Security/RPC hosts/Allowed hosts" [mhttpd,INFO] Corrected 1 ODB entries [mhttpd,INFO] Deleted entry '/System/Clients/601054' for client 'ODBEdit' because it is not connected to ODB [mhttpd,INFO] Client 'ODBEdit' on buffer 'SYSMSG' removed by bm_open_buffer because process pid 601054 does not exist [mhttpd,INFO] ODB subtree /Runinfo corrected successfully Password protection is off Hostlist off, connections from anywhere will be accepted Listening on "http://localhost:8080", passwords OFF, hostlist OFF Listening on "http://[::1]:8080", passwords OFF, hostlist OFF bm_lock_buffer: Lock buffer "SYSMSG" is taking longer than 1 second! [mhttpd,ERROR] [odb.cxx:2498:db_lock_database,ERROR] cannot lock ODB semaphore, timeout 10000 ms, exiting... [mhttpd,ERROR] [midas.cxx:2205:cm_check_connect,ERROR] cm_disconnect_experiment not called at end of program db_lock_database: Detected recursive call to db_{lock,unlock}_database() while already inside db_{lock,unlock}_database(). Maybe this is a call from a signal handler. Cannot continue, aborting... Aborted (core dumped) [lekhraj@sdfcdmsdaq setup]$ We have been running everything as a single user, the user who cloned the repositories and owns the directories. We did follow the corrupted-ODB cleanup instructions. [lekhraj@sdfcdmsdaq setup]$ ls -lh /dev/shm total 1.3M -rw------- 1 lekhraj dm 1.2M Oct 8 14:13 17468_test_ODB__sdf_home_l_lekhraj_packages_SuperCDMS_DAQ_MidasDAQ_online_ -rw------- 1 lekhraj dm 114K Oct 7 14:06 17468_test_SYSMSG__sdf_home_l_lekhraj_packages_SuperCDMS_DAQ_MidasDAQ_online_ [lekhraj@sdfcdmsdaq setup]$ ls -lh ~/packages/SuperCDMS_DAQ/MidasDAQ/online/.*SHM -rw-r--r-- 1 lekhraj dm 0 Oct 3 08:46 /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/MidasDAQ/online/.ALARM.SHM -rw-r--r-- 1 lekhraj dm 0 Oct 3 08:46 /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/MidasDAQ/online/.ELOG.SHM -rw-r--r-- 1 lekhraj dm 0 Oct 3 08:46 /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/MidasDAQ/online/.HISTORY.SHM -rw-r--r-- 1 lekhraj dm 0 Oct 3 08:46 /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/MidasDAQ/online/.LAZY.SHM -rw-r--r-- 1 lekhraj dm 0 Oct 3 08:46 /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/MidasDAQ/online/.MSG.SHM -rw-r--r-- 1 lekhraj dm 1.2M Oct 8 14:12 /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/MidasDAQ/online/.ODB.SHM -rw-r--r-- 1 lekhraj dm 0 Oct 3 08:46 /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/MidasDAQ/online/.SYSMSG.SHM -rw-r--r-- 1 lekhraj dm 0 Oct 3 08:46 /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/MidasDAQ/online/.SYSTEM.SHM
2866	08 Oct 2024	Amy Roberts	Bug Report	Difficulty running MIDAS on Rocky 9.4
> > We're trying to install the SuperCDMS version of MIDAS on a Rocky 9.4 Virtual > > Machine and are getting a persistent error when we run mserver. > > > > [mserver,ERROR] [odb.cxx:2498:db_lock_database,ERROR] cannot lock ODB semaphore, > > timeout 10000 ms, exiting... > > db_lock_database: Detected recursive call to db_{lock,unlock}_database() while > > already inside db_{lock,unlock}_database(). Maybe this is a call from a signal > > handler. Cannot continue, aborting... > > Aborted (core dumped) > > This is super very bad. Since you have a core dump, please post the stack trace here (or email it to me). > > I probably cannot debug your private version of midas and I will recommend that you install and run vanilla midas > mserver (just while we debug this problem). > > Let's look at the core dump stack trace first, but likely we see a problem with System-V semaphores and hopefully it > is not some breakage due to Red Hat bogosity or due to something specific to running on a virtual machine. > > If indeed this is Linux-kernel level breakage of System-V semaphores, solution would be to start using Posix > semaphores, something I wanted to do for a long time. We already switched MIDAS shared memory from System-V to Posix > shared memory. > > If we are lucky it is just one more crasher bug in ODB. Let's see that core dump stack trace. > > K.O. I've uploaded the current core dump at: https://gitlab.com/det-lab/coredumps#. This was done using the "CDMS" version of MIDAS, I'll compile the current MIDAS repository just to be sure we're seeing the same error and report back here!
2867	08 Oct 2024	Konstantin Olchanski	Bug Report	Difficulty running MIDAS on Rocky 9.4
> I've uploaded the current core dump at: https://gitlab.com/det-lab/coredumps#. I cannot read the core dump without the corresponding executable (and likely all it's shared libraries). It is best if you run gdb and extract the stack traces on your end. In case you are not familiar with gdb: gdb mserver core # start gdb bt # stack trace of crashed thread info thr # get list of threads thr 1 bt thr 2 bt # etc, get stack trace of each thread, there should not be too many of them K.O.
2868	08 Oct 2024	Konstantin Olchanski	Bug Report	Difficulty running MIDAS on Rocky 9.4
I read these error messages. There is no ODB corruption. ODB semaphore is locked and all midas programs will fail, they will timeout trying to get the lock, report the timeout, then it looks like a bug was introduced where instead of hard exit or abort() they attempt a clean shutdown which crashes from a recursive call in db_lock_database(). Amy's core dump should confirm this. K.O. > > > We're trying to install the SuperCDMS version of MIDAS on a Rocky 9.4 Virtual > > > Machine and are getting a persistent error when we run mserver. As far as I > > > know there are minimal changes between this and the MIDAS branch, but Ben Smith > > > may have more to say on this. > > > > For reference, "the SuperCDMS version of MIDAS" is just a fork that no longer has any meaningful differences vs the main MIDAS repo, but we only pull updates infrequently after testing a bunch. We last pulled from the develop branch in November 2023. But that should be irrelevant here as semaphore code hasn't been touched for a very long time. > > > > We're running Alma 9.4 on a machine at TRIUMF and the same version of midas works fine there (Amy, you may already have access to scdms-zeus). I believe Alma and Rocky should be basically identical for this. > > > > So the questions are: > > * Have you tried other midas programs, or only mserver? E.g. did odbedit and mhttpd work? > > * If other programs work, have you been running them all as the same user? In particular, if you ran one program as root and another as an unprivileged user, then you will likely get odd permissions issues. > > * What do you see if you run `ls -l /dev/shm` and `ls -l ~/packages/SuperCDMS_DAQ/MidasDAQ/online/.SHM`? (Or wherever your online dir is for the 2nd one). > > Did you follow the full instructions for recovering from a corrupt ODB? https://daq00.triumf.ca/MidasWiki/index.php/FAQ#How_to_recover_from_a_corrupted_ODB In particular the bit about running odbinit with the --cleanup flag? > > Here's what happens when I try to run odbedit: > > [lekhraj@sdfcdmsdaq setup]$ odbedit > [ODBEdit,ERROR] [odb.cxx:2052:db_open_database,ERROR] Removed ODB client 'ODBEdit', index 0 because process pid 481823 does not exists > [ODBEdit,INFO] Removed open record flag from "/Experiment/Security/RPC hosts/Allowed hosts" > [ODBEdit,INFO] Removed exclusive access mode from "/Experiment/Security/RPC hosts/Allowed hosts" > [ODBEdit,INFO] Corrected 1 ODB entries > [ODBEdit,INFO] Deleted entry '/System/Clients/481823' for client 'ODBEdit' because it is not connected to ODB > [ODBEdit,INFO] Client 'ODBEdit' on buffer 'SYSMSG' removed by bm_open_buffer because process pid 481823 does not exist > [ODBEdit,ERROR] [odb.cxx:2498:db_lock_database,ERROR] cannot lock ODB semaphore, timeout 10000 ms, exiting... > [ODBEdit,ERROR] [midas.cxx:2205:cm_check_connect,ERROR] cm_disconnect_experiment not called at end of program > db_lock_database: Detected recursive call to db_{lock,unlock}_database() while already inside db_{lock,unlock}_database(). Maybe this is a call from a signal handler. Cannot continue, aborting... > Aborted (core dumped) > [lekhraj@sdfcdmsdaq setup]$ > > And mhttpd: > > [lekhraj@sdfcdmsdaq setup]$ mhttpd > [mhttpd,ERROR] [odb.cxx:2052:db_open_database,ERROR] Removed ODB client 'ODBEdit', index 0 because process pid 601054 does not exists > [mhttpd,INFO] Removed open record flag from "/Experiment/Security/RPC hosts/Allowed hosts" > [mhttpd,INFO] Removed exclusive access mode from "/Experiment/Security/RPC hosts/Allowed hosts" > [mhttpd,INFO] Corrected 1 ODB entries > [mhttpd,INFO] Deleted entry '/System/Clients/601054' for client 'ODBEdit' because it is not connected to ODB > [mhttpd,INFO] Client 'ODBEdit' on buffer 'SYSMSG' removed by bm_open_buffer because process pid 601054 does not exist > [mhttpd,INFO] ODB subtree /Runinfo corrected successfully > Password protection is off > Hostlist off, connections from anywhere will be accepted > Listening on "http://localhost:8080", passwords OFF, hostlist OFF > Listening on "http://[::1]:8080", passwords OFF, hostlist OFF > bm_lock_buffer: Lock buffer "SYSMSG" is taking longer than 1 second! > [mhttpd,ERROR] [odb.cxx:2498:db_lock_database,ERROR] cannot lock ODB semaphore, timeout 10000 ms, exiting... > [mhttpd,ERROR] [midas.cxx:2205:cm_check_connect,ERROR] cm_disconnect_experiment not called at end of program > db_lock_database: Detected recursive call to db_{lock,unlock}_database() while already inside db_{lock,unlock}_database(). Maybe this is a call from a signal handler. Cannot continue, aborting... > Aborted (core dumped) > [lekhraj@sdfcdmsdaq setup]$ > > We have been running everything as a single user, the user who cloned the repositories and owns the directories. > > We did follow the corrupted-ODB cleanup instructions. > > [lekhraj@sdfcdmsdaq setup]$ ls -lh /dev/shm > total 1.3M > -rw------- 1 lekhraj dm 1.2M Oct 8 14:13 17468_test_ODB__sdf_home_l_lekhraj_packages_SuperCDMS_DAQ_MidasDAQ_online_ > -rw------- 1 lekhraj dm 114K Oct 7 14:06 17468_test_SYSMSG__sdf_home_l_lekhraj_packages_SuperCDMS_DAQ_MidasDAQ_online_ > > [lekhraj@sdfcdmsdaq setup]$ ls -lh ~/packages/SuperCDMS_DAQ/MidasDAQ/online/.*SHM > -rw-r--r-- 1 lekhraj dm 0 Oct 3 08:46 /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/MidasDAQ/online/.ALARM.SHM > -rw-r--r-- 1 lekhraj dm 0 Oct 3 08:46 /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/MidasDAQ/online/.ELOG.SHM > -rw-r--r-- 1 lekhraj dm 0 Oct 3 08:46 /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/MidasDAQ/online/.HISTORY.SHM > -rw-r--r-- 1 lekhraj dm 0 Oct 3 08:46 /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/MidasDAQ/online/.LAZY.SHM > -rw-r--r-- 1 lekhraj dm 0 Oct 3 08:46 /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/MidasDAQ/online/.MSG.SHM > -rw-r--r-- 1 lekhraj dm 1.2M Oct 8 14:12 /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/MidasDAQ/online/.ODB.SHM > -rw-r--r-- 1 lekhraj dm 0 Oct 3 08:46 /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/MidasDAQ/online/.SYSMSG.SHM > -rw-r--r-- 1 lekhraj dm 0 Oct 3 08:46 /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/MidasDAQ/online/.SYSTEM.SHM
2869	08 Oct 2024	Konstantin Olchanski	Bug Report	Difficulty running MIDAS on Rocky 9.4
> Basically I think Rocky 9.4 is a red herring. This is what likely happened: - some program crashed while holding the ODB lock semaphore (by ctrl-C at the wrong time or by kill -KILL at the wring time) - semaphore is locked with a flag "unlock if this program stops" - this is supposed to ensure ODB lock semaphore never gets stuck in the locked state - (there is no code in MIDAS to unlock the ODB lock semaphore without locking it first) - we have observed a malfunction in the Linux kernel, where this automatic unlock does not happen. - it is rare and so far cannot be reproduced. you can find more about it by searching this forum. - I think it is a bug in the System-V semaphore code or in the Linux "program stop" code (a path where they fail to call the semaphore unlock handler, and who knows what other handlers). - System-V semaphores are very obsolete, replaced by POSIX semaphores. - POSIX semaphores do not have the "unlock if this program stops" magic, the user (MIDAS) is responsible with detecting that the program who locked the semaphore is gone and and with taking corrective action, i.e. release the lock, automatically. I do not know if this problem with System-V semaphores is in the generic Linux kernel, or if it is specific to the Red Hat kernels (they are known to have many patches, deviating quite far from vanilla kernels). I do not know if this problem is somehow sensitive to virtual machines. So yes/no, Red Hat derived linux on a virtual machine could be where this problem happens more often that elsewhere. K.O.
2870	08 Oct 2024	Konstantin Olchanski	Bug Report	Difficulty running MIDAS on Rocky 9.4
> I read these error messages. There is no ODB corruption. ODB semaphore is locked and all midas programs will fail... Recovery from this is: - stop all midas programs (actually they should have all crashed by now) - identify the ODB semaphore with: ipcs -s -t - remove the ODB semaphore with: ipcrm sem <semid> - where <semid> is from the first column of ipcs - keep deleting semaphores until odbedit works. - if you delete extra midas sempahores, odbedit will recreate them - if you delete non-midas semaphores, oh, well... Little bit better steps for this recovery may be written up by Suzannah in this forum or in the midas wiki... good luck finding them. K.O.
2873	10 Oct 2024	Amy Roberts	Bug Report	Difficulty running MIDAS on Rocky 9.4
> > I've uploaded the current core dump at: https://gitlab.com/det-lab/coredumps#. > > I cannot read the core dump without the corresponding executable (and likely all it's shared libraries). > > It is best if you run gdb and extract the stack traces on your end. > > In case you are not familiar with gdb: > > gdb mserver core # start gdb > bt # stack trace of crashed thread > info thr # get list of threads > thr 1 > bt > thr 2 > bt > # etc, get stack trace of each thread, there should not be too many of them > > K.O. Hi Konstantin, thanks for the instructions. I do appear to be missing some debug symbols, but the output looks potentially useful: [lekhraj@sdfcdmsdaq ~]$ gdb mserver core.mserver.17468.b174bb74f2bb44f9a0905e78ec6b2677.601715.1728422354000000 GNU gdb (GDB) Rocky Linux 10.2-11.1.el9_3 ... For help, type "help". Type "apropos word" to search for commands related to "word"... Reading symbols from mserver... [New LWP 601715] warning: Section `.reg-xstate/601715' in core file too small. [Thread debugging using libthread_db enabled] Using host libthread_db library "/lib64/libthread_db.so.1". Core was generated by `mserver'. Program terminated with signal SIGABRT, Aborted. warning: Section `.reg-xstate/601715' in core file too small. #0 0x00007fbdeaca154c in __pthread_kill_implementation () from /lib64/libc.so.6 Missing separate debuginfos, use: dnf debuginfo-install glibc-2.34-83.el9.12.x86_64 libgcc-11.4.1- 3.el9.x86_64 libstdc++-11.4.1-2.1.el9.x86_64 libzstd-1.5.1-2.el9.x86_64 mysql-libs-8.0.36-1.el9_3.x86_64 openssl-libs-3.0.7-25.el9_3.x86_64 zlib-1.2.11-40.el9.x86_64 (gdb) (gdb) bt #0 0x00007fbdeaca154c in __pthread_kill_implementation () from /lib64/libc.so.6 #1 0x00007fbdeac54d06 in raise () from /lib64/libc.so.6 #2 0x00007fbdeac287f3 in abort () from /lib64/libc.so.6 #3 0x0000000000430ee4 in db_lock_database (hDB=hDB@entry=1) at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/odb.cxx:2473 #4 0x0000000000437e9c in db_find_key (subhKey=0x7ffcc536d348, key_name=0x4687a8 "/Logger/Message file date format", hKey=0, hDB=1) at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/odb.cxx:4099 #5 db_find_key (hDB=1, hKey=0, key_name=0x4687a8 "/Logger/Message file date format", subhKey=0x7ffcc536d348) at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/odb.cxx:4075 #6 0x0000000000448297 in db_get_value_string (hdb=1, hKeyRoot=hKeyRoot@entry=0, key_name=key_name@entry=0x4687a8 "/Logger/Message file date format", index=index@entry=0, s=s@entry=0x7ffcc536d470, create=create@entry=1, create_string_length=0) at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/odb.cxx:13950 #7 0x000000000040a690 in cm_msg_get_logfile (fac=<optimized out>, t=<optimized out>, filename=0x7ffcc536d690, linkname=0x7ffcc536d6b0, linktarget=0x7ffcc536d6d0) at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/midas.cxx:573 #8 0x000000000041a307 in cm_msg_log (message_type=1, facility=0x46db0e "midas", message=0x7e4290 "[mserver,ERROR] [odb.cxx:2498:db_lock_database,ERROR] cannot lock ODB semaphore, timeout 10000 ms, exiting...") at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/midas.cxx:685 #9 0x0000000000421fcd in cm_msg_flush_buffer () at /usr/include/c++/11/bits/basic_string.h:194 #10 0x00007fbdeac574dd in __run_exit_handlers () from /lib64/libc.so.6 #11 0x00007fbdeac57620 in exit () from /lib64/libc.so.6 #12 0x0000000000430f7a in db_lock_database (hDB=hDB@entry=1) at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/odb.cxx:2499 #13 0x0000000000437e9c in db_find_key (subhKey=0x7ffcc536da04, key_name=0x476a21 "/Alarms/Alarms", hKey=0, hDB=1) at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/odb.cxx:4099 #14 db_find_key (hDB=1, hKey=hKey@entry=0, key_name=key_name@entry=0x476a21 "/Alarms/Alarms", subhKey=subhKey@entry=0x7ffcc536da04) at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/odb.cxx:4075 #15 0x0000000000455fd2 in al_check () at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/alarm.cxx:614 --Type <RET> for more, q to quit, c to continue without paging-- #16 0x000000000041ff85 in cm_periodic_tasks () at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/midas.cxx:5596 #17 0x00000000004235c5 in cm_yield (millisec=millisec@entry=1000) at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/midas.cxx:5676 #18 0x00000000004065c2 in main (argc=<optimized out>, argv=0x7ffcc536e628) at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/progs/mserver.cxx:295 (gdb) info thr Id Target Id Frame * 1 Thread 0x7fbdec0b1740 (LWP 601715) 0x00007fbdeaca154c in __pthread_kill_implementation () from /lib64/libc.so.6 (gdb) thr 1 [Switching to thread 1 (Thread 0x7fbdec0b1740 (LWP 601715))] #0 0x00007fbdeaca154c in __pthread_kill_implementation () from /lib64/libc.so.6 (gdb) bt #0 0x00007fbdeaca154c in __pthread_kill_implementation () from /lib64/libc.so.6 #1 0x00007fbdeac54d06 in raise () from /lib64/libc.so.6 #2 0x00007fbdeac287f3 in abort () from /lib64/libc.so.6 #3 0x0000000000430ee4 in db_lock_database (hDB=hDB@entry=1) at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/odb.cxx:2473 #4 0x0000000000437e9c in db_find_key (subhKey=0x7ffcc536d348, key_name=0x4687a8 "/Logger/Message file date format", hKey=0, hDB=1) at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/odb.cxx:4099 #5 db_find_key (hDB=1, hKey=0, key_name=0x4687a8 "/Logger/Message file date format", subhKey=0x7ffcc536d348) at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/odb.cxx:4075 #6 0x0000000000448297 in db_get_value_string (hdb=1, hKeyRoot=hKeyRoot@entry=0, key_name=key_name@entry=0x4687a8 "/Logger/Message file date format", index=index@entry=0, s=s@entry=0x7ffcc536d470, create=create@entry=1, create_string_length=0) at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/odb.cxx:13950 #7 0x000000000040a690 in cm_msg_get_logfile (fac=<optimized out>, t=<optimized out>, filename=0x7ffcc536d690, linkname=0x7ffcc536d6b0, linktarget=0x7ffcc536d6d0) at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/midas.cxx:573 #8 0x000000000041a307 in cm_msg_log (message_type=1, facility=0x46db0e "midas", message=0x7e4290 "[mserver,ERROR] [odb.cxx:2498:db_lock_database,ERROR] cannot lock ODB semaphore, timeout 10000 ms, exiting...") at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/midas.cxx:685 #9 0x0000000000421fcd in cm_msg_flush_buffer () at /usr/include/c++/11/bits/basic_string.h:194 #10 0x00007fbdeac574dd in __run_exit_handlers () from /lib64/libc.so.6 #11 0x00007fbdeac57620 in exit () from /lib64/libc.so.6 #12 0x0000000000430f7a in db_lock_database (hDB=hDB@entry=1) at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/odb.cxx:2499 #13 0x0000000000437e9c in db_find_key (subhKey=0x7ffcc536da04, key_name=0x476a21 "/Alarms/Alarms", hKey=0, hDB=1) at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/odb.cxx:4099 #14 db_find_key (hDB=1, hKey=hKey@entry=0, key_name=key_name@entry=0x476a21 "/Alarms/Alarms", subhKey=subhKey@entry=0x7ffcc536da04) at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/odb.cxx:4075 #15 0x0000000000455fd2 in al_check () at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/alarm.cxx:614 #16 0x000000000041ff85 in cm_periodic_tasks () at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/midas.cxx:5596 #17 0x00000000004235c5 in cm_yield (millisec=millisec@entry=1000) at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/src/midas.cxx:5676 #18 0x00000000004065c2 in main (argc=<optimized out>, argv=0x7ffcc536e628) at /sdf/home/l/lekhraj/packages/SuperCDMS_DAQ/midas_fork/progs/mserver.cxx:295 (gdb)

Goto page Previous 1, 2, 3 ... 130, 131, 132 ... 150, 151, 152 Next

ELOG V3.1.4-2e1708b5