ELOG Midas

Midas DAQ System, Page 142 of 161

ID	Date	Author	Topic	Subject
386	09 Jun 2007	Randolf Pohl	Forum	crash when analyzing multiple runs offline
Hello Stefan,

tree_struct.n_tree keeps counting up from run to run (in book_ttree). This should presumably not be the case, since CloseRootOutputFile() frees the trees at eor().

------------------- output ---------------------------
lamb@lamb2:~/midas/root_3705> ./analyzer -e exa_root -i /tmp/midas/examples/root/run%05d.mid -o /tmp/midas/run%05d.root -r 1 2
Root server listening on port 9090...
Running analyzer offline. Stop with "!"
book_ttree: tree_struct.n_tree = 1
book_ttree: tree_struct.n_tree = 2
Set run number 1 in ODB
Load ODB from run 1...OK
/tmp/midas/examples/root/run00001.mid:2722 /tmp/midas/run00001.root:2720 events, 0.21s
book_ttree: tree_struct.n_tree = 3 <<---- !!!!
book_ttree: tree_struct.n_tree = 4
Set run number 2 in ODB
Load ODB from run 2...OK
/tmp/midas/examples/root/run00002.mid:2347 /tmp/midas/run00002.root:2345 events, 0.18s
 * Break * segmentation violation
----------------- \output ----------------------------

Adding this one line fixes the segfault problem for the root example expt.

----------------- code -------------------------
lamb@lamb2:/data/software/midas/midas_3705/src/src> svn diff mana.c
Index: mana.c
===================================================================
--- mana.c (revision 3705)
+++ mana.c (working copy)
@@ -1496,6 +1496,7 @@
 /* delete event tree */
 free(tree_struct.event_tree);
 tree_struct.event_tree = NULL;
+ tree_struct.n_tree = 0;
 // go to ROOT root directory
 gROOT->cd();
---------------- \code ---------------------------

Please check if this gives the intended behaviour. I am not very familiar with the midas internals.

Unfortunately my own analyzer's segfault problem is not solved by this patch. I guess I have to keep searching for a bug on my side..... :-)

Cheers,
Randolf
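
The following is only an illustrative sketch of why a counter that is not reset can lead to a bad free() on the next run; it is not the mana.c code. The names TREE_STRUCT, ENTRY, book_tree() and close_trees() are hypothetical stand-ins, and the assumption is that the cleanup loops over n_tree entries and frees a per-entry branch buffer, as the backtrace in message 385 suggests.

```c
#include <stdlib.h>

/* Hypothetical, simplified stand-ins for the bookkeeping discussed above. */
typedef struct {
   void *branch;              /* per-entry buffer, freed at end of run */
} ENTRY;

typedef struct {
   int    n_tree;             /* number of booked entries */
   ENTRY *event_tree;         /* array holding n_tree entries */
} TREE_STRUCT;

/* Book one more entry; returns the new count, or -1 on allocation failure. */
int book_tree(TREE_STRUCT *ts, size_t branch_size)
{
   ENTRY *p = realloc(ts->event_tree, (ts->n_tree + 1) * sizeof(ENTRY));
   if (p == NULL)
      return -1;
   ts->event_tree = p;

   ts->event_tree[ts->n_tree].branch = malloc(branch_size);
   if (ts->event_tree[ts->n_tree].branch == NULL)
      return -1;

   ts->n_tree++;
   return ts->n_tree;
}

/* End-of-run cleanup: free every entry, the array, and reset the counter. */
void close_trees(TREE_STRUCT *ts)
{
   int i;

   for (i = 0; i < ts->n_tree; i++)
      free(ts->event_tree[i].branch);
   free(ts->event_tree);
   ts->event_tree = NULL;

   /* Without this reset, the next run would grow a fresh array to the old
      count plus one, leaving the first entries uninitialized; the next
      cleanup would then call free() on garbage pointers. */
   ts->n_tree = 0;
}
```

In this sketch, booking two entries, closing, and booking again starts counting from 1, which is the behaviour the one-line patch is meant to restore.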
385	08 Jun 2007	Stefan Ritt	Forum	crash when analyzing multiple runs offline
Unfortunately I don't have time right now to debug the problem, but I could see roughly what it could be. The analyzer crashes inside CloseRootOutputFile:

#5 <signal handler called>
#6 0x00002b5f52ad5ee5 in free () from /lib64/libc.so.6
#7 0x000000000040c89b in CloseRootOutputFile () at src/mana.c:1489

in the line

free(tree_struct.event_tree[i].branch);

If a "free" crashes, it might indicate that the memory beyond the allocated space got corrupted. The branch gets allocated in book_ttree(), once for each analyze_request[i]. The branch gets filled in write_event_ttree():

/* fill tree both online and offline */
if (!exclude_all)
   et->tree->Fill();

Maybe one should put printf debugging statements in these places to see what's going on.
384	08 Jun 2007	Stefan Ritt	Suggestion	RFC- ACLs for midas rpc, mserver, mhttpd access
First I have a general question: mserver is started through xinetd, and xinetd has the options "only_from" and "no_access". This is equivalent to the tcp_wrapper functionality. Why not use this? It's possible without changing anything in midas. Or am I missing anything?

If that does not work for some reason, here are some thoughts from my side:

- We don't have much of a problem with malicious hackers, but with institute-wide security checking. Hackers are only interested in mechanisms where they can obtain control over thousands of machines (like breaking ssh etc.). The few midas machines are not a good target for them. But even at PSI there are security scans, which try to connect to various ports and can crash systems, so I agree that something needs to be done.
- Whatever we do, it should be consistent on linux and windows and should not rely on external packages, since I don't want to get into dependencies there.
- I see that both having the security information in the ODB or having it in external files can be advantageous. There is certainly the aspect of restoring old ODBs, or keeping several experiments (ODB) on one machine consistent. On the other hand, storing data in the ODB might be liked by people who are familiar with this concept and want to change things through mhttpd, for example.
- Having said all that, it would make sense to me to write a simple central routine access_allowed(), which takes the IP address of a remote client wanting to connect, and returns true or false. This routine should read /etc/hosts.allow and /etc/hosts.deny and interpret them, but only the section for midas, and maybe only a subset of the functionality there (we probably don't need NIS netgroup names, external files and spawn commands there). If the files /etc/hosts.x do not contain anything about midas or are not present (Windows!), the routine should look in the ODB under /experiment/security/mserver/hosts.allow and /experiment/security/mserver/hosts.deny and use that information instead of the files.
- We probably need different mechanisms for mserver and for mhttpd. The mserver clients are usually only a few programs like the front-ends, while one may want to control an experiment over mhttpd from many more machines. So we should establish a second ACL for mhttpd. The already present "/experiment/security/allowed hosts" for mhttpd should be converted into "/Experiment/Security/mhttpd/hosts.allow" and the function access_allowed() should be used to interpret that, so that we only need to write it once.
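
A minimal sketch of what such an access_allowed() check could look like, under several assumptions: the allow and deny lists are passed in as already-parsed arrays of strings (reading /etc/hosts.allow, /etc/hosts.deny or the ODB is left out), only three pattern forms are handled ("ALL", an exact address, and a numeric prefix ending in "."), and the decision order follows the usual hosts.allow-then-hosts.deny convention. The extra parameters and the helper names host_match() and list_match() are hypothetical and not part of midas.

```c
#include <string.h>

/* Return 1 if ip matches pattern. Supported forms in this sketch:
   "ALL", an exact address, or a prefix ending in '.' such as "192.168.1." */
static int host_match(const char *pattern, const char *ip)
{
   size_t n = strlen(pattern);

   if (strcmp(pattern, "ALL") == 0)
      return 1;
   if (n > 0 && pattern[n - 1] == '.')
      return strncmp(pattern, ip, n) == 0;
   return strcmp(pattern, ip) == 0;
}

/* Return 1 if ip matches any pattern in the list. */
static int list_match(const char *const *list, int n, const char *ip)
{
   int i;

   for (i = 0; i < n; i++)
      if (host_match(list[i], ip))
         return 1;
   return 0;
}

/* Decide whether a remote client may connect:
   1. a match in the allow list grants access,
   2. otherwise a match in the deny list refuses access,
   3. otherwise access is granted. */
int access_allowed(const char *ip,
                   const char *const *allow, int n_allow,
                   const char *const *deny, int n_deny)
{
   if (list_match(allow, n_allow, ip))
      return 1;
   if (list_match(deny, n_deny, ip))
      return 0;
   return 1;
}
```

With an allow list of "192.168.1." and "10.0.0.5" and a deny list of "ALL", only the private subnet and the single named host get through; with an empty deny list everything is accepted, so a real deployment would presumably want "ALL" in the deny list.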
383	07 Jun 2007	John M O'Donnell	Suggestion	RFC- ACLs for midas rpc, mserver, mhttpd access
I am in favor of tcp_wrappers. tcp_wrappers is well understood. It works well in combination with a firewall.

mhttpd hangs when our security folks scan us. We are not allowed to block them with a firewall, but we can use tcpwrappers. Would it make sense to put the same mechanism on mserver?

The man page for libtcpwrappers.a (taken from the tcpwrappers7.6 tar ball) is attached, along with the output after running it through nroff -man.

The odb is too fragile for security. It is not understood well enough by many experimenters.

As you can see I am in favor of tcp_wrappers. This is mainly because it is part of an existing and tested security model. I don't know about the windows world, but as you can also see, I vote for using something that is already part of the windows security model.

Here's an example of how well the integrated security model works: if a person is part of an experiment, I make sure they can ssh to the experiment's computer; the same rules could provide them with web access.

Second is that when a change is needed to the security model, it is easy to keep it current. What if somebody restores an old ODB? What if they set up a small test with a new ODB?

If mhttpd used tcp_wrappers, then all our machines here at LANL would already be configured! No need for users to do any root access (though those that need it have it anyway).

John.
382	07 Jun 2007	Randolf Pohl	Forum	crash when analyzing multiple runs offline
Hello,

I am having a problem with the root-based analyzer. It crashes when I try to analyze multiple runs OFFLINE using the "-i run%05d.mid -o result%05d.root -r 1 2" feature. I can reproduce the problem with the example experiment which comes with the MIDAS distribution:

Running the analyzer ONLINE works fine: One can start and stop runs one after the other, roody shows the histograms being reset and then filled again and such.

But OFFLINE, the analyzer crashes when trying to analyze the SECOND run in a sequence. So

./analyzer -i run%05d.mid -o result%05d.root -r 1 1

works (only run 1)

./analyzer -i run%05d.mid -o result%05d.root -r 1 3

dies on run 2

Output attached (I added printf's to the "init"-modules, but that's irrelevant here)

My own analyzer shows the same effect. There I got the impression the segfault happens on the first attempt to Fill/Reset/SetName etc. a histogram in the 2nd run. But with the midas example it looks like the analyzer finishes filling histos even for run 2, but then dies in eor.

Can you reproduce the problem?

I run MIDAS on an Intel Quadcore, 64 bit SuSE Linux 10.2.

pohl@lamb2:~/midas/examples/root> gcc --version
gcc (GCC) 4.1.2 20061115 (prerelease) (SUSE Linux)

(maybe 4.1.2 "PRERELEASE" is the problem? See message ID 344)

I am using midas rev. 3674 (April 19, 2007), but I got the impression there has since not been a change relevant to this problem. Please correct me if I am wrong, then I would try it with Rev HEAD. (My version includes already the fix to the x86_64 segfault problem of message ID 337)

Best regards,
Randolf
381	07 Jun 2007	Konstantin Olchanski	Suggestion	RFC- ACLs for midas rpc, mserver, mhttpd access
Running MIDAS at CERN is proving more challenging than I expected. The network environment is not as benign as I am used to (i.e. at TRIUMF) and our machines are being constantly probed by something/somebody. This already caused failures in the mserver (fixed in midas svn) and I would like to resolve this problem once and for all. The age of "nice networks" is over.

The case of the mserver and of the midas rpc servers (every midas application listens for midas rpc requests, i.e. run transitions) is simple. The list of machines running midas applications is known ahead of time, so we can put them all into a list of permitted machines and deny rpc connections to anybody else. I propose we keep this list of permitted mserver clients in "/experiment/security/mserver hosts".

(The already existing "/experiment/security/allowed hosts" mechanism is insufficient: it does not prevent the mserver from accepting connections from hostile machines, and talking to them, for example giving them the list of available experiments. There is a fair amount of code involved and I do not presume to certify any of it as hack-proof or even as crash-proof.)

For mhttpd http:// access control, I thought of using tcp_wrappers, but C-API documentation does not exist (I looked), the example code in tcpd.c is way too complicated, editing the ACL /etc/hosts.allow unnecessarily requires root privileges and none of it would work on Windows. So I am favouring a home-made hostname or ip-address filter, similar to /etc/hosts.allow, with the ACL stored, for example, in "/experiment/security/mhttpd hosts".

Any thoughts?

K.O.
380	22 May 2007	Stefan Ritt	Bug Report	analyzer_init called by odb_load
> Thanks for the quick reply, Stefan.
>
> Please don't change anything in the code unless you find it really important. I guess
> changing the analyzer_init prototype will break a lot of code out there?
>
> In fact, I think I do understand this behavior now.
> And even without your suggested fix there is a simple workaround: I add a static
> variable to my analyzer_init.cxx file, and do something similar to your bFirst fix.
>
> In conclusion, commit your fix if it does not harm others. Postpone this commit to a
> future new version of midas which breaks a lot of things anyway...
>
> A last question, for me to understand: Why not call db_open_record in
> ana_begin_of_run then?

I fully agree with you that db_open_record would better go into ana_begin_of_run (and analyzer_init not being called in odb_load), and I fully agree with you that changing the code would break many experiments. ;-)

So I guess we leave it as it is right now as you suggested.
379	22 May 2007	Randolf Pohl	Bug Report	analyzer_init called by odb_load
Thanks for the quick reply, Stefan. Please don't change anything in the code unless you find it really important. I guess changing the analyzer_init prototype will break a lot of code out there? In fact, I think I do understand this behavior now. And even without your suggested fix there is a simple workaround: I add a static variable to my analyzer_init.cxx file, and do something similar to your bFirst fix. In conclusion, commit your fix if it does not harm others. Postpone this commit to a future new version of midas which breaks a lot of things anyway... A last question, for me to understand: Why not call db_open_record in ana_begin_of_run then? Cheers, Randolf
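
The static-variable workaround mentioned above could look roughly like the sketch below. It is an assumption-laden illustration, not code from any real analyzer: the function is named analyzer_init_sketch() to make clear it is a stand-in, plain int and 1 are used in place of the midas INT type and success code, and init_done and work_buffer are hypothetical names.

```c
#include <stdlib.h>

/* Hypothetical file-scope state, standing in for the "static variable"
   added to analyzer_init.cxx. */
static int   init_done   = 0;     /* set once the one-time setup has run */
static void *work_buffer = NULL;  /* example of a one-time allocation */

/* Stand-in for analyzer_init(): safe to call once from the analyzer start-up
   and again after every ODB load. Returns 1 on success, 0 on failure. */
int analyzer_init_sketch(void)
{
   if (!init_done) {
      /* one-time global initialization */
      work_buffer = malloc(1024);
      if (work_buffer == NULL)
         return 0;
      init_done = 1;
   }

   /* Anything that must be redone after each ODB load (for example
      re-creating and re-opening ODB records) would go here. */
   return 1;
}
```

Repeated calls leave work_buffer untouched, so the allocation happens exactly once however many runs are analyzed.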
378	22 May 2007	Stefan Ritt	Bug Report	analyzer_init called by odb_load
The reason to call analyzer_init in odb_load is the following: Assume you run the analyzer offline, analyzing many files in series. Then assume that you have /Experiment/Run Parameters, which is actively used by the analyzer (like beam settings etc.). In this case you do a db_open_record() to map /Experiment/Run Parameters to the exp_param C structure. For this mapping to work, the ODB structure and the C structure have to be exactly the same.

Now assume that you changed your run parameters over time, like you added some comment later. Now you want to analyze several runs, some before and some after the modification. Both sets have a different structure in /Experiment/Run Parameters, which is a problem, since the compiled analyzer can only have a single C structure.

My "poor" solution was to call analyzer_init after each loading of the ODB from the *.mid file. The db_create_record() call matches the C structure to the ODB structure by modifying the ODB structure if necessary. So if you added one parameter later, this (modified) structure gets loaded by odb_load, but then it gets adjusted in analyzer_init().

I understand now that this case might not happen so often, and you are more bothered by the fact that analyzer_init gets called several times. There must however be a hook for offline analysis so that the user code can correct the ODB structure. So I propose to add a flag to analyzer_init, such as

INT analyzer_init(BOOL bFirst)
{
}

If bFirst equals TRUE, the function got called from mana_init(), if FALSE, it got called from odb_load. Then you can put code like

INT analyzer_init(BOOL bFirst)
{
   if (bFirst) {
      p = malloc()
      ...
   }
}

If you agree, I will modify the code and commit the change.

- Stefan
377	22 May 2007	Randolf Pohl	Bug Report	analyzer_init called by odb_load
Hi,

I wonder why mana.c:odb_load() calls analyzer_init(). This way analyzer_init is called TWICE or more times: first from mana.c:mana_init(), for each invocation of the analyzer, and second from mana.c:odb_load(), for each run to be analyzed.

Isn't this a bug? It can mess up several things (like mallocs) if you don't take the necessary precautions. Other module_init functions are correctly called only once, before all runs are analyzed.

I have the feeling that odb_load should NOT call analyzer_init. Or am I wrong (probably, but please explain to me)? Do I have to live with it and make sure that my beautiful global initialization in analyzer_init is only done once? :-)

Cheers,
Randolf

And here is the annotated log using the ROOT example experiment (several modules changed/added to print their respective names)

:~/midas/examples/root> ./analyzer -e exa_root -i run%05d.mid -r 1 3
analyzer_init <-- ok
Root server listening on port 9090...
adc_calib_init <-- ok
adc_summing_init <-- ok
scaler_init <-- ok
Running analyzer offline. Stop with "!"
Set run number 1 in ODB
Load ODB from run 1...
analyzer_init <-- not ok, or is it?
OK
run00001.mid:777 events, 0.00s
Set run number 2 in ODB
Load ODB from run 2...
analyzer_init <-- not ok, or is it?
OK
run00002.mid:7227 events, 0.03s
Set run number 3 in ODB
Load ODB from run 3...
analyzer_init <-- not ok, or is it?
OK
run00003.mid:13866 events, 0.06s
adc_calib_exit
adc_summing_exit
scaler_exit
analyzer_exit
376	21 May 2007	Konstantin Olchanski	Info	mhttpd changes to use /History/Tags data
I am slowly committing the changes to the history code. This installment adds code to mhttpd to use the /History/Tags data (to be) generated by the mlogger. In a nutshell, the logger fills /History/Tags to "remember" what events, variables and tags exist in the history files. This replaces the old code that attempts to guess the contents of history files by looking at the /Equipment tree. To ease the transition to the new system, I am leaving all the old code alive and active in the absence of "/History/Tags" entries. As soon as one starts using the new mlogger (to be committed), the new tags-based mhttpd code will activate itself. K.O.
375	14 May 2007	Carl Metelko	Forum	Splitting data transfer and control onto different networks
Hi, thanks for the advice. We do have dual core Xeons so we'll try running most things on the server. Unless it proves to be a problem we'll run all MIDAS signals on one network and NFS etc. on the other. I do have one more query about running systems like Konstantin. What we would like to do is have a 'mirror' server serving multiple online monitoring machines, so that the load on the server is constant no matter the demands on the mirror. Is there a way to set this up? Or would it be best to have a remote analyser making short (1 min) root files shared with the online monitoring?
374	10 May 2007	Konstantin Olchanski	Info	RHEL5/SL5 success!
FWIW, I am running latest 32-bit MIDAS on an AM2 dual core AMD machine under 64-bit SL5. Everything seems to work correctly. K.O.

P.S. For the record, the compiler produces two sets of warnings:
- warning: pointer targets in passing argument 3 of ... differ in signedness
- warning: dereferencing type-punned pointer will break strict-aliasing rules
(I do not understand the meaning of the second warning. type-punned pointer, huh?)

K.O.
373	10 May 2007	Konstantin Olchanski	Bug Fix	Fix error reporting from cm_transition()
For some time now, error reporting from cm_transition() was broken. A typical symptom: when a run was started from mhttpd and a transition error occurred, the run did not start (good), but the user was presented with the message "Success" in big letters (confusing the user). Part of the problem was caused by user-written frontends that return an empty error string. Code in cm_transition() now detects this and shows the numeric value of the error status returned by the frontend. This is fixed in revision 3681. The error string "Success" is now returned only when cm_transition() was successful, and other error reporting inside this function was cleaned up. K.O.
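
As a rough illustration of the behaviour described above (and not the actual cm_transition() code), the fallback for an empty error string could be sketched as follows. The function transition_message(), the constant SKETCH_SUCCESS and the wording of the fallback text are all hypothetical.

```c
#include <stdio.h>

#define SKETCH_SUCCESS 1   /* hypothetical status code meaning "transition succeeded" */

/* Build the message shown to the user after a transition attempt.
   "Success" is produced only for a successful status; an empty or missing
   error string from the client falls back to the numeric status. */
const char *transition_message(int status, const char *client_error,
                               char *buf, size_t size)
{
   if (status == SKETCH_SUCCESS)
      snprintf(buf, size, "Success");
   else if (client_error != NULL && client_error[0] != '\0')
      snprintf(buf, size, "%s", client_error);
   else
      snprintf(buf, size, "Unknown error, status %d", status);

   return buf;
}
```

The point of the sketch is the ordering of the checks: the status decides whether "Success" may appear at all, and the error text is only trusted when it is non-empty.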
372	10 May 2007	Konstantin Olchanski	Bug Fix	mhttpd: fix broken boolean arrays in "edit on start"
For some time now, boolean arrays did not work correctly in "/experiment/edit on start". This is now fixed in rev 3680. K.O.
371	09 May 2007	Konstantin Olchanski	Forum	Splitting data transfer and control onto different networks
> I'm setting up a system with two networks with the intention of having
> control info (odb, alarm) on the 192.168.0.x
> and the frontend readout on 192.168.1.x

We have some experience with this at TRIUMF: for the TWIST experiment we run the main data-generating frontends on a private network. It is a supported configuration and it works fine.

We ran into one problem after adding some code to the frontends for stopping the run upon detecting some data errors. Stopping runs requires sending RPC transactions to every midas client, so we had to add static network routes for routing packets between midas nodes on the private network and midas nodes on the normal network.

> I'm also trying to separate processes onto different machines, is there
> any way to not have mserver,mhttpd and (mlogger,mevt) all run on the same machine?

mserver runs on the machine with the ODB shared memory by definition (think of it as "nfs server").

mhttpd typically runs on the machine with the ODB shared memory and until recently it had no code for connecting to the mserver. I recently fixed some of it, and now you can run mhttpd in "history mode" through the mserver. This is useful for offloading the generation of history plots to another cpu or another machine. In our case, we run the "history mhttpd" on the machine that holds the history files.

mlogger could be made to run remotely via the mserver, but presently it will refuse to do so, as it has some code that requires direct access to midas shared memory. If data has to be written to a remote filesystem, the consensus is that it is more efficient to run mserver locally and let the OS handle remote filesystem access (NFS, etc).

All other midas programs should be able to run remotely via the mserver.

K.O.
370	09 May 2007	Stefan Ritt	Forum	Splitting data transfer and control onto different networks
Hi Carl,

so far I did not experience any problems running odb&alarm on the same link as the readout, since the data usually goes frontend->backend, and all other messages backend->frontend. So before you do something complicated, try it first the easy way and check if you have problems at all. So far I don't know anybody who did separate the network interfaces, so I have no description for that.

You can however separate processes. The easiest is to buy a multi-core machine. If you however want to use separate computers, note that receiving events over the network is not very optimized. So you should run mserver connected to the frontend, the event builder and mlogger on the same machine. mhttpd can easily live on another machine, but there is not much CPU consumption from that (unless you plot long history trends). Running mserver, the event builder and mlogger on the same machine (dual Xeon mainboard) gave me easily 50 MB/sec (actually disk limited), and the CPUs were not both near 100%. If you put any receiving process (like the event builder or mlogger or the analyzer) on a separate machine, you might see a bottleneck on the event receiving side of maybe 10 MB/sec or so (never really tried recently).

Best regards,
Stefan

> Hi,
> I'm setting up a system with two networks with the intention of having
> control info (odb, alarm) on the 192.168.0.x
> and the frontend readout on 192.168.1.x
>
> Is there any easy way of doing this?
> I'm also trying to separate processes onto different machines, is there
> any way to not have mserver,mhttpd and (mlogger,mevt) all run on the same
> machine?
> Thanks,
> Carl Metelko
369	09 May 2007	Carl Metelko	Forum	Splitting data transfer and control onto different networks
Hi, I'm setting up a system with two networks with the intention of having control info (odb, alarm) on the 192.168.0.x and the frontend readout on 192.168.1.x. Is there any easy way of doing this? I'm also trying to separate processes onto different machines, is there any way to not have mserver,mhttpd and (mlogger,mevt) all run on the same machine? Thanks, Carl Metelko
368	10 Apr 2007	Dan Gastler	Forum	Interrupt code for VME?
Hello, Is there any example code for using midas for interrupt driven data collection over VME? I am using a Struck SIS3100 PCI/VME setup to connect to my VME crate. Thanks, -Dan
367	09 Apr 2007	Konstantin Olchanski	Info	move history, elog and alarm functions into separate files
As approved by Stefan, I moved the history (hs_xxx), alarm (al_xxx) and elog (el_xxx) functions out of midas.c into separate files. Committed as revision 3665. This change should be transparent to all users. K.O.