ELOG Midas


Midas DAQ System, Page 108 of 161


3201 Entries


ID	Date	Author	Topic	Subject
1663	14 Aug 2019	Konstantin Olchanski	Bug Fix	incorrect recursion in ss_suspend() via the user event handler
The ROOTANA midas analyzer uncovered a problem with recursive use of ss_suspend().

When running in graphical mode, the ROOT graphics main event loop was calling ss_suspend(0) (no MSG_BM), which would sometimes call the user event handler (if a new event arrived in the midas event buffer). Because this loop was already running inside the user event handler, there was a crash.

This is the calling sequence leading to the incorrect recursion (from the system.cxx comments to ss_suspend()):

analyzer ->
-> cm_yield() in the main loop
-> ss_suspend(0)
-> MSG_BM message arrives in the UDP socket
-> ss_suspend_process_ipc(0)
-> cm_dispatch_ipc()
-> bm_push_event()
-> bm_push_buffer()
-> bm_read_buffer()
-> bm_dispatch_event()
-> user event handler
-> user event handler ROOT graphics main loop needs to sleep
-> ss_suspend(0) <--- should be ss_suspend(MSG_BM)!!!
-> MSG_BM message arrives in the UDP socket
-> ss_suspend_process_ipc(0) <- should be ss_suspend_process_ipc(MSG_BM)!!!
-> cm_dispatch_ipc() <- without MSG_BM, calling cm_dispatch_ipc() again
-> bm_push_event()
-> bm_push_buffer()
-> bm_read_buffer()
-> bm_dispatch_event()
-> user event handler <---- called recursively, very bad!

The proper fix for this is to always call ss_suspend(MSG_BM) from the user event handler and ss_suspend(MSG_ODB) from the user db_watch handler.

In this second case, ss_suspend(MSG_ODB) will lose/ignore/discard db_watch notifications. If you do not want that, call ss_suspend(0) and be ready for recursive calls to your db_watch handler. (This can happen in a frontend program that acts on changes in ODB and where some of these actions may require sleeping via ss_suspend()).

ss_suspend(MSG_BM) will discard MSG_BM messages, which is not a problem as bm_wait_for_events() and cm_yield() will immediately poll the event buffer and there will be no delay in receiving new events.

Until commit 99d6e90 there were problems in ss_suspend(MSG_BM) and recursive calls to the user event handler were still possible. It is now fixed in this and the previous commits.

K.O.
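The effect of the message mask can be illustrated with a toy model. This is only a sketch and does not use any MIDAS code: ToySuspend, kMsgBm, pending_bm and max_nesting() are hypothetical names invented for the illustration, and the "socket" is just a counter of queued notifications. It shows that a handler which sleeps with mask 0 gets re-entered, while a handler which sleeps with the buffer-message mask does not.

```cpp
#include <cassert>

// Hypothetical flag standing in for the MSG_BM mask discussed above.
constexpr int kMsgBm = 1;

// Toy model of the suspend/dispatch loop; not the MIDAS implementation.
struct ToySuspend {
    int inner_mask;      // mask the user handler passes when it needs to sleep
    int depth = 0;       // current nesting depth of the user event handler
    int max_depth = 0;   // deepest nesting observed so far
    int pending_bm = 0;  // queued "new event in buffer" notifications

    explicit ToySuspend(int mask) : inner_mask(mask) {}

    // Stand-in for ss_suspend(): message types set in ignore_mask are
    // discarded instead of being dispatched to the user handler.
    void suspend(int ignore_mask) {
        while (pending_bm > 0) {
            --pending_bm;
            if (ignore_mask & kMsgBm)
                continue;            // discard the notification
            user_event_handler();    // dispatch it
        }
    }

    // Stand-in for a user event handler that sleeps, as a graphics loop would.
    void user_event_handler() {
        ++depth;
        if (depth > max_depth)
            max_depth = depth;
        suspend(inner_mask);
        --depth;
        assert(depth >= 0);
    }
};

// Queue two notifications, run the main-loop suspend with mask 0 and
// report how deeply the handler ended up nested.
int max_nesting(int inner_mask) {
    ToySuspend s(inner_mask);
    s.pending_bm = 2;
    s.suspend(0);
    return s.max_depth;
}
```

In this model max_nesting(0) reports a nesting depth of 2 (the handler was re-entered), while max_nesting(kMsgBm) reports 1. The discarded notification is harmless in the real system only because, as noted above, the event buffer is polled again right away.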
1664	14 Aug 2019	Konstantin Olchanski	Bug Report	ROOTANA bug?
> - ss_suspend_set_dispatch_ipc(NULL);
> + // ss_suspend_set_dispatch_ipc(NULL);
>
> This compiles and at least runs for me; so maybe that is helpful for you. But Konstantin will provide a longer term solution.

I now understand why this fix worked.

Around the December 2018 timeframe, I reworked the MIDAS event buffer code and one improvement was to only send UDP buffer notifications if somebody is waiting for them. This probably reduced to zero the probability of recursive calls to the user event handler - the problem originally fixed by the monkey work against the midas ipc handler.

After looking at it, I now understand that the correct solution is to call ss_suspend(MSG_BM), but it turns out that inside MIDAS, handling of MSG_BM was incomplete and recursive calls to the user event handler were still possible (but most likely not actually happening anymore because of those changes to the event buffer code).

So:
a) ss_suspend(MSG_BM) inside midas now works correctly, recursive calls to the user event handler will not happen.
b) TMidasOnline::sleep() now calls ss_suspend(MSG_BM), monkey business with ss_suspend_set_dispatch_ipc() is removed.

The problem of recursive calls to the analyzer event handler is now fixed in both rootana and manalyzer (both use the same TMidasOnline code).

Read more about this here: https://midas.triumf.ca/elog/Midas/1663

K.O.
1685	16 Sep 2019	Konstantin Olchanski	Forum	Open a hotlink to a single element in an ODB array
> Is it possible to open a hotlink to a single element in an ODB array? Not possible. > sprintf(element, "%s[%d]", path, i); > db_find_key(hDB, hv_info->hKeyRoot, element, &hKey); There is some confusion and inconsistency between db_xxx() functions, some of them accept the array index "a[10]" syntax, some do not. db_find_key() and db_watch()/db_open_record() do not operate on array elements and do not accept the "a[10]" array index syntax. K.O.
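Since some functions do not accept the "a[10]" syntax, one option is to split the string yourself and pass the plain key path and the index separately. The helper below is only a sketch: split_array_index() is a hypothetical name, it is not part of MIDAS, and it handles only a single trailing "[N]" with a non-negative decimal index.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <utility>

// Split "path[N]" into {"path", N}. If there is no well-formed trailing
// "[N]", return the input unchanged together with index -1.
std::pair<std::string, int> split_array_index(const std::string& s) {
    const std::size_t open = s.rfind('[');
    if (open == std::string::npos || s.back() != ']')
        return {s, -1};
    assert(open + 1 <= s.size());
    // Characters strictly between '[' and the final ']'.
    const std::string digits = s.substr(open + 1, s.size() - open - 2);
    if (digits.empty() || digits.size() > 9)  // size limit avoids int overflow
        return {s, -1};
    int index = 0;
    for (char c : digits) {
        if (c < '0' || c > '9')
            return {s, -1};
        index = index * 10 + (c - '0');
    }
    return {s.substr(0, open), index};
}
```

The path part can then go to the function that wants a key without an index, and the index can be used when reading the array contents.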
1686	16 Sep 2019	Konstantin Olchanski	Bug Report	https redirect and ODB access
> I'm not sure if these issues are related or not, but I'm getting an error > message when I want to access the root of the ODB via the webserver: > [mhttpd,ERROR] [mhttpd.cxx:563:rread,ERROR] Cannot read file '/root', read of > 4096 returned -1, errno 21 (Is a directory) This is an old bug. It was part of the "custom path" confusion. Fixed (I think) in all midas-2019 releases. To confirm, which version are you using (run "odbedit ver" or look on the mhttpd "help" page)? If you have an older version, I recommend that you update to midas-2019-03 (cd midas; git pull; git checkout midas-2019-03; make clean; make). If you feel adventurous, you can also update to the head of the development version and see all the new features (cmake, c++11, new history pages). If you do not feel adventurous, wait until we have midas-2019-09 ready, use midas-2019-03 until then. K.O.
1687	16 Sep 2019	Konstantin Olchanski	Info	New history plot facility
> I see currently quite often is the error hs_read_arraybuffer (see the attachement). > Are there ways to get a log which would document where the problems start? > [also crash of mhttpd] We can debug it from both ends, javascript and mhttpd: On the web page, the error message says "see javascript console", do you see anything there? Or the tab is so hung-up that you cannot even access the console? In this case, can you open the console before running your test? In some browsers (firefox, google-chrome) this will also activate the javascript debugger and as likely as not will make the bug go away (ouch!) On the mhttpd side, please capture the stack trace from the crash: enable core dumps (ODB "/experiment/enable core dumps" set to "y", after the crash, run "ls -l core.*; gdb mhttpd core.9999") or run mhttpd inside gdb or attach gdb to a running mhttpd (gdb -p 9999). Once in gdb, run "info thr" to list all threads, "thr 0; bt", "thr 1; bt", etc to get stack traces from all threads, only one of them contains the crash (tedious!). Email me the stack trace (or post here), in case we want to look at values of any variables from the crash, keep the core dump and do not rebuild mhttpd. K.O.
1688	16 Sep 2019	Konstantin Olchanski	Info	New history plot facility
> During my visit at TRIUMF we rewrote the history plotting functionality of midas. This is a most amazing achievement. We wanted to do this "for years" and I think we have benefitted greatly from the delay - tools available for building interactive web graphics have improved so much so recently. For example, delivering binary data from mhttpd to javascript (avoiding json encoding and decoding saves tons of CPU cycles) went from "how do I do this?!?" to "I did it in only 3 hours!". > We are now in a state where this is still work in progress, but already at this stage it might > be useful for others to report any feedback. The old gif-based history plots took a lot of effort and a long time to get where they work well for most experiments and where we are happy with them. From the TRIUMF side of things, lots of polishing of the graphics and of the user interface came through use at our bigger experiments - TWIST (TRIUMF), ALPHA (CERN), T2K/ND280 (Japan). So, much improvement and polishing of the new graphics is still ahead for us. > Simply upgrade the the newest develop branch of midas, and you will see two menu items > "OldHistory" which is the old system and "History" which is the new system. I hope to start the new release branch for midas-2019-09 soon. For the release, we will try to have both the old and the new history graphics to integrate smoothly. The old graphics still has to work well, as some users may prefer the old graphics and the old user interface. Also the new system is still incomplete, i.e. there is no trivial way to save a history plot into a file: > Following items are planned, but not yet implemented: > - Printing of run markers as in the old history > - Export / Printing / Sending to ELOG any history plot K.O.
1689	16 Sep 2019	Konstantin Olchanski	Forum	History plot problems for frontend with multiple indicies
> My first question would be why are you using several front-ends at all? That makes things more
> complicated than needed. In the normal FE framework, you can define either several equipment
> served by one frontend, or even one equipment linked to several devices.

I am the culprit here, as I wrote the original code for T2K/ND280 that Nick is looking at.

At the time, we needed to control multiple units of identical equipment. Most of these equipments needed to be controlled independently from each other, so we could not/did not want to use one single frontend executable to control all of them at the same time. For example, for equipment not in use, we can stop the corresponding frontend. In case of trouble, we can restart the corresponding frontend without disrupting the frontends for the other equipments.

The successful operation of the T2K/ND280 experiment is sufficient defence for the validity of this approach.

One lesson learned was that the MIDAS frontend framework did not make it easy to have multiple identical frontends for controlling multiple identical equipments (typical use is control of 2-3 Wiener power supplies, 1-2-3 UPS devices, etc).

At the time (and today), only the "-i NNN" flag is available to tell the frontend "who am I?". To make it work, one has to use the hard to "%02d" stuff in the equipment name, and there are other complications.

For my "next generation" of frontends, I tried to specialize the frontend executables at compile time using C/C++ preprocessor defines (-Dwiener01, -Dwiener02, etc). This worked better, but I am still not super happy with it.

My current solution, as implemented by the tmfe frontend framework, is to give the user full control over the command line arguments (mfe.c did not permit any "user arguments" and did not allow access to argc/argv) and full control over the equipment names (mfe.c equipment names are fixed at compile time).

K.O.
1691	16 Sep 2019	Konstantin Olchanski	Forum	History plot problems for frontend with multiple indicies
> it's probably better to run a multi-threaded setup, than individual frontends. I recommend against using multiple threads if at all possible and unless absolutely required. Only for one reason: multithreaded c++ programs are notoriously hard to debug. In addition, one has to face several classes of bugs absent in single-threaded applications: a) which thread "owns" which object b) locking of all shared data c) huge overheads from locking at high data rates (a performance bug) d) correct locking order, dead locks, live locks e) incomprehensible core dumps and stack traces f) race conditions To control 2 power supplies, run 2 frontend programs, 1 per power supply. To control 64 frontend cards, run 1 frontend with many threads: 64 (per device) + 1 (main thread) + 1 (RPC handler) + 1 (watchdog) + 1 (common event generator/data transmitter) + 1 (odb/web page status update). You will bump into each and every one of the problems (a) to (f) above. K.O.
1692	16 Sep 2019	Konstantin Olchanski	Forum	History plot problems for frontend with multiple indicies
> thanks for your reply. I can confirm that your suggested workaround does indeed > make the problem dissapear. > I guess this issue hasn't been seen at T2K since we use MYSQL for the history. I think you found the source of the problem, confused event id assignments. To confirm, can you email me (or post here) the output of odbedit "ls -l /History/Events". If that's the problem, you can avoid it completely by switching to a history storage method that does not rely on magic mapping between equipment names and numeric event id's: try the "FILE" method (set odb /Logger/History/FILE/Active to "y", restart the logger) or the "MYSQL" method (you will need to setup a mysql database). You tell mhttpd and mhist which history data to read by setting ODB /History/LoggerHistoryChannel to one of the channel names from /logger/history/, restart mhttpd. (mhttpd and mhist used to print a message "reading history data from channel XXX", but somebody removed this message). K.O.
1695	17 Sep 2019	Konstantin Olchanski	Info	New history plot facility
> > On the mhttpd side, please capture the stack trace from the crash
>
> here comes the stack trace (only happens when using safari 12.1.2 macOS 10.14.6):
>
> #10 0x000000000041ce0f in check_digest_auth ...
>

The crash is in check_digest_auth() which checks the mongoose web server password (if not using password protection from the https proxy i.e. apache httpd). If so, you should see this crash on all pages, not just when you access history pages, yes?

Ok, I just checked, my safari is "Version 12.1.2 (13607.3.10)" and I see no immediate crash, even on history pages. But I am on macOS 10.13.6, maybe that makes a difference.

If you see the safari crash on all pages, then it is not history-specific. In this case, I would like you to file a bug report on bitbucket "mhttpd crash with safari" and we will follow up on it there.

K.O.
1696	17 Sep 2019	Konstantin Olchanski	Forum	History plot problems for frontend with multiple indicies
> [local:e666:S]History>ls -l /History/Events
> Key name                        Type    #Val  Size  Last Opn Mode Value
> ---------------------------------------------------------------------------
> 1                               STRING  1     10    2m   0   RWD  FeDummy02
> 0                               STRING  1     16    2m   0   RWD  Run transitions

Something is very broken. There should be more entries here, at least there should be entries for "FeDummy01" and usually there is also an entry for "FeDummy" because one invariably runs fedummy without "-i" at least once.

The fact that changing from "midas" storage to "file" storage makes no difference also indicates that something is very broken.

I want to debug this.

Since you tried the "file" storage, can you send me the output of "ls -l mhf.dat" in the directory with the history files? (It should have the ".hst" files from the "midas" storage and "mhf*.dat" files from the "file" storage.)

K.O.
1704	27 Sep 2019	Konstantin Olchanski	Bug Fix	improvement for midas web page resource use
I noticed that midas web pages consume an unexpectedly large amount of resources, as observed by the chrome browser "task manager" and by other tools.

For example, the size of the "status" page was observed to reach 200, 600 and even 900 Mbytes. The "programs" page (which does not have nearly as much stuff as the status page) was observed to reach 200-600 Mbytes. For comparison, the New York Times front page, which has much more stuff, usually runs at about 200 Mbytes (they do force a periodic full page reload, to deal with exactly this same type of trouble, I suspect).

Also I observed the midas web pages consume an unusual amount of CPU - 5-10-15% - all in inactive tabs in minimized windows.

All this was quite noticeable on my oldish mac laptop with only 8 GBytes of RAM.

Using the google-chrome performance analyzer I was able to identify the reason for the high memory use - our 1/sec periodic updates leak "too many" DOM "nodes" and I suspect that due to throttling of inactive tabs, the garbage collector simply does not keep up with us.

(Note that javascript features automatic memory management with garbage collection. In practice it means that where in C/C++ we have malloc() and free(), in javascript we only have malloc() and no free(), and cannot explicitly release memory we know we no longer need. In the C/C++ sense, all memory allocations are leaked, and one relies on a janitor to "clean it all up" eventually, later).

The source of node leakage was unexpected (unexpected to me). It turns out that each assignment to e.innerHTML creates a new node, even if the new contents is the same as the old contents (also the html parser has to run, consuming extra cpu cycles).

The obvious solution is to write code like this:

    if (v !== e.innerHTML) { e.innerHTML = v };

This helped quite a bit on the "programs" page, but not as much as expected, and hardly at all on the "status" page.

It turns out that a read of innerHTML does not necessarily return the same string as was written into it. For example, if "v" is "a&b", e.innerHTML will return "a&amp;b" and the comparison will misfire. There are more cases like this, see the section "Test set and get e.innerHTML" on the "example" midas page.

To help deal with this, I suggest that instead of the "inline" comparison (as above), one writes this:

    mhttpd_set_if_changed(e, v);

Then, to check that the comparison is effective, go to mhttpd.js and uncomment the console.log() call in mhttpd_set_if_changed(), reload the page and look at the javascript console to see all calls that result in assignment of innerHTML (and leakage of DOM nodes).

This done, after replacing many "&" with "&amp;" and many "\'" with "\"", node leakage on the "programs" page was reduced to 1 node per 1/sec update: the unavoidable change to the timestamp on the top-right of the page.

Luckily, Stefan pointed me to the solution for this: use of e.firstChild.data instead of e.innerHTML. The only quirk is that the node should not be empty, which was easy to arrange by setting the initial value of the timestamp to a dummy value.

With these changes, the "programs" page (and most other pages) now leak 0 nodes (from the 1/sec periodic updates). There is still some small memory leakage from making the RPC requests and from receiving the RPC replies, but the garbage collector seems to have no trouble with them.

Typical memory use for all midas pages is now 50-60 Mbytes (down from 100-200 Mbytes).

The "status" page took a bit more work to fix due to its curious coding, but it, too, now uses 50-60 Mbytes. It still leaks quite a few nodes (to be fixed!), but the garbage collector seems to keep up with the allocations.

K.O.
1705	27 Sep 2019	Konstantin Olchanski	Bug Fix	improvement for midas web page resource use (alarm sound)
> I noticed that midas web pages consume unexpectedly large amount of resources, as observed by the chrome browser
> "task manager" and by other tools.
>
> For example, size of the "status" page was observed to reach 200, 600 and even 900 Mbytes.
> [this was fixed by using set_if_changed(e, v)]
>
> Also I observed the midas web pages consume an unusual amount of CPU - 5-10-15% - all in inactive tabs in minimized
> windows.
>

The case of high CPU use turned out to be quite nasty.

The symptoms:
- the "programs" page in an inactive tab in a minimized window sits "doing nothing" for a day or two.
- it uses about 0 to 0.1 to 1% CPU and 40-50-60 Mbytes of RAM (after the previous improvements)
- suddenly I see it use 10-15-20% CPU, continuously, non stop
- I open this tab
- suddenly, CPU use goes to 100%, memory use quickly grows from 40-50-60 Mbytes to 100-200 Mbytes.
- after a few seconds everything settles down, CPU use is back to 0-0.1-1%, but memory use does not go down.
- WTH?!?

The culprit turned out to be the playing of the alarm sound. (I have all tabs "muted" by default, also speakers usually powered down).

If I comment-out the playing of the alarm sound, this problem goes away completely. Pretty conclusive, I think.

After adding lots of debug console.log() calls, I think I identified the problem: audio objects were being created, but they were not starting to play their sound files. When I opened the tab, all of them (about 400) loaded the mp3 file at the same time (resulting in memory use going from 50 Mbytes to 190 Mbytes, typical) and started playing (as seen in the audio event activity in the cpu profile traces from the google-chrome "performance" tool).

I think I am looking at an unexpected interaction between audio objects and google-chrome throttling of inactive tabs.

To muddy the waters some more, google-chrome periodically fails audio.play() with an exception to the effect of "we will not play audio because user is not interacting with this page enough". See https://bitbucket.org/tmidas/midas/issues/191/exception-on-audioplay

Now I think I have this sort of fixed. I have to handle the audio.play() failure (which is not a normal exception, but a rejected promise, the handler is quite different), and I do not allow creating new audio objects if the previous audio object did not finish playing. (Note the "normal" timing: periodic update every 1 sec, playing of alarm sound every 60 seconds, length of alarm sound file is 3 sec, so two sound files should never overlap. Now a console.log message is printed if overlap is detected.)

This leaves us with the problem of the alarm sound not playing "because the user didn't interact with the document first", and I think there is nothing I can do about that.

K.O.

P.S. Another quirk I discovered: go to the "config" page and press the new buttons "play test sound" and "speak test message". In muted tabs, the test sound will not sound, but the test message will be shouted out loudly. This seems inconsistent to me. Unwanted audio/video ads are blocked but loud shouting of "shave with burma-shave" is permitted. I also wonder if speaking is subject to this "user did not interact" business. If not, we could replace the playing of our relaxing alarm beep with the yelling of "alarm! alarm! alarm!".

K.O.
1706	27 Sep 2019	Konstantin Olchanski	Release	midas-2019-09
I created the release branch for midas-2019-09 and tag midas-2019-09-a.

Since the previous release midas-2019-06, some news:
- new history graphics (Stefan)
- c++ frontend framework mvodb.h and tmfe.h merged from ALPHA-g (K.O.)
- we think we have all the fallout from switching to cmake and to c++11 sorted out

There are a number of known problems with the current code, see the bitbucket bug tracker: https://bitbucket.org/tmidas/midas/issues?status=new&status=open

Hopefully we can use this release as a baseline for more testing and with luck we will fix all the pending bugs and add all the pending missing code (the new sequencer web pages, the "m" analyzer, etc) quickly and our next release midas-2019-10 will be the best midas ever.

To obtain this release, either checkout the top of branch feature/midas-2019-09 (recommended) or checkout the tag midas-2019-09-a.

If you are using the last pre-cmake/c++ release midas-2019-03, I recommend that you stay with it until our next release midas-2019-10.

K.O.
1707	27 Sep 2019	Konstantin Olchanski	Forum	Open a hotlink to a single element in an ODB array
> I will try to use the db_watch function in the future.

Note that db_watch() and db_open_record() work exactly the same way, both only allow watching "whole" odb entries, you cannot watch individual array elements.

The db_watch() callback function gives you the array index of the array element that was changed and that fired the notification. But if you change many array elements quickly, you will not necessarily receive notifications for each and every one of them (the underlying transport is UDP, which allows loss of notification packets).

If you are watching 1 array element change at a slow rate (1/sec), db_watch() will work well.

Otherwise, you can watch the whole array and, in the db_watch() callback, read the new array contents, compare it with your saved copy of the previous array contents, identify which array elements have changed and go from there. (This method does not work if you do not actually change the array element values: change from "1" to "1", this is an old weakness in the midas hot link mechanism).

If you are not sure how to use db_watch(), look inside midas/progs/odbedit.cxx, search for db_watch() and for the db_watch() callback function.

K.O.
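The "compare with your saved copy" step can be sketched as below. This is a minimal illustration, not MIDAS code: changed_indices() is a hypothetical helper, the element type is assumed to be int, and reading the array from ODB inside the callback is left out.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Return the indices at which the freshly read array differs from the
// previously saved copy. Both arrays are assumed to have the same length.
std::vector<std::size_t> changed_indices(const std::vector<int>& previous,
                                         const std::vector<int>& current) {
    assert(previous.size() == current.size());
    std::vector<std::size_t> changed;
    for (std::size_t i = 0; i < current.size(); ++i) {
        if (previous[i] != current[i])
            changed.push_back(i);
    }
    return changed;
}
```

After acting on the returned indices, the caller would store the current array as the new saved copy. Note that a write of "1" over "1" yields an empty result, which is the weakness mentioned above.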
1708	27 Sep 2019	Konstantin Olchanski	Suggestion	recover daq and hardware safety.
> We have encountered a safety issue with our HPGe HV and its midas frontend.

At TRIUMF and other labs the words "safety issue" have a very specific meaning and we tend to follow this guidance:

MIDAS is not certified for and is not intended for use with safety critical applications as defined here:

https://en.wikipedia.org/wiki/Safety-critical_system
> A safety-critical system ... malfunction may result in ... following outcomes:
> death or serious injury to people
> loss or severe damage to equipment/property
> environmental harm

If this is your case, you should use properly certified software and hardware. Safety officers at most institutions require certified hardware interlocks and other protections to prevent such undesirable outcomes. Use of certified PLCs is sometimes permitted.

But I suspect in your case, there is no "safety issue", you only want to protect some valuable but not critical equipment against accidental damage.

In this case, you can probably use midas, but if a midas malfunction may result in destroying your experiment (i.e. accidentally setting the wrong voltage on 3000 phototubes), you should also have hardware based protections (hardware limits on max/min high voltage). Most HV power supplies implement such protections (screw-driver actuated max voltage limits).

If there is danger of destroying your experiment, you should also have an independent review of your control system to avoid avoidable mistakes and obvious problems.

> Turning off or changing HV unknowingly has to be avoided at all costs

The function of changing high-voltage is implemented in your frontend program. Right in the place in this program where you transmit the voltage setting from ODB to the hardware is where you implement your protections (validate the voltage range, check that changing the voltage is permitted, etc).

This protects you against unexpected/incorrect/erroneous changes in ODB (wrong ODB is loaded, wrong values in ODB, ODB is corrupted, etc).

In addition, it is wise to set software based limits in the HV power supply (software controlled max high voltage, software controlled max current, etc). Most HV power supplies implement such functions.

To ensure high voltage cannot be changed at the wrong times, you can also implement procedural and hardware protections, such as unplugging the power supply control connection (usually an ethernet or serial or usb cable). Unplugging will also prevent you from monitoring the high voltage currents; the only solution to that is to use a power supply with a hardware "write protect" function (a key needs to be inserted and turned to allow changing anything).

All of this is generic and applies to any controls software, not just MIDAS.

Without at least some of these protections (especially protections in your frontend program), the questions you asked about loading ODB are insufficient.

K.O.
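The frontend-side check (validate the range, check that changes are permitted) might look roughly like this. It is a sketch under assumptions: HvLimits, HvCheck and check_hv_request() are hypothetical names, the limits are per-channel values chosen by the experiment, and the actual write to the hardware is not shown.

```cpp
#include <cassert>

// Hypothetical per-channel protection settings kept by the frontend.
struct HvLimits {
    double min_volts;
    double max_volts;
    bool changes_permitted;  // false while the channel is locked
};

enum class HvCheck { Ok, NotPermitted, OutOfRange };

// Decide whether a voltage request coming from ODB may be passed on to
// the hardware. The caller writes to the power supply only on Ok.
HvCheck check_hv_request(const HvLimits& limits, double requested_volts) {
    assert(limits.min_volts <= limits.max_volts);
    if (!limits.changes_permitted)
        return HvCheck::NotPermitted;
    // Written as a negated "in range" test so that NaN is rejected too.
    if (!(requested_volts >= limits.min_volts &&
          requested_volts <= limits.max_volts))
        return HvCheck::OutOfRange;
    return HvCheck::Ok;
}
```

Putting the check right next to the hardware write means it applies no matter how the bad value got into ODB.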
1709	27 Sep 2019	Konstantin Olchanski	Bug Report	lazylogger in cmake & max_event_size
> The compile option -DHAVE_FTPLIB checked in mdsupport.cxx disappeared if you
> compile with cmake.

Hi, Stefan - do we still need to support FTP in the logger?

In the lazylogger, special support for FTP is not needed, users can use the "script" method and do FTP without our help.

I move to remove FTP support from MIDAS. (second? other opinions?)

> Our MAX_EVENT_SIZE is set in the odb to 805306368. This number is also used in
> this is to big when copying with ftp, causing a crash. Reducing it here with a
> factor 10 solves our problems.

I am surprised that changing MAX_EVENT_SIZE (to a "too big" value) causes lazylogger to crash. More usually MAX_EVENT_SIZE has no effect until you try to write an event that is somehow "too big", then there is a crash. Perhaps there is a bug specifically in the FTP code.

Anyhow, I recommend the solution of using the "script" method. We have example lazylogger scripts in midas/progs/lazy*.perl (the scripts do not have to be in perl, python is ok). We do not have any example that uses FTP because we do not use FTP for data storage. But you can easily adapt lazy_test.perl and lazy_copy.perl to use scp and sftp, the secure versions of FTP.

K.O.
1710	27 Sep 2019	Konstantin Olchanski	Forum	History plot problems for frontend with multiple indicies
We should fix this for midas-2019-10. https://bitbucket.org/tmidas/midas/issues/193/confusion-in-history-event-ids K.O. > Hi Konstantin, > > > > [local:e666:S]History>ls -l /History/Events > > > Key name Type #Val Size Last Opn Mode Value > > > --------------------------------------------------------------------------- > > > 1 STRING 1 10 2m 0 RWD FeDummy02 > > > 0 STRING 1 16 2m 0 RWD Run transitions > > > > Something is very broken. There should be more entries here, at least > > there should be entries for "FeDummy01" and usually there is also an entry > > for "FeDummy" because one invariably runs fedummy without "-i" at least once. > > This is a fresh experiment that I started just to test this this issue, that is why there are not many > entries in /History/Events. I agree though that we should expect to see a FeDummy01 entry. > > > The fact that changing from "midas" storage to "file" storage makes no difference > > also indicates that something is very broken. > > > > I want to debug this. > > > > Since you tried the "file" storage, can you send me the output of "ls -l mhf.dat" in the directory > > with the history files? (it should have the ".hst" files from the "midas" storage and "mhf.dat" > files > > from the "file" storage. > > When I started this experiment yesterday(?) I disabled the Midas history when I enbled the file > history. Jsut now I reenabled the Midas history, so they are currently both active. 
> > % ls -l work/online/{.hst,mhf*.dat} > -rw-r--r-- 1 hastings hastings 14996 Sep 17 10:21 work/online/190917.hst > -rw-r--r-- 1 hastings hastings 3292 Sep 18 16:29 work/online/190918.hst > -rw-r--r-- 1 hastings hastings 867288 Sep 18 16:29 work/online/mhf_1568683062_20190917_fedummy01.dat > -rw-r--r-- 1 hastings hastings 867288 Sep 18 16:29 work/online/mhf_1568683062_20190917_fedummy02.dat > -rw-r--r-- 1 hastings hastings 166 Sep 17 10:17 > work/online/mhf_1568683062_20190917_run_transitions.dat > > And again, just as a sanity check: > > % odbedit -c 'ls -l /History/Events' > Key name Type #Val Size Last Opn Mode Value > --------------------------------------------------------------------------- > 1 STRING 1 10 1m 0 RWD FeDummy02 > 0 STRING 1 16 1m 0 RWD Run transitions > > Regards, > > Nick.
1711	27 Sep 2019	Konstantin Olchanski	Forum	mhttpd start and stop redirect to Transition page
> I recently upgraded to MIDAS version midas-2019-06-b. I had to make a few changes
> to get our custom page running again, but am a little confused on starting and
> stopping runs.

So far so good.

> When I click on my "Start" button, it now redirects to a
> Transition page rather than reloading the status page.

Are you sure? The "start" button redirects to the "start" page (start.html) which redirects to the "transition" page (transition.html), which does not redirect anywhere so you can see the result of the transition.

> Could someone explain the reasoning for the current behavior?

It's been like this for years now. Stefan suggested that we implement the "start" page and the "transition" page as overlays on top of the status page, but it has not happened yet.

> Furthermore my "Stop" button is now broken with the following error:
> Error: Invalid URL "CS/EngeRun&" or query "cmd=Stop&redir=EngeRun%26" or command "Stop"

I grep for "EngeRun" and I do not see it anywhere in the midas sources. Can you grep for it to see if it is coming from one of your pages?

If you want to start/stop runs from your custom page, look at start.html and transition.html - you will need to make the run transition RPC calls (cut-and-paste the code to your page) and (obviously) you will not have any redirects to some strange pages.

> For example, start calls:
> location.search = "cmd=Transition";
> whereas stop does:
> mhttpd_goto_page("Transition"); // DOES NOT RETURN

It's the same thing, look at mhttpd_goto_page().

> Can anyone offer any insights or advice? I can change the former to "cmd=Status", but
> the latter doesn't allow it.

I am not sure what you are trying to do. If you need the "start" button on the status page to do something different from what it does now, just hack status.html until it does so. If you need some specific help with that, I am happy to help.

I think I answered all questions you asked so far.

K.O.
1715	29 Sep 2019	Konstantin Olchanski	Suggestion	recover daq and hardware safety.
> > The issue occurs when e.g. one channel can not be turned on and ramp for some temp/specific
> reason, and someone else is working on the daq and reloads the odb for e.g. 1h ago.
>

So you want to ensure that some HV channels are turned off and stay turned off. Yes?

The most effective solution will depend on the consequences of unwanted turning-on of your channels:

- if hardware is destroyed if turned on - I think you should have a hardware lock-out (unplug the HV cable).
- if hardware malfunctions and will degrade if left turned on for a long time (i.e. a hot phototube or sparking wire chamber) - your data monitoring software should detect the anomaly (it will show up as a hot channel, dead channel, etc) and the people running the experiment will realize the mistake and turn the channel back off. Hardware monitoring (HV currents, etc) should also detect this, with the same effect.
- if collected data becomes useless (the turned-on channel makes big noise in all other channels), then same thing, your data monitoring should catch it.

The next consideration is what you are protecting against:

a) one person flags a channel as defective, turns it off, the next person knows nothing, turns it back on - you need to work on documentation, shift hand-off and other human-level procedures
b) people running the experiment load random odb files - same thing, from human-level procedures and documentation it should be made clear which odb files are correct and which should not be used
c) software malfunction (not a human) causes a data change in odb, which causes a turned-off channel to turn back on
d) hardware malfunction causes a turned-off channel to turn back on (HV power supply hardware or firmware malfunctions and decides that all channels should be turned on at maximum high voltage)

In the experiments I am most familiar with, problem (b) is avoided by never loading/reloading odb files directly, most/all interaction with the experiment is done through web pages, and these web pages are carefully coded to be safe against most user mistakes.

Cases (a), (b) and (c) you can protect against by changing the frontend code to refuse to turn on some channels:

    int set_hv(int channel, int voltage)
    {
       if (channel == 35)
          return COMMAND_REFUSED;
       write_to_hardware(channel, voltage);
       return COMMAND_SUCCESS;
    }

But in reality this solution only creates problem (e):

e) people running the experiment start random versions of the frontend program, make random changes to the frontend source code, multiple people working on the frontend have their own personal versions/copies of the source code, etc.

This is the worst-case scenario, meaning the experiment lost control of software configuration, and even basic software version control tools (like svn or git) are not being used.

If your experiment gets that chaotic, all protections are likely to be ineffective - documentation will not work (people will ignore post-it notes "do not turn on!"), hardware protections will not work (an unplugged cable labeled "do not plug in!" will be plugged back in and powered), etc.

good luck, then.

K.O.

