ELOG Midas

Midas DAQ System, Page 79 of 162

3222 Entries

ID	Date	Author	Topic	Subject
224	19 Sep 2005	Konstantin Olchanski	Info	Added driver for the Wiener CC-USB CAMAC interface
Committed to CVS is the preliminary driver for the Wiener CC-USB CAMAC interface. The driver implements all the mcstd.h CAMAC access functions, except for those not supported by hardware (8-bit operations, interrupts) and a few esoteric functions not implemented in any other CAMAC driver. The driver uses the musbstd.h library to access USB, also committed in preliminary form.

Affected files:
midas/Makefile (added musbstd.c to libmidas.{a,so})
include/musbstd.h, src/musbstd.c (preliminary USB access library)
drivers/bus/ccusb.{c,h}

Most of the CAMAC access functions have been tested (see comments in ccusb.c). If you find errors and problems, please email me (olchansk@triumf.ca) or write an elog reply to this elog message. Still missing are the documentation and the finalization of the USB access library, as well as conformity to some MIDAS coding conventions. Enjoy, K.O.
237	14 Dec 2005	Konstantin Olchanski	Bug Report	misc problems
I would like to document a few problems I ran into while setting up a new experiment (two USB interfaces to Alice TPC electronics, plus maybe a USB interface to CAMAC). I am using a midas cvs checkout from last October, so I am not sure if these problems exist in the very latest code. I have fixes for all of them and I will commit them after some more testing and after I figure out how to commit into this new svn thingy.

- mxml: writing xml into an in-memory buffer probably produces invalid xml because one of the mxml functions always writes "/>" into writer->fh, which is 0 for in-memory writers, so the "/>" tag goes to the console instead of the xml data stream.
- hs_write_event() closes fd 0 (standard input), which confuses ss_getch(), which makes mlogger not work (at least on my machine). I traced this down to the history file file descriptors being initialized to zero and hs_write_event() closing files without checking that it ever opened them.
- mevb: event builder did not work with a single frontend (a two-liner fix, once Pierre showed me where to look. Why? My second TPC-USB interface did not yet arrive and I wanted to test my frontend code. Yes, it had enough bugs to prevent the event builder from working).
- mevb: consumes 100% CPU. Fix: add a delay in the main busy-loop.
- mlogger ROOT tree output does not work for data banks coming through the event builder: mlogger looks for the bank definition under the event_id of mevb, in /equipment/evb/variables, which is empty, as the data banks are under /equipment/frontendNN/variables. This may be hard to fix: bank "TPCA" may be under "fe01", "TPCB" under "fe02" and mlogger knows nothing about any of this. Fix: go back to .mid files.

K.O.
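The hs_write_event() item above comes down to using 0 as the "not open" value for a file descriptor. A minimal sketch of the usual remedy follows, assuming a hypothetical hist_file structure and helper names (these are illustrations, not the MIDAS history code): initialize the descriptor to -1 and close it only if it was actually opened.

```c
#include <assert.h>
#include <fcntl.h>
#include <stddef.h>
#include <unistd.h>

/* Hypothetical stand-in for a history file handle; not the MIDAS structure. */
typedef struct {
   int fd;                      /* -1 means "never opened" */
} hist_file;

/* Initialize to -1, not 0: descriptor 0 is standard input. */
void hist_init(hist_file *h)
{
   assert(h != NULL);
   h->fd = -1;
}

/* Open the file read-only. Returns 0 on success, -1 on failure. */
int hist_open(hist_file *h, const char *path)
{
   assert(h != NULL && path != NULL);
   h->fd = open(path, O_RDONLY);
   return h->fd >= 0 ? 0 : -1;
}

/* Close only what was actually opened.
   Returns 1 if a descriptor was closed, 0 if there was nothing to close. */
int hist_close(hist_file *h)
{
   assert(h != NULL);
   if (h->fd < 0)
      return 0;
   close(h->fd);
   h->fd = -1;
   return 1;
}
```

With this convention, calling hist_close() on a handle that was never opened is a no-op and standard input stays untouched.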
238	22 Dec 2005	Konstantin Olchanski	Info	midas max event size?
My TPC events are fairly large: 18 FEC cards * 128 channels per card * 2 Kbytes per channel = about 4 Mbytes. In my frontend, when I request this event size, MIDAS complains (in mfe.c) that it is bigger than MAX_EVENT_SIZE, which is set to 0.5 Mbytes in midas.h. What is the best way to deal with this? Should we increase MAX_EVENT_SIZE to something bigger? Remove the MAX_EVENT_SIZE limitation altogether? For now, I increased the value of MAX_EVENT_SIZE & co to (10*1024*1024) and it seems to work (I also had to bump the sanity check in bm_open_buffer() from 10E6 to 100E6). With 1/4 of the FEC cards, the event size is 1 Mbyte. At ~6 ev/sec the machine is almost idle, with the biggest CPU user being the event builder at 10% CPU utilization. K.O.
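The size arithmetic can be checked with a few lines of C. This is a sketch only: the function names are made up for illustration, 1 Kbyte is taken as 1024 bytes, and the limits are passed in by the caller rather than read from midas.h. With these assumptions the full event comes to 4718592 bytes (4.5 Mbytes), which exceeds a 0.5 Mbyte limit and fits under 10*1024*1024.

```c
#include <assert.h>
#include <stddef.h>

enum { KBYTE = 1024 };          /* assumption: 1 Kbyte = 1024 bytes */

/* Event size in bytes for a given number of FEC cards. */
size_t tpc_event_bytes(size_t fec_cards, size_t channels_per_card,
                       size_t kbytes_per_channel)
{
   assert(kbytes_per_channel > 0);
   return fec_cards * channels_per_card * kbytes_per_channel * KBYTE;
}

/* Nonzero if the event fits within the given maximum event size. */
int fits_in_limit(size_t event_bytes, size_t max_event_size)
{
   return event_bytes <= max_event_size;
}
```

For example, tpc_event_bytes(18, 128, 2) can be compared against 512 * 1024 and against 10 * 1024 * 1024 with fits_in_limit().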
239	22 Dec 2005	Konstantin Olchanski	Info	How do I do custom event building?
It turns out that the standard event builder fragment matching algorithm cannot be used in my TPC application. I have two TPC-USB interfaces, which lack any "busy" or synchronization logic. I send the hardware trigger into both interfaces, and if one of them misses it, the data is out of sync forever. Consider:

Hardware trigger	trig1	trig2	trig3	trig4
TPC01	serial1	serial2	serial3	serial4
TPC02	serial1	(missing)	serial2	serial3

With the event builder matching only the event serial numbers, the first event will be okay, but the second event will have trig2 data from TPC01 and trig3 data from TPC02, etc.

The problem exists even if the TPC-USB interfaces do not miss any triggers: during begin and end of run, the interfaces are enabled one at a time, so if a trigger arrives after the first interface was enabled, but before the second is enabled, the data starts being out of sync (and if the same happens during the end-of-run, the event counts from both frontends will match, but all data would still be out of sync).

Obviously additional data is needed to match the fragments. So in each frontend, I have a high-precision timestamp (gettimeofday(), usec resolution) and I would like to have the event builder match the timestamps instead of event serial numbers. What is the best way to do this? The mevb.c code does not have any user callbacks for checking "do these fragments belong to the same event?".

P.S. The event rate will be about 1/sec from cosmic ray tests and at most 10-50/sec in the M11 beam line at TRIUMF. At these low rates, the gettimeofday() timestamps should be adequate. K.O.
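A minimal sketch of what timestamp-based matching could look like, independent of mevb.c. The fragment and match types, the function name and the matching window are all hypothetical, and the sketch assumes both frontends stamp fragments from the same (or well synchronized) clock and deliver them in time order. Fragments pair up when their timestamps agree within the window; an unpartnered fragment on either side is skipped.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical fragment header: serial number plus gettimeofday() time in usec. */
typedef struct {
   uint32_t serial;
   int64_t t_usec;
} fragment;

/* Indices of two fragments judged to belong to the same trigger. */
typedef struct {
   size_t a;
   size_t b;
} match;

/* Walk two time-ordered fragment lists and pair fragments whose timestamps
   differ by at most window_usec. Returns the number of pairs written to out. */
size_t match_by_time(const fragment *fa, size_t na,
                     const fragment *fb, size_t nb,
                     int64_t window_usec, match *out, size_t max_out)
{
   size_t i = 0, j = 0, n = 0;

   assert(window_usec >= 0);
   while (i < na && j < nb && n < max_out) {
      int64_t dt = fa[i].t_usec - fb[j].t_usec;

      if (dt > window_usec) {
         j++;                   /* fb[j] is older than anything left in fa: no partner */
      } else if (dt < -window_usec) {
         i++;                   /* fa[i] is older than anything left in fb: no partner */
      } else {
         out[n].a = i;
         out[n].b = j;
         n++;
         i++;
         j++;
      }
   }
   return n;
}
```

Applied to the example above, where TPC02 misses trig2, the fragment for trig2 from TPC01 is skipped and trig3 and trig4 pair up correctly even though their serial numbers differ.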
242	23 Dec 2005	Konstantin Olchanski	Bug Report	minor changes to run transition code
> Minor changes to run transitions code:
> - fail transition if cannot connect to one of the clients

This change introduced a problem:
1) a run is happily taking data
2) a frontend crashes
3) the web interface cannot stop the run (cannot contact the crashed frontend) until it is removed by the timeout (10-60 seconds?).

I am now considering allowing the run to end even if some clients cannot be contacted. The begin, pause and resume transitions would continue to fail if clients cannot be contacted. K.O.
244	28 Dec 2005	Konstantin Olchanski	Suggestion	Handling multiple identical USB devices
When I wrote the musbstd.h "open" method, I kind of punted on the problem of handling multiple identical USB devices. Instead of a real solution, I added an "instance" parameter, which allows one to "open" the "first", "second", etc USB device, as listed in a magic random system dependent order.

Normally, USB devices are identified by two 16-bit integers: manufacturer ID and product ID (i.e. as reported by "lsusb"). This works well until one has more than one "identical" device. Two years ago, I had 5 identical USB cameras (optical alignment system for TRIUMF-TWIST); last year, I had multiple USB serial adapters; today I have two identical USB-TPC interfaces.

Most of the time, the devices are plugged into the same USB ports, so theoretically, one should be able to tell exactly which one is which ("upstream camera is plugged into port 1, downstream camera is plugged into port 2"). But in the magic system dependent enumeration order, they keep moving around, depending on the order of enumeration, history of powering up and down, phase of the Moon, etc. So my generic "musbstd" method of "open first", "open second", etc turned out to be completely dysfunctional.

So far, I am unable to come up with a system independent solution. But I have a solution for Linux and maybe for MacOSX:
1) on Linux, I can use the information parsed from /proc/bus/usb/devices to say "please open the USB device on USB bus 1, port 1", the so called USB device "path", as seen in the system log and in /sys/bus/usb/devices.
2) on MacOSX, I was unable to find a way to discover the USB topology, but they seem to maintain an uint32_t "location", which they promise to keep at least across reboots (did not check this yet).
3) Windows I did not look at yet.

So we have a choice:
a) use system dependent "musb_open_linux(usbpath,vendor,product)", "musb_open_macosx(???,vendor,product)", etc
b) create order out of chaos by manually keeping a map of "instances" (first, second, third device) to "persistent addresses". On Linux, it would be a file containing something like this: "USB-TPC-0 is on bus1-port1, USB-TPC-1 is on bus1-port2". Then again, I can say "please open USB-TPC interface instance 0" or "instance 1", etc.

There is a small difficulty with dealing with devices temporarily or permanently going away, or changing physical addresses ("I moved the USB device from port 1 to port 3"). This could be handled by telling the user "hmm... USB topology has changed, please delete the map file and try again", or we could come up with something more user friendly. Any thoughts?

P.S. For my immediate need (I need this tomorrow), I will write a musb_open_linux(usbpath,vendor,product) function. K.O.
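Option (b) could be sketched as a small name-to-path table. The map format below (one "name path" pair per line), the structure and the function names are all assumptions made for illustration; nothing here is part of musbstd.h, and actually opening the device at the returned path is left out.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

#define MAX_NAME 32

/* One line of the hypothetical map file: logical name and persistent USB path. */
typedef struct {
   char name[MAX_NAME];
   char path[MAX_NAME];
} usb_map_entry;

/* Parse whitespace-separated "name path" pairs from text.
   Returns the number of entries stored in map. */
size_t usb_map_parse(const char *text, usb_map_entry *map, size_t max_entries)
{
   size_t n = 0;
   int used = 0;

   assert(text != NULL && map != NULL);
   while (n < max_entries &&
          sscanf(text, " %31s %31s%n", map[n].name, map[n].path, &used) == 2) {
      text += used;             /* advance past the pair just parsed */
      n++;
   }
   return n;
}

/* Return the persistent path for a logical name, or NULL if it is not mapped. */
const char *usb_map_lookup(const usb_map_entry *map, size_t n, const char *name)
{
   size_t i;

   for (i = 0; i < n; i++)
      if (strcmp(map[i].name, name) == 0)
         return map[i].path;
   return NULL;
}
```

A frontend would then ask for "USB-TPC-0" and get back "bus1-port1" regardless of enumeration order; a NULL result is the place to tell the user that the topology has changed.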
245	30 Dec 2005	Konstantin Olchanski	Bug Report	mhttpd "edit on start" broken for arrays
If a variable under "/experiment/edit on start/" is an array, it is correctly offered for editing on the "start run page", but then all elements in the array end up set to the value of the first element. This appears to be an error in mhttpd.c:interprete(), in the "start dialog" section. The non-working version in CVS reads:

   for (j = 0; j < key.num_values; j++) {
      size = key.item_size;
      sprintf(str, "x%d", n++);
      db_sscanf(getparam(str), data, &size, j, key.type);
      db_set_data_index(hDB, hsubkey, data, size + 1, j, key.type);
   }

The fix that works for me reads:

      db_sscanf(getparam(str), data, &size, 0, key.type);

(notice: the argument "j" is replaced with "0"). The way I understand this, all array elements are encoded into individual HTTP thingy strings, named sequentially x0, x1, ... and when we parse the values out of them, the array index should never show up. (Stefan, if you can, please commit a fix to svn). K.O.
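The symptom can be reproduced in a small standalone model. The two helpers below are hypothetical stand-ins, written on the assumption that db_sscanf() stores the parsed value at position "index" of the scratch buffer while db_set_data_index() reads the value from the start of that buffer; they are not the MIDAS functions themselves.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Stand-in for db_sscanf(): parse text and store the value at data[index]. */
void parse_at(const char *text, int *data, int index)
{
   assert(text != NULL && data != NULL && index >= 0);
   data[index] = atoi(text);
}

/* Stand-in for db_set_data_index(): copy the value at the start of data
   into element index of the array. */
void set_index(int *array, const int *data, int index)
{
   array[index] = data[0];
}

/* Apply n per-element parameter strings (x0, x1, ...) to the array.
   buggy != 0 parses with index j, as in the CVS version;
   buggy == 0 parses with index 0, as in the proposed fix. */
void apply_params(const char *const *params, int n, int *array, int buggy)
{
   int data[16] = { 0 };
   int j;

   assert(n <= 16);
   for (j = 0; j < n; j++) {
      parse_at(params[j], data, buggy ? j : 0);
      set_index(array, data, j);
   }
}
```

With parameters "10", "20", "30", the buggy variant leaves the array as 10, 10, 10, while the fixed variant gives 10, 20, 30.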
252	07 May 2006	Konstantin Olchanski	Bug Fix	Update & add VME drivers
I committed fixes for a few minor compilation errors in the VME drivers (vmicvme.c, etc). I also added new drivers for the v513 latch and v560 scaler that I wrote for CERN-ALPHA. (Maybe I should mention that we also have drivers for the SIS 3820 multiscaler, the v895 VME discriminator and a few more modules. Will commit them as they mature). K.O.
253	07 May 2006	Konstantin Olchanski	Bug Report	cm_register_transition gyrations
I am debugging a Rome-based DAQ system set up by Pierre A. (the system does not work because of bugs in Rome). One problem I see is with my copy of cm_register_transition() in midas.c. Rome calls it with a NULL function to register a "queued" transition, but the cm_register_transition() code was changed (rev 3051) to make NULL mean "unregister" a transition (this broke the queued transitions used by Rome), then it was changed back (rev 3085). Of course, I was stuck with the broken version, so Rome did not work at all, and it cost me real wall time to get to the bottom of all this, only to discover that this problem is already fixed. So, I would greatly appreciate it if, in the future, changes (and bug fixes) to the MIDAS API were announced on this mailing list here. K.O.
255	11 May 2006	Konstantin Olchanski	Bug Report	MIDAS and Fedora 4
Fellow Midasites- we are receiving reports that current Midas sources do not compile on Fedora 4 (and 5?) with errors "invalid lvalue in assignment". It looks like the new compilers reject what looks to my eye like perfectly valid C code that we have been writing since the beginning of C. Any suggestions on the best fix? K.O.
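One common source of "invalid lvalue in assignment" with the GCC 4 compilers is the old GNU "cast as lvalue" extension, which they no longer accept. Whether this is what the MIDAS sources run into is an assumption; the sketch below only shows the general pattern and a portable rewrite, with a made-up function name.

```c
#include <assert.h>
#include <stddef.h>

/* Older code often advanced a typed pointer by a byte count like this:
 *
 *    (char *) p += nbytes;
 *
 * The cast result is not an lvalue in standard C, so newer compilers
 * reject the assignment. A portable form computes the new address and
 * assigns it to the pointer itself. nbytes is expected to be a multiple
 * of sizeof(int) so the result stays properly aligned. */
int *advance_bytes(int *p, size_t nbytes)
{
   assert(p != NULL);
   return (int *) ((char *) p + nbytes);
}
```

A call such as p = advance_bytes(p, 2 * sizeof(int)) then replaces the rejected statement.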
256	18 May 2006	Konstantin Olchanski	Bug Fix	removed a few "//" comments to fix compilation on VxWorks
Our VxWorks C compiler (gcc-2.8-something) does not like the "//" comments. Luckily, on VxWorks, we only compile a small subset of midas, so there is no point in banning all "//" comments. But I did have to convert a couple of them to /* comments */ in odb.c to make it compile. Changes to odb.c committed. K.O.
258	25 May 2006	Konstantin Olchanski	Bug Fix	fix crash in xml odb load
There is a crash in odbedit when loading some xml odb files: a missing check for NULL pointer when loading an array of strings and one of the array elements is blank. This check is present when loading other string values. Here is the diff:

-bash-3.00$ diff odb.c odb.c-new
5621c5621,5624
< db_set_data_index(hDB, hKey, mxml_get_value(child), size, i, tid);
---
> if (mxml_get_value(child) == NULL)
>    db_set_data_index(hDB, hKey, "", size, i, tid);
> else
>    db_set_data_index(hDB, hKey, mxml_get_value(child), size, i, tid);

K.O.
261	30 May 2006	Konstantin Olchanski	Bug Report	badness with vxworks/ppc
It appears that the latest version of MIDAS malfunctions on PowerPC/VxWorks machines; below are two problem reports. As reported, previous versions of MIDAS work fine, which I guess reduces the probability of it being buggy user code. At least one of the problems feels like a missing endian conversion somewhere, but I am not aware of any recent changes in the MIDAS RPC code... We will be trying to debug both problems, but any insight would be greatly appreciated. K.O.

From suz@triumf.ca Tue May 30 16:58:16 2006
Date: Tue, 30 May 2006 16:58:16 -0700 (PDT)
From: Suzannah Daviel <suz@triumf.ca>
To: konstantin olchanski <olchansk@triumf.ca>
Subject: rpc problems

Hi Konstantin,
Herewith a description of the problems,
Suzannah

Problem on system A:
--------------------
After upgrading the Linux operating system from RH9 to SL4, and installing latest Midas software, the first time a manual trigger is issued, the VxWorks frontend (running on a PPC) crashes:

Output on PPC console:
trigger histo event from status page
rpc_client_accept: starting with sock:11
program Exception current instruction address: 0x01ac7388
Machine Status Register: 0x0008b030
Condition Register: 0x24000082
Task: 0x1b47908 "mfe"

The histo event is usually large so is fragmented. It is sent out by a manual trigger and at end of run. When the run is ended (before an event request using a manual trigger so program has not yet crashed) the histo event is sent successfully. After returning to the previous version of Midas but still running SL4, this problem disappeared.

Problem on system B:
--------------------
Again, SL9 was installed, and the Midas software updated to the latest. When sending a periodic (non-fragmented) event, after a while, one of the parameters appears to become corrupted, and a lot of rpc_call error messages appear. These continue while data is still successfully sent out until the run is ended.
Tue May 9 05:20:29 2006 [Mdarc] * data saved in file /is01_data/bnmr/dlog/2006/040377.msr_v5 at Tue May 9 05:20:29 2006 (SN=5) *
Tue May 9 05:21:30 2006 [Mdarc] * data saved in file /is01_data/bnmr/dlog/2006/040377.msr_v6 at Tue May 9 05:21:30 2006 (SN=6) *
Tue May 9 05:22:31 2006 [Mdarc] * data saved in file /is01_data/bnmr/dlog/2006/040377.msr_v7 at Tue May 9 05:22:31 2006 (SN=7) *
Tue May 9 05:23:12 2006 [feBNMR] [midas.c:9325:rpc_call] parameters (1099059848) too large for network buffer (524344); param_size=1099059808
Tue May 9 05:23:12 2006 [feBNMR] [midas.c:9325:rpc_call] parameters (1099059848) too large for network buffer (524344); param_size=1099059808
........................................
Tue May 9 05:23:31 2006 [feBNMR] [midas.c:9325:rpc_call] parameters (1099059848) too large for network buffer (524344); param_size=1099059808
Tue May 9 05:23:32 2006 [feBNMR] [midas.c:9325:rpc_call] parameters (1099059848) too large for network buffer (524344); param_size=1099059808
Tue May 9 05:23:32 2006 [Mdarc] * data saved in file /is01_data/bnmr/dlog/2006/040377.msr_v8 at Tue May 9 05:23:32 2006 (SN=8) *
Tue May 9 05:23:32 2006 [feBNMR] [midas.c:9325:rpc_call] parameters (1099059848) too large for network buffer (524344); param_size=1099059808
Tue May 9 05:23:33 2006 [feBNMR] [midas.c:9325:rpc_call] parameters (1099059848) too large for network buffer (524344); param_size=1099059808
etc.
Another example showing that the corrupted parameter varies in size:

Thu Apr 13 19:00:00 2006 [mhttpd] Run #30005 started
Thu Apr 13 19:00:08 2006 [Mdarc] * Saved data file /is01_data/bnmr/dlog/2006/030005.msr_v1 at Thu Apr 13 19:00:08 2006 *
Thu Apr 13 19:01:10 2006 [Mdarc] * Saved data file /is01_data/bnmr/dlog/2006/030005.msr_v2 at Thu Apr 13 19:01:10 2006 *
Thu Apr 13 19:02:14 2006 [Mdarc] * Saved data file /is01_data/bnmr/dlog/2006/030005.msr_v3 at Thu Apr 13 19:02:14 2006 *
Thu Apr 13 19:03:20 2006 [Mdarc] * Saved data file /is01_data/bnmr/dlog/2006/030005.msr_v4 at Thu Apr 13 19:03:20 2006 *
Thu Apr 13 19:04:22 2006 [Mdarc] * Saved data file /is01_data/bnmr/dlog/2006/030005.msr_v5 at Thu Apr 13 19:04:22 2006 *
Thu Apr 13 19:05:12 2006 [feBNMR] [midas.c:9323:rpc_call] parameters (1077739560) too large for network buffer (524344)
Thu Apr 13 19:05:13 2006 [feBNMR] [midas.c:9323:rpc_call] parameters (1077739560) too large for network buffer (524344)
etc.
262	31 May 2006	Konstantin Olchanski	Bug Fix	mhist could not look at array data
When using mhist interactively, I could not look at array data: 1) if the array is the only variable, the question "what array index to use?" was not asked and zero was assumed, 2) even if the question was asked, the answer was ignored and zero was used. Fixes committed to utils/mhist.c K.O.
263	08 Jun 2006	Konstantin Olchanski	Bug Fix	fix compilation of musbstd.h, add it back to libmidas
I fixed the compilation of musbstd.h (it required -DHAVE_LIBUSB on Linux, but nothing knew about defining it) and put musbstd.o back into libmidas (USB support should be part of the standard base midas library). K.O.
264	08 Jun 2006	Konstantin Olchanski	Bug Fix	commit latest ccusb.c CAMAC-USB driver
I committed the latest driver for the Wiener CCUSB USB-CAMAC interface. It implements all functions from mcstd.h and has been tested to be plug-compatible with at least one of our CAMAC frontends. K.O.
265	08 Jun 2006	Konstantin Olchanski	Bug Fix	updated vmicvme driver
I committed the latest VMIC VME driver we use at TRIUMF. It has working support for D32 and D64 DMA and can move data from the SIS3820 multiscaler through the MIDAS frontend at > 30 Mbytes/sec on our VMICVME-7805 machines. The actual DMA speed on the VME bus is around 50 Mbytes/sec; the effective data rate is lower because of a memcpy() from the kernel DMA buffer into user memory (required by the MIDAS mvmestd.h interface, quite inefficient for DMA operations). K.O.
266	08 Jun 2006	Konstantin Olchanski	Bug Report	Midas does not build on Fedora 5
Fresh svn checkout of MIDAS does not build on Fedora 5, I get this error:

cc -c -g -O2 -Wall -Wuninitialized -Iinclude -Idrivers -I../mxml -Llinux/lib -DINCLUDE_FTPLIB -D_LARGEFILE64_SOURCE -DHAVE_ROOT -pthread -I/triumfcs/trshare/olchansk/root/root_v5.10.00_SL40/include -m32 -DOS_LINUX -fPIC -Wno-unused-function -o linux/lib/odb.o src/odb.c
src/odb.c: In function 'db_open_database':
src/odb.c:805: warning: dereferencing type-punned pointer will break strict-aliasing rules
src/odb.c: In function 'db_lock_database':
src/odb.c:1350: warning: dereferencing type-punned pointer will break strict-aliasing rules
cc: Internal error: Segmentation fault (program cc1)
Please submit a full bug report.
See <URL:http://bugzilla.redhat.com/bugzilla> for instructions.
make: *** [linux/lib/odb.o] Error 1

If I compile odb.c without "-O2", the rest of MIDAS builds without any more errors. The observed warnings are (I do not know what they mean):

warning: dereferencing type-punned pointer will break strict-aliasing rules
warning: missing sentinel in function call (Cannot do without sentinels, eh?)
warning: pointer targets in passing argument 3 of 'getsockname' differ in signedness
warning: non-local variable '<anonymous struct> out_info' uses anonymous type

The "invalid lvalue" errors seem to have been successfully vanquished. K.O.
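For reference, the "type-punned pointer" warning is generally about reading an object through a pointer of an unrelated type, which the optimizer at -O2 is allowed to assume never happens. The code at odb.c:805 is not shown in this message, so the sketch below is a generic illustration with a made-up function name, not the MIDAS code; it assumes a 32-bit IEEE 754 float.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* The pattern the warning is about looks like this:
 *
 *    uint32_t bits = *(uint32_t *) &f;
 *
 * It reads a float object through a uint32_t pointer. Copying the bytes
 * with memcpy() gives the same result without breaking the aliasing rules. */
uint32_t float_bits(float f)
{
   uint32_t bits;

   assert(sizeof bits == sizeof f);
   memcpy(&bits, &f, sizeof bits);
   return bits;
}
```

Compilers typically turn the memcpy() into a plain register move, so the rewrite costs nothing at run time.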
277	25 Jul 2006	Konstantin Olchanski	Bug Report	mhttpd passwords broken for MacOS 10.4 Safari
I observe that the mhttpd passwords do not work correctly for the Safari web browser on MacOS 10.4.7: Safari 2.0.4 (419.3). For example, I cannot submit elog messages- the system gets stuck on the "Password" page. The Safari browser in MacOS 10.3 works fine. Mozilla/Firefox works fine. (Also would be useful if "remember password" worked with MIDAS, in any browser). K.O.
281	28 Jul 2006	Konstantin Olchanski	Bug Fix	mhttpd: use more strlcpy(), fix a few bugs
While investigating the mhttpd password error with the MacOS Safari browser, I found that it was caused by a strcpy() buffer overflow. With Stefan's blessing, I have now converted most uses of strcpy() and strcat() to strlcpy() and strlcat(). This fixes the Safari password problem (it was memory corruption in mhttpd). While validating these changes, I also found an incorrect use of sizeof() in the mhttpd history code for plotting run markers. I fixed that as well. P.S. The remaining strcpy() calls look safe wrt buffer overflows. There are no strcat() calls left. But there is still a large number of unsafe-looking sprintf() uses. K.O.
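A short illustration of why the bounded copy helps. The function below is a local stand-in with strlcpy() semantics, given a different name (bounded_copy, hypothetical) so that it does not clash with a system or MIDAS-provided strlcpy(): it never writes more than size bytes, always terminates the destination, and returns the length of the source so that truncation can be detected.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Copy src into dst, writing at most size bytes including the terminator.
   Returns strlen(src); a return value >= size means the copy was truncated. */
size_t bounded_copy(char *dst, const char *src, size_t size)
{
   size_t len;

   assert(dst != NULL && src != NULL);
   len = strlen(src);
   if (size > 0) {
      size_t n = len < size - 1 ? len : size - 1;

      memcpy(dst, src, n);
      dst[n] = '\0';
   }
   return len;
}
```

Copying a 12-character string into an 8-byte buffer this way stores the first 7 characters plus the terminator, where strcpy() would have written past the end of the buffer.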
