ELOG Midas

Midas DAQ System, Page 11 of 139

2765 Entries

ID	Date	Author	Topic	Subject
2589	17 Aug 2023	Stefan Ritt	Bug Report	midas wants to show notification?
> > This feature was asked by some people ...
>
> "show notifications" popups are strongly associated with disreputable web sites (presumably to
> push spam), it was surprising to see it from midas.
>
> K.O.

I agree. But unlike emails (where you get lots of spam as well), you can nicely blacklist/whitelist desktop notifications. I suppress all of them except the one for MIDAS. This allows me to watch our experiment without staring at the web page all the time.

The main question here is maybe whether the desktop notification should be on or off by default (for a fresh browser). While you can always change that via the mhttpd "Config" page, the default value is chosen by the system. I thought I put it to "on" so people can experience it, and then turn it off if they don't like it. Having them off by default, most people would never notice this possibility. But I'm open to a discussion here.

Stefan
2588	16 Aug 2023	Stefan Ritt	Bug Report	Error accessing history files
Tonight we got another error of that type after the update:

04:17 - [mhttpd,ERROR] [history_schema.cxx:2913:FileHistory::read_data,ERROR] Cannot read '/data2/history/mhf_1692128214_20230815_gassystem.dat', read() errno 2 (No such file or directory)

This morning I looked at the file, and it was there:

[meg@megon02 history]$ ls -alg mhf_1692128214_20230815_gassystem.dat
-rw-rw-r--. 1 meg 4663228 Aug 17 08:50 mhf_1692128214_20230815_gassystem.dat
[meg@megon02 history]$

Stefan
2587	16 Aug 2023	Konstantin Olchanski	Bug Report	excessive logging of http requests
> > Our default configuration of apache httpd logs every request.
> > MIDAS custom web pages can easily make a huge number of RPC calls creating a
> > huge log file and filling system disk to 100% capacity.

Added "daily" to /etc/logrotate.d/httpd; the default was "weekly", which is not often enough.

K.O.
2586	16 Aug 2023	Konstantin Olchanski	Bug Report	midas wants to show notification?
> This feature was asked by some people ...

"show notifications" popups are strongly associated with disreputable web sites (presumably to push spam), it was surprising to see it from midas.

K.O.
2585	16 Aug 2023	Stefan Ritt	Bug Report	midas wants to show notification?
> > I started to get web browser popups about "midas wants to show notifications,
> > block/allow/x". is this a glitch or a new unannounced/undocumented feature?
> > google chrome on macos. K.O.
>
> https://bitbucket.org/tmidas/midas/commits/e101dea764c647211c560a68db7ecda1834198db
>
> I did not consider this a significant feature to be announced here. Just a few lines
> of code. You can turn it on/off via the "Config" web page.
>
> Stefan

Now that I look at it again, I realized that the config check boxes had a bug. I fixed that and now the disable should work correctly.

This feature was asked for by some people who monitor an experiment and have the browser window in the background, and also have sound off (large office). So desktop notifications are a good thing for them.

Stefan
2584	16 Aug 2023	Stefan Ritt	Bug Report	midas wants to show notification?
> I started to get web browser popups about "midas wants to show notifications,
> block/allow/x". is this a glitch or a new unannounced/undocumented feature?
> google chrome on macos. K.O.

https://bitbucket.org/tmidas/midas/commits/e101dea764c647211c560a68db7ecda1834198db

I did not consider this a significant feature to be announced here. Just a few lines of code. You can turn it on/off via the "Config" web page.

Stefan
2583	16 Aug 2023	Konstantin Olchanski	Bug Report	midas wants to show notification?
I started to get web browser popups about "midas wants to show notifications, block/allow/x". is this a glitch or a new unannounced/undocumented feature? google chrome on macos. K.O.
2582	15 Aug 2023	Konstantin Olchanski	Info	mlogger update
A bit of an update to the mlogger, in preparation for more cleanup when Stefan is here at TRIUMF.

1) fix overwrite of existing files if run number is reset (check for existing files was missing in the LZ4, BZ2 & co data path)
2) made output files read-only (midas, json and checksum files)
3) commented out the old code paths

Currently active per-channel ODB settings:

Active - enable or disable mlogger channel
Type - NOT USED
Filename - output filename template, %d are replaced by run number and subrun number, also pipe command for PIPE output
Format - NOT USED
Compression - NOT USED
ODB dump - enable/disable writing ODB dump to data file
ODB dump format - "json" is recommended for new experiments
Log messages - write log messages to output file, 0=off, -1=write all messages
Buffer - "SYSTEM" read events from this event buffer
EventID - "-1" for all events
Trigger Mask - "-1" for all events
Event Limit - stop run after so many events
Byte Limit - stop run after so many bytes
Subrun Byte limit - switch to next subrun file after writing so many bytes. actual file size is larger than subrun_byte_limit because of ODB dumps.
Tape Capacity - NOT USED
Subdir Format - if not empty, output file name is DIR/SUBDIR/FILENAME, "%" format things are expanded by strftime().
Current Filename - updated by mlogger, contains the currently written file name
Data checksum - checksum before compression, use CRC32C for maximum speed, SHA512 for maximum security.
File checksum - checksum after compression, CRC32C is good against accidental file corruption, SHA512 is cryptographically strong, good against purposeful tampering.
Compress - use "lz4" for maximum speed, bzip2 or pbzip2 for maximum compression. no compression and gzip are not recommended. (ZFS may apply lz4 compression to uncompressed data).
Output - "NULL" do not write anything, "FILE" write to disk, "FTP" write to FTP server, "ROOT" write via the mlogger ROOT writer (docs?), "PIPE" pipe data through an external command (e.g. for bzip2 compression).
Gzip compression - gzip compression flags (see gzip docs, 1=max speed, 9=max compression)
Bzip2 compression - if non-zero, bzip2 compression level (see "bzip2 -h", 1=max speed, 9=max compression)
Pbzip2 num cpu - number of CPUs used by parallel bzip2 compression, pbzip2 -p flag
Pbzip2 compression - if non-zero, pbzip2 compression level (see "pbzip2 -h", default is 9=max compression)
Pbzip2 options - any additional pbzip2 options, e.g. -l, -m, -p, etc.

Currently active /Logger options:

Data Dir - where to write all output files, if empty, cm_get_path() is used.
Message file date format - not used in mlogger
Message dir - not used in mlogger
Write data - if set to "no", midas file, runlog, etc will not be written.
ODB Dump - at run stop, save odb to disk
ODB Dump File - file name for "ODB Dump" save file. "%d" is replaced by run number. "json" format is recommended for new experiments.
ODB Last Dump File - at run start, save ODB to disk. "json" format is recommended for new experiments.
Auto restart - run stopped by time limit or event limit is automatically restarted
Auto restart delay - wait for so many seconds before restarting the run
Tape message - NOT USED
Run duration - stop the run after so many seconds
Next subrun - change from "no" to "yes" to force mlogger to open a new subrun file (should this be per-channel?)
Subrun duration - open new subrun file after so many seconds (should this be per-channel?)
History dir - not used in mlogger
Detached transition - "no" use the normal multithreaded transitions (recommended), "yes" use mtransition helper to stop and restart runs. sometimes fails because mtransition is not in the user $PATH or wrong version of mtransition is in the user $PATH.

K.O.
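To make the "Filename", "Subdir Format" and checksum settings concrete, here is a minimal Python sketch of how an output path could be assembled and where the two checksums sit relative to compression. It is not mlogger code: the function names, the "/data" directory and the "run%05d_%03d.mid.bz2" template are hypothetical, the template is assumed to contain exactly two %d items, and SHA-512 with bzip2 is used only because both are in the Python standard library.

```python
import bz2
import hashlib
import os
import time

def output_path(data_dir, subdir_format, filename_template, run, subrun, when):
    """Build DIR/SUBDIR/FILENAME following the settings described above."""
    # "Subdir Format": "%" items are expanded by strftime(); empty means no subdirectory.
    subdir = time.strftime(subdir_format, when) if subdir_format else ""
    # "Filename": %d items are replaced by run number and subrun number
    # (assumption: the template has exactly two %d items).
    filename = filename_template % (run, subrun)
    return os.path.join(data_dir, subdir, filename)

def checksums(event_data):
    """Return (data checksum, file checksum, compressed bytes)."""
    data_sum = hashlib.sha512(event_data).hexdigest()   # "Data checksum": before compression
    compressed = bz2.compress(event_data, 9)            # 9 = max compression
    file_sum = hashlib.sha512(compressed).hexdigest()   # "File checksum": after compression
    return data_sum, file_sum, compressed

# Hypothetical values, with a fixed timestamp so the result is reproducible.
print(output_path("/data", "%Y%m", "run%05d_%03d.mid.bz2", 123, 4, time.gmtime(0)))
```

The data checksum lets a reader verify the decompressed events, while the file checksum verifies the file on disk without decompressing it.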
2581	14 Aug 2023	Konstantin Olchanski	Bug Report	excessive logging of http requests
> Our default configuration of apache httpd logs every request.
> MIDAS custom web pages can easily make a huge number of RPC calls creating a
> huge log file and filling system disk to 100% capacity.

close but no cigar. mhttpd is not running and /var/log got filled to 100% capacity by http error messages. I do not see any apache facility to filter error messages, hmm...

-rw-r--r-- 1 root root 1864421376 Aug 14 12:53 ssl_error_log

[Sun Aug 13 23:53:12.416247 2023] [proxy:error] [pid 18608] AH00940: HTTP: disabled connection for (localhost)
[Sun Aug 13 23:53:12.416538 2023] [proxy:error] [pid 19686] AH00940: HTTP: disabled connection for (localhost)
[Sun Aug 13 23:53:12.416603 2023] [proxy:error] [pid 19681] AH00940: HTTP: disabled connection for (localhost)
[Sun Aug 13 23:53:12.416775 2023] [proxy:error] [pid 19588] AH00940: HTTP: disabled connection for (localhost)
[Sun Aug 13 23:53:12.417022 2023] [proxy:error] [pid 19311] AH00940: HTTP: disabled connection for (localhost)
[Sun Aug 13 23:53:12.421864 2023] [proxy:error] [pid 18620] AH00940: HTTP: disabled connection for (localhost)
[Sun Aug 13 23:53:12.422051 2023] [proxy:error] [pid 19693] AH00940: HTTP: disabled connection for (localhost)
[Sun Aug 13 23:53:12.422199 2023] [proxy:error] [pid 19673] AH00940: HTTP: disabled connection for (localhost)
[Sun Aug 13 23:53:12.422222 2023] [proxy:error] [pid 18608] AH00940: HTTP: disabled connection for (localhost)
[Sun Aug 13 23:53:12.422230 2023] [proxy:error] [pid 19657] AH00940: HTTP: disabled connection for (localhost)
[Sun Aug 13 23:53:12.422259 2023] [proxy:error] [pid 18633] AH00940: HTTP: disabled connection for (localhost)
[Sun Aug 13 23:53:12.427513 2023] [proxy:error] [pid 19686] AH00940: HTTP: disabled connection for (localhost)
[Sun Aug 13 23:53:12.427549 2023] [proxy:error] [pid 19681] AH00940: HTTP: disabled connection for (localhost)
[Sun Aug 13 23:53:12.427645 2023] [proxy:error] [pid 19588] AH00940: HTTP: disabled connection for (localhost)
[Sun Aug 13 23:53:12.427774 2023] [proxy:error] [pid 19693] AH00940: HTTP: disabled connection for (localhost)
[Sun Aug 13 23:53:12.427800 2023] [proxy:error] [pid 18620] AH00940: HTTP: disabled connection for (localhost)

K.O.
2580	09 Aug 2023	Konstantin Olchanski	Bug Fix	Stefan's improved ODB flush to disk
This is an important improvement, should have a post of its own. K.O.

> > > RFE filed:
> > > https://bitbucket.org/tmidas/midas/issues/367/odb-should-be-saved-to-disk-periodically
> >
> > Implemented and closed: https://bitbucket.org/tmidas/midas/issues/367/odb-should-be-saved-to-disk-periodically
> >
> > Stefan
>
> Stefan's comments from the closed bug report:
>
> Ok I implemented some periodic flushing. Here is what I did:
>
> Created
>
> /System/Flush/Flush period : TID_UINT32
> /System/Flush/Last flush : TID_UINT32
>
> which control the flushing to disk. The default value for "Flush period" is 60 seconds or one minute.
>
> All clients call db_flush_database() through their cm_yield() function.
> db_flush_database() checks the "Last flush" and only flushes the ODB when the period has expired. This test is
> done inside the ODB semaphore so that we don't get a race condition.
> If the period has expired, db_flush_database() calls ss_shm_flush().
> ss_shm_flush() tries to allocate a buffer of the shared memory. If the allocation is not successful (out of
> memory), ss_shm_flush() writes directly to the binary file as before.
> If the allocation is successful, ss_shm_flush() copies the shared memory to a buffer and passes this buffer to a
> dedicated thread which writes the buffer to the binary file. This causes ss_shm_flush() to return immediately and
> not block the calling program during the disk write operation.
> Added back the "if (destroy_flag) ss_shm_flush()" so that the ODB is flushed for sure before the shared memory
> gets deleted.
> This means now that under normal circumstances, exiting programs like odbedit do NOT flush the ODB. This allows
> calling many "odbedit -c" in a row without the flush penalty. Nevertheless, the ODB then gets flushed by other
> clients at the latest 60 seconds (or whatever the flush period is) after odbedit exits.
>
> Please note that ODB flushing has two purposes:
>
> When all programs exit, we need a persistent storage for the ODB. In most experiments this happens only very
> seldom. Maybe at the end of a beam time period.
> If the computer crashes, a recent version of the ODB is kept on disk to simplify recovery after the crash.
>
> Since crashes are not so frequent (during production periods we have maybe one hardware failure every few years),
> flushing the ODB too often does not make sense and just consumes resources. Flushing also does not protect against
> corrupted ODBs, since the binary image will also get corrupted. So the only reason for periodic flushes is to ease
> recovery after a total crash. I put the default to 60 seconds, but if people are really paranoid they can decrease
> it to 10 seconds or so. Or increase it to 600 seconds if their system does not crash every week and disks are
> slow.
>
> I made a dedicated branch feature/periodic_odb_flush so people can test the new functionality. If there are no
> complaints within the next few days, I will merge that into develop.
>
> Stefan
2578	09 Aug 2023	Konstantin Olchanski	Suggestion	Maximum ODB size
> > RFE filed:
> > https://bitbucket.org/tmidas/midas/issues/367/odb-should-be-saved-to-disk-periodically
>
> Implemented and closed: https://bitbucket.org/tmidas/midas/issues/367/odb-should-be-saved-to-disk-periodically
>
> Stefan

Stefan's comments from the closed bug report:

Ok I implemented some periodic flushing. Here is what I did:

Created

/System/Flush/Flush period : TID_UINT32
/System/Flush/Last flush : TID_UINT32

which control the flushing to disk. The default value for "Flush period" is 60 seconds or one minute.

All clients call db_flush_database() through their cm_yield() function.
db_flush_database() checks the "Last flush" and only flushes the ODB when the period has expired. This test is done inside the ODB semaphore so that we don't get a race condition.
If the period has expired, db_flush_database() calls ss_shm_flush().
ss_shm_flush() tries to allocate a buffer of the shared memory. If the allocation is not successful (out of memory), ss_shm_flush() writes directly to the binary file as before.
If the allocation is successful, ss_shm_flush() copies the shared memory to a buffer and passes this buffer to a dedicated thread which writes the buffer to the binary file. This causes ss_shm_flush() to return immediately and not block the calling program during the disk write operation.
Added back the "if (destroy_flag) ss_shm_flush()" so that the ODB is flushed for sure before the shared memory gets deleted.
This means now that under normal circumstances, exiting programs like odbedit do NOT flush the ODB. This allows calling many "odbedit -c" in a row without the flush penalty. Nevertheless, the ODB then gets flushed by other clients at the latest 60 seconds (or whatever the flush period is) after odbedit exits.

Please note that ODB flushing has two purposes:

When all programs exit, we need a persistent storage for the ODB. In most experiments this happens only very seldom. Maybe at the end of a beam time period.
If the computer crashes, a recent version of the ODB is kept on disk to simplify recovery after the crash.

Since crashes are not so frequent (during production periods we have maybe one hardware failure every few years), flushing the ODB too often does not make sense and just consumes resources. Flushing also does not protect against corrupted ODBs, since the binary image will also get corrupted. So the only reason for periodic flushes is to ease recovery after a total crash. I put the default to 60 seconds, but if people are really paranoid they can decrease it to 10 seconds or so. Or increase it to 600 seconds if their system does not crash every week and disks are slow.

I made a dedicated branch feature/periodic_odb_flush so people can test the new functionality. If there are no complaints within the next few days, I will merge that into develop.

Stefan
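The flush logic described above can be illustrated with a small Python toy model. This is only a sketch of the idea (check the period under a lock, copy the memory, write from another thread), not the MIDAS implementation: the class name, the write_fn callable and the injectable clock are all hypothetical, and a new thread is started per flush here for brevity where the post describes a dedicated writer thread.

```python
import threading
import time

class PeriodicFlusher:
    """Toy model of the periodic flush described above (not the MIDAS code)."""

    def __init__(self, shared, write_fn, period=60.0, clock=time.monotonic):
        self.shared = shared            # bytearray standing in for the ODB shared memory
        self.write_fn = write_fn        # hypothetical callable that writes bytes to disk
        self.period = period            # plays the role of "Flush period"
        self.clock = clock
        self.last_flush = clock()       # plays the role of "Last flush"
        self.lock = threading.Lock()    # stands in for the ODB semaphore
        self.writers = []

    def flush(self, force=False):
        """Called from each client's yield loop; returns True if a flush was started."""
        with self.lock:
            now = self.clock()
            # Check and update "Last flush" under the lock so two clients cannot both flush.
            if not force and now - self.last_flush < self.period:
                return False
            self.last_flush = now
            snapshot = bytes(self.shared)   # copy the memory while holding the lock
            # The disk write runs in another thread, so flush() returns right away.
            writer = threading.Thread(target=self.write_fn, args=(snapshot,))
            self.writers.append(writer)
        writer.start()
        return True

    def wait(self):
        """Block until all started writes have finished (e.g. before shutdown)."""
        for writer in self.writers:
            writer.join()
```

A forced flush (force=True) corresponds loosely to the "if (destroy_flag) ss_shm_flush()" case, where the data must reach disk before the shared memory goes away.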
2577	09 Aug 2023	Konstantin Olchanski	Bug Report	Error accessing history files
I confirm I see the same on the agmini system. Two problems:

(a) the error message is wrong: it's a short read, not a read error (clue: the read() syscall does not return "no such file").
(b) mlogger is supposed to write history in record-size blocks, and they are read in the same record-size blocks. UNIX file semantics require that both reader and writer see read() and write() as atomic, even on NFS, so mhttpd should never see partially written history records.

I can debug this on the agmini system. Probably should.

Problem (a) is fixed in commit bb423c8680cc67220312534403840442868f2b3b. If you update, you should see error messages about "short read", and the read sizes it reports are very interesting, please put them in the elog here.

K.O.

> We sporadically (like once per few hours) have an error message when we access the
> history plots through mhttpd:
>
> 07:21:35.109 2023/08/03 [mhttpd,ERROR]
> [history_schema.cxx:2345:FileHistory::read_data,ERROR] Cannot read
> '/data2/history/mhf_1690890685_20230801_dc_hv.dat', read() errno 2 (No such file
> or directory)
>
> When I log in to the machine, I properly see the file and also can access it
>
> [meg@megon02 history]$ ls -l mhf_1690890685_20230801_dc_hv.dat
> -rw-rw-r--. 1 meg meg 34176312 Aug 3 07:23 mhf_1690890685_20230801_dc_hv.dat
>
> and I also can dump that file.
>
> When I try again with mhttpd, I properly see that file.
>
> Now in principle this is not a problem, but the error message is annoying, since this
> is the only error we get in 24 hours. I attached a 24h log to see what I mean. If this
> is an OS issue, I wonder if we should add code to retry the file access in case we get
> that error.
>
> Anybody seen a similar thing?
>
> Best,
> Stefan
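To illustrate point (a), here is a minimal Python sketch of the difference between a read error and a short read when reading fixed-size records. It is not the history_schema.cxx code; read_record and the 16-byte record size are hypothetical. The point is that a short read returns fewer bytes without raising an error, so any errno printed in that case is stale.

```python
import os
import tempfile

def read_record(fd, offset, record_size):
    """Read one fixed-size record; tell a short read apart from a read error."""
    try:
        data = os.pread(fd, record_size, offset)
    except OSError as e:
        # A real read error: errno is meaningful here.
        return None, "read error: errno %d (%s)" % (e.errno, e.strerror)
    if len(data) < record_size:
        # A short read is not an error and errno is not set, so report sizes instead.
        return data, "short read: wanted %d bytes, got %d" % (record_size, len(data))
    return data, None

with tempfile.TemporaryFile() as f:
    f.write(b"x" * 24)          # one and a half 16-byte records
    f.flush()
    print(read_record(f.fileno(), 0, 16)[1])    # full record, no message
    print(read_record(f.fileno(), 16, 16)[1])   # partial record at end of file
```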
2576	09 Aug 2023	Konstantin Olchanski	Forum	pull request for PostgreSQL support
> The compilation of midas was broken by the last modification. The reason is that
> Pgsql *fPgsql = NULL;
> was not protected by #ifdef HAVE_PGSQL

Confirmed, my mistake, I forgot to test with "make cmake NO_PGSQL". Your fix is correct, thanks.

K.O.
2575	04 Aug 2023	Konstantin Olchanski	Forum	Issues with Universe II Driver
> I can compile 32 bit midas. Unless I am misinterpreting the linking error, I don't
> think I can use the driver as built.

I think you are right, the Makefile from the Universe package does not build a -m32 version of libvme.so. I think I can fix that...

K.O.
2574	04 Aug 2023	Caleb Marshall	Forum	Issues with Universe II Driver
I can compile 32 bit midas. Unless I am misinterpreting the linking error, I don't think I can use the driver as built.

While trying to compile vme_scan, most of the programs fail with:

/usr/bin/ld: skipping incompatible /usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../../lib/libvme.so when searching for -lvme
/usr/bin/ld: skipping incompatible /lib/../lib/libvme.so when searching for -lvme
/usr/bin/ld: skipping incompatible /usr/lib/../lib/libvme.so when searching for -lvme
/usr/bin/ld: skipping incompatible /usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../libvme.so when searching for -lvme
/usr/bin/ld: skipping incompatible //lib/libvme.so when searching for -lvme
/usr/bin/ld: skipping incompatible //usr/lib/libvme.so when searching for -lvme

with libvme.so being built by the universe-II driver.

Not sure if I can get around this without messing with the driver? Is it possible to build a 32 bit version of that shared library without having to touch the actual kernel module?

-Caleb
2573	03 Aug 2023	Konstantin Olchanski	Bug Report	excessive logging of http requests
> > > Our default configuration of apache httpd logs every request. MIDAS custom web pages can easily make a huge number of RPC calls creating a > > > huge log file and filling system disk to 100% capacity > > perhaps use existing logrotate, add limit on file size (size) and limit of 2 old log files (rotate). > > CustomLog logs/ssl_request_log "%t %h %{SSL_PROTOCOL}x %{SSL_CIPHER}x \"%r\" %b" env=!envnolog > TransferLog is not conditional and has to be commented out to stop logging every jsonrpc request. K.O.
2572	03 Aug 2023	Caleb Marshall	Forum	Issues with Universe II Driver
I am looking into compiling the 32 bit midas. In the meantime, here is the kernel info: 3.10.0-1160.11.1.el7.x86_64 Thank you for the help. -Caleb
2571	03 Aug 2023	Konstantin Olchanski	Forum	Issues with Universe II Driver
> Here is the output:
>
> vmic_mmap: Mapped VME AM 0x0d addr 0x00000000 size 0x00ffffff at address 0x80a01000
> mvme_open:
> Bus handle = 0x3
> DMA handle = 0x158f5d0
> DMA area size = 1048576 bytes
> DMA physical address = 0x7f91db553000
> vmic_mmap: Mapped VME AM 0x2d addr 0x00000000 size 0x0000ffff at address 0x86ff0000
> vme addr: 00000000
> addr: db543000

I see the problem. A24 is mapped at 0x80xxxxxx, A16 is mapped at 0x86ffxxxx, but mvme_read computed address 0xdb543000, out of range of either mapped vme address. ouch.

One more thing to check: AFAIK, these universe-II codes were never used on a 64-bit CPU before, we only have 32-bit Pentium-3 and Pentium-4 machines with these chips. The tsi148 codes used to work both 32-bit and 64-bit, we used to have both flavours of CPUs, but now only have 64-bit.

What is your output for "uname -a"? Does it report a 32-bit or 64-bit kernel?

If you feel adventurous, you can build 32-bit midas (cd .../midas; make linux32), compile vmescan.o with "-m32" and link vmescan.exe against .../midas/linux-m32/lib, and see if that works.

Meanwhile, I can check if vmicvme.c is 64-bit clean. Checking if the kernel module is 64-bit clean would be more difficult...

K.O.
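The out-of-range observation can be checked with a few lines of arithmetic. This Python sketch uses only the addresses from the quoted output; the find_window helper is hypothetical, and it assumes the printed "size" is the last valid offset within each window.

```python
# Values copied from the vmic_mmap output quoted above: (mapped base address, size).
WINDOWS = {
    "A24 (AM 0x0d)": (0x80A01000, 0x00FFFFFF),
    "A16 (AM 0x2d)": (0x86FF0000, 0x0000FFFF),
}

def find_window(addr):
    """Return the name of the mapped window that contains addr, or None."""
    for name, (base, size) in WINDOWS.items():
        # Assumption: "size" is the highest valid offset, so the range is inclusive.
        if base <= addr <= base + size:
            return name
    return None

# The address printed by the scan program ("addr: db543000").
print(find_window(0xDB543000))
```

The computed address lies in neither window, which matches the diagnosis in the post.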
2570	03 Aug 2023	Caleb Marshall	Forum	Issues with Universe II Driver
Here is the output:

vmic_mmap: Mapped VME AM 0x0d addr 0x00000000 size 0x00ffffff at address 0x80a01000
mvme_open:
Bus handle = 0x3
DMA handle = 0x158f5d0
DMA area size = 1048576 bytes
DMA physical address = 0x7f91db553000
vmic_mmap: Mapped VME AM 0x2d addr 0x00000000 size 0x0000ffff at address 0x86ff0000
vme addr: 00000000
addr: db543000
2569	02 Aug 2023	Stefan Ritt	Bug Report	Error accessing history files
We sporadically (like once per few hours) have an error message when we access the history plots through mhttpd:

07:21:35.109 2023/08/03 [mhttpd,ERROR] [history_schema.cxx:2345:FileHistory::read_data,ERROR] Cannot read '/data2/history/mhf_1690890685_20230801_dc_hv.dat', read() errno 2 (No such file or directory)

When I log in to the machine, I properly see the file and also can access it

[meg@megon02 history]$ ls -l mhf_1690890685_20230801_dc_hv.dat
-rw-rw-r--. 1 meg meg 34176312 Aug 3 07:23 mhf_1690890685_20230801_dc_hv.dat

and I also can dump that file.

When I try again with mhttpd, I properly see that file.

Now in principle this is not a problem, but the error message is annoying, since this is the only error we get in 24 hours. I attached a 24h log to see what I mean. If this is an OS issue, I wonder if we should add code to retry the file access in case we get that error.

Anybody seen a similar thing?

Best,
Stefan
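The retry idea raised in this post could look roughly like the following Python sketch. It is an illustration only, not a proposed patch for mhttpd: read_with_retry, the read_once callable (assumed to return None on failure) and the attempt and delay values are all hypothetical.

```python
import time

def read_with_retry(read_once, attempts=3, delay=0.1, sleep=time.sleep):
    """Call read_once() until it returns data or the attempts run out.

    Returns (data, number of attempts used); data is None if every attempt failed.
    """
    for attempt in range(1, attempts + 1):
        data = read_once()
        if data is not None:
            return data, attempt
        if attempt < attempts:
            sleep(delay)    # give the writer a moment before trying again
    return None, attempts

# Simulated reader that fails twice and then succeeds.
results = iter([None, None, b"record"])
print(read_with_retry(lambda: next(results), delay=0.0))
```

A retry hides the symptom rather than explaining it, so logging the number of attempts used would keep the underlying problem visible.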

ELOG V3.1.4-2e1708b5