ELOG Midas

Midas DAQ System, Page 144 of 152

3034 Entries

ID	Date	Author	Topic	Subject
2405	16 May 2022	Konstantin Olchanski	Bug Fix	mserver buffer overrun and crash
> There is a memory allocation bug in the mserver.
The fix for this problem introduced a new problem, an infinite loop in bm_flush_cache, bitbucket bugs https://bitbucket.org/tmidas/midas/issues/339/infinite-loop-in-mserver-due-to-mfes and https://bitbucket.org/tmidas/midas/issues/331/stuck-semaphore-of-system-buffer
This is now fixed and the buffer write cache logic and size were rejigged according to the calculations in https://daq00.triumf.ca/elog-midas/Midas/2401
The event buffer write cache size (as set via ODB Equipment/Common and via bm_set_cache_size()) now takes 2 possible values:
0 - write cache is disabled
MIN_WRITE_CACHE_SIZE - (10 Mbytes) minimum permitted cache size
bigger cache size values are permitted, up to buffer_size/3, but are probably not useful if my calculations are right. smaller cache size values are generally not useful, if my calculations are right.
mfe.c and tmfe c++ frontends were updated to request the new write cache size by default.
if events are getting stuck in the write cache for too long, instead of reducing the cache size, one should increase the frequency of bm_flush_cache() calls (1/sec by default).
commit 373bcc3ab7f83c3c7bf6c051c237de043a982502
K.O.
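The cache size rule from entry 2405 can be sketched as a small helper. This is only an illustration, not the MIDAS implementation: the name `normalize_write_cache_size` is hypothetical, the constant merely mirrors the "10 Mbytes" quoted in the entry (the exact byte count is an assumption), and rounding small non-zero requests up to the minimum instead of rejecting them is also an assumption.

```cpp
#include <algorithm>
#include <cstddef>

// Assumption: "10 Mbytes" is taken as 10 * 1024 * 1024 bytes; the value of
// MIN_WRITE_CACHE_SIZE in midas may be defined differently.
constexpr std::size_t kMinWriteCacheSize = 10u * 1024u * 1024u;

// Hypothetical helper: map a requested write cache size onto the values
// the entry describes as permitted.
std::size_t normalize_write_cache_size(std::size_t requested, std::size_t buffer_size)
{
   if (requested == 0)
      return 0; // 0 means the write cache is disabled
   // Assumption: small non-zero requests are rounded up to the minimum.
   std::size_t size = std::max(requested, kMinWriteCacheSize);
   // Bigger values are permitted only up to one third of the buffer size.
   return std::min(size, buffer_size / 3);
}
```

With a 64 Mbyte buffer, a request of 1000 bytes would come back as the minimum, and a request of 50 Mbytes would be capped at one third of the buffer.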
2433	19 Aug 2022	Konstantin Olchanski	Bug Fix	"Detected duplicate or non-monotonous data" in history files
a serious (but rare) bug was fixed in the history reader.
an unlucky experiment would see errors about "Detected duplicate or non-monotonous data" in some history file, fixed by removing/renaming the offending file. (reported by the MEG experiment)
it turns out there was nothing wrong with the data files (good), but there was a nasty bug in the history reader: it did not ensure that history files are read in chronological order. under some conditions the order of files could be reversed, older files would be read after newer files and trip the built-in protection against returning non-monotonically increasing history data to the user.
fixed in commit https://bitbucket.org/tmidas/midas/commits/9893f85ebe33e96cc63f501a0f89e1f8932c894d
for more details, see https://bitbucket.org/tmidas/midas/issues/350/file-history-non-monotonic-time
K.O.
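The idea behind the fix, reading files in chronological order so that the monotonicity protection is not tripped, can be sketched as follows. The `HistoryFile` struct and both function names are hypothetical and are not taken from the MIDAS history code; the actual commit may solve this differently.

```cpp
#include <algorithm>
#include <ctime>
#include <string>
#include <vector>

// Hypothetical description of one history file: its name and the
// timestamp of the first record it contains.
struct HistoryFile {
   std::string name;
   std::time_t first_time;
};

// Put the files into chronological order before reading any of them.
void sort_chronologically(std::vector<HistoryFile>& files)
{
   std::stable_sort(files.begin(), files.end(),
                    [](const HistoryFile& a, const HistoryFile& b) {
                       return a.first_time < b.first_time;
                    });
}

// Reader-side protection: timestamps returned to the user must be strictly
// increasing. Returns false if a duplicate or a step backwards is found.
bool is_strictly_increasing(const std::vector<std::time_t>& times)
{
   return std::adjacent_find(times.begin(), times.end(),
                             [](std::time_t a, std::time_t b) { return b <= a; })
          == times.end();
}
```

If the files are read in reversed order, the concatenated timestamps step backwards at the file boundary and the second check fails even though each file is fine on its own, which matches the symptom described above.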
2436	23 Aug 2022	Konstantin Olchanski	Bug Fix	"Detected duplicate or non-monotonous data" in history files
> serious (but rare) bug was fixed in the history reader.
previous fix was incomplete. please update to git commit https://bitbucket.org/tmidas/midas/commits/b343c3c98e4e6fd00a00cf686c74c7ccc6da0c63
K.O.
2447	11 Nov 2022	Frederik Wauters	Bug Fix	O_CREAT in open in split.cxx
midas currently does not compile on linux
/usr/include/x86_64-linux-gnu/bits/fcntl2.h:50:24: error: call to ‘__open_missing_mode’ declared with attribute error: open with O_CREAT or O_TMPFILE in second argument needs 3 arguments
   50 | __open_missing_mode ();
giving the mode is mandatory: https://man7.org/linux/man-pages/man2/open.2.html
fix is to give open in midas/examples/lowlevel/split.cxx a default mode, e.g. 006600
2448	12 Nov 2022	Stefan Ritt	Bug Fix	O_CREAT in open in split.cxx
> midas currently does not compile on linux
>
> /usr/include/x86_64-linux-gnu/bits/fcntl2.h:50:24: error: call to ‘__open_missing_mode’ declared with attribute error: open with O_CREAT or O_TMPFILE in second argument needs 3 arguments
>    50 | __open_missing_mode ();
>
> giving the mode is mandatory: https://man7.org/linux/man-pages/man2/open.2.html
>
> fix is to give open in midas/examples/lowlevel/split.cxx a default mode, e.g. 006600
Thanks. Fixed.
Stefan
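For reference, a minimal sketch of the three-argument form of open() that the compiler error asks for. The function name `create_output_file` is hypothetical, and the mode 0644 is an illustrative choice, not necessarily the value used in the actual fix to split.cxx.

```cpp
#include <fcntl.h>
#include <sys/stat.h>
#include <sys/types.h>
#include <unistd.h>

// Create (or truncate) a file for writing. With O_CREAT the third "mode"
// argument is mandatory. 0644 (owner read/write, others read) is an
// assumption made for this sketch; the effective permissions are further
// reduced by the process umask.
int create_output_file(const char* path)
{
   return open(path, O_WRONLY | O_CREAT | O_TRUNC, 0644);
}
```

The function returns a file descriptor, or -1 with errno set if the file cannot be created.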
2449	17 Nov 2022	Konstantin Olchanski	Bug Fix	O_CREAT in open in split.cxx
> > midas currently does not compile on linux
> > fix is to give open in midas/examples/lowlevel/split.cxx a default mode, e.g. 006600
I got more warnings from split.cxx, looked at the code and see so many problems that it is easier to delete it than it is to fix it.
Check for end of file is done incorrectly (check for read() return 0, -1 or short read), memory overrun if given file name is longer than 80 bytes, no check for valid event length read from the file, and so on and so on.
A better example for reading and writing midas files is in midasio/test_midasio.cxx. Proper c++ coding, and can read compressed files.
K.O.
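The three read() outcomes mentioned above (0, -1, short read) can be handled in one helper. This is a generic POSIX sketch, not code from split.cxx or midasio; the name `read_full` is hypothetical.

```cpp
#include <cerrno>
#include <cstddef>
#include <unistd.h>

// Read exactly "count" bytes unless end of file or an error intervenes.
// Returns the number of bytes read (fewer than "count" means end of file
// was reached first), or -1 on error with errno set.
ssize_t read_full(int fd, void* buf, std::size_t count)
{
   char* p = static_cast<char*>(buf);
   std::size_t done = 0;
   while (done < count) {
      ssize_t n = read(fd, p + done, count - done);
      if (n == 0)
         break; // end of file
      if (n < 0) {
         if (errno == EINTR)
            continue; // interrupted by a signal, retry
         return -1;   // real error
      }
      done += static_cast<std::size_t>(n); // short read, keep going
   }
   return static_cast<ssize_t>(done);
}
```

A caller reading a fixed-size event header would treat a return value of 0 as a clean end of file, a value between 0 and the header size as a truncated file, and -1 as an I/O error.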
2450	17 Nov 2022	Konstantin Olchanski	Bug Fix	"Detected duplicate or non-monotonous data" in history files
> > serious (but rare) bug was fixed in the history reader.
> previous fix was incomplete. please update to git commit
> https://bitbucket.org/tmidas/midas/commits/b343c3c98e4e6fd00a00cf686c74c7ccc6da0c63
a race condition between reading the history file in mhttpd and writing the history file in mlogger was accidentally introduced. mhttpd would report spurious errors about "timestamp is after last timestamp".
fixed, please update to git commit https://bitbucket.org/tmidas/midas/commits/7a9f6e0c58ffddcacb9ee19934ce3e2033a805ef
fix race condition in history file reader - a race condition was added accidentally - first the reader remembers the history file size and the time of the last entry, then it goes to read the file and bombs if at the same time mlogger added more entries - their time is after the remembered time of the last entry and the error "timestamp is after last timestamp" is triggered.
K.O.
2580	09 Aug 2023	Konstantin Olchanski	Bug Fix	Stefan's improved ODB flush to disk
This is an important improvement, should have a post of its own. K.O.
> > > RFE filed:
> > > https://bitbucket.org/tmidas/midas/issues/367/odb-should-be-saved-to-disk-periodically
> >
> > Implemented and closed: https://bitbucket.org/tmidas/midas/issues/367/odb-should-be-saved-to-disk-periodically
> >
> > Stefan
>
> Stefan's comments from the closed bug report:
>
> Ok I implemented some periodic flushing. Here is what I did:
>
> Created
>
> /System/Flush/Flush period : TID_UINT32
> /System/Flush/Last flush : TID_UINT32
>
> which control the flushing to disk. The default value for “Flush period” is 60 seconds or one minute.
>
> - All clients call db_flush_database() through their cm_yield() function
> - db_flush_database() checks the “Last flush” and only flushes the ODB when the period has expired. This test is done inside the ODB semaphore so that we don’t get a race condition
> - If the period has expired, db_flush_database() calls ss_shm_flush()
> - ss_shm_flush() tries to allocate a buffer of the size of the shared memory. If the allocation is not successful (out of memory), ss_shm_flush() writes directly to the binary file as before.
> - If the allocation is successful, ss_shm_flush() copies the shared memory to a buffer and passes this buffer to a dedicated thread which writes the buffer to the binary file. This causes ss_shm_flush() to return immediately and not block the calling program during the disk write operation.
> - Added back the “if (destroy_flag) ss_shm_flush()” so that the ODB is flushed for sure before the shared memory gets deleted.
> - This means now that under normal circumstances, exiting programs like odbedit do NOT flush the ODB. This allows calling many “odbedit -c” in a row without the flush penalty. Nevertheless, the ODB then gets flushed by other clients at the latest 60 seconds (or whatever the flush period is) after odbedit exits.
>
> Please note that ODB flushing has two purposes:
>
> - When all programs exit, we need a persistent storage for the ODB. In most experiments this only happens very seldom. Maybe at the end of a beam time period.
> - If the computer crashes, a recent version of the ODB is kept on disk to simplify recovery after the crash.
>
> Since crashes are not so frequent (during production periods we have maybe one hardware failure every few years), flushing the ODB too often does not make sense and just consumes resources. Flushing also does not help with corrupted ODBs, since the binary image will also get corrupted. So the only reason for periodic flushes is to ease recovery after a total crash. I put the default to 60 seconds, but if people are really paranoid they can decrease it to 10 seconds or so. Or increase it to 600 seconds if their system does not crash every week and disks are slow.
>
> I made a dedicated branch feature/periodic_odb_flush so people can test the new functionality. If there are no complaints within the next few days, I will merge that into develop.
>
> Stefan
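The flushing scheme described in Stefan's comments (period test under the lock, copy of the shared memory, disk write on a separate thread) can be sketched in standard C++. This is only an illustration of the idea: the `Database` struct and `flush_if_due` are hypothetical names, a std::mutex stands in for the ODB semaphore, and the out-of-memory fallback to a direct write is left out.

```cpp
#include <cstdint>
#include <mutex>
#include <thread>
#include <utility>
#include <vector>

// Hypothetical stand-in for the ODB shared memory and its flush bookkeeping.
struct Database {
   std::mutex lock;                 // plays the role of the ODB semaphore
   std::vector<char> shared_memory; // plays the role of the ODB image
   std::uint32_t flush_period = 60; // seconds, like "/System/Flush/Flush period"
   std::uint32_t last_flush = 0;    // seconds, like "/System/Flush/Last flush"
};

// Called regularly by every client. If the flush period has expired, a copy
// of the image is handed to a new thread that runs write_image(copy), and
// that thread is returned. Otherwise a non-joinable thread is returned.
template <typename Writer>
std::thread flush_if_due(Database& db, std::uint32_t now, Writer write_image)
{
   std::vector<char> copy;
   {
      std::lock_guard<std::mutex> guard(db.lock);
      // The period test is done under the lock so that two clients cannot
      // both decide to flush at the same moment.
      if (now - db.last_flush < db.flush_period)
         return std::thread();
      db.last_flush = now;
      copy = db.shared_memory; // snapshot taken while holding the lock
   }
   // The slow disk write happens on its own thread, so the caller is not
   // blocked while the copy is written out.
   return std::thread(write_image, std::move(copy));
}
```

The caller keeps the returned thread and joins it later (for example before exiting) if it is joinable; how MIDAS itself manages the lifetime of its writer thread is not described in the entry.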
2614	03 Oct 2023	Konstantin Olchanski	Bug Fix	wrong array size after loading xml or json file
both the xml and the json decoders have a bug (fix pending). loading a saved odb from an xml or json file did not truncate arrays in odb to the size of the arrays in the file.
for example, if /example/double_array has size 20 in odb, but size 5 in the xml or json file, after loading the file, the array size is still 20.
this is unexpected: after loading an odb save file we expect odb to return to the same state as when the odb save file was created. we do not expect some arrays to have half of their elements restored from file and half of their elements left unchanged.
save and restore from .odb file does not have this problem.
I think this is a bug and I committed (but did not yet push) a fix for both the xml and the json odb decoders.
I ran into this problem while writing the new history panel editor, where deleting variables did not work because json rpc db_paste() was not truncating any arrays.
I am still finishing up the last few bits of the new history panel editor, and there is a bit of time to discuss and comment on this odb change before I push it to midas.
K.O.
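The difference between the two behaviours can be shown with a plain std::vector standing in for an ODB array. The function `load_array` and its `truncate` flag are hypothetical and exist only for this illustration; they are not part of the MIDAS xml or json decoders.

```cpp
#include <cstddef>
#include <vector>

// Restore an array from a save file. With "truncate" set, the array ends up
// with exactly the saved size (the behaviour argued for in the entry).
// Without it, elements beyond the saved size keep their old values (the
// behaviour described as a bug).
void load_array(std::vector<double>& odb_array, const std::vector<double>& saved, bool truncate)
{
   if (truncate || odb_array.size() < saved.size())
      odb_array.resize(saved.size());
   for (std::size_t i = 0; i < saved.size(); i++)
      odb_array[i] = saved[i];
}
```

With an array of size 20 in memory and 5 elements in the file, the non-truncating load leaves 20 elements of which only the first 5 come from the file, while the truncating load leaves exactly 5.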
2703	05 Feb 2024	Ben Smith	Bug Fix	string --> int64 conversion in the python interface ?
> The symptoms are consistent with a string --> int64 conversion not happening
> where it is needed.
Thanks for the report Pasha. Indeed I was missing a conversion in one place. Fixed now!
Ben
2710	13 Feb 2024	Konstantin Olchanski	Bug Fix	string --> int64 conversion in the python interface ?
> > The symptoms are consistent with a string --> int64 conversion not happening
> > where it is needed.
>
> Thanks for the report Pasha. Indeed I was missing a conversion in one place. Fixed now!
Are we running these tests as part of the nightly build on bitbucket? They would be part of the "make test" target. Correct python dependencies may need to be added to the bitbucket OS image in bitbucket-pipelines.yml. (This is a PITA to get right).
K.O.
2711	14 Feb 2024	Konstantin Olchanski	Bug Fix	added ubuntu-22 to nightly build on bitbucket, now need python!
> Are we running these tests as part of the nightly build on bitbucket? They would be part of
> the "make test" target. Correct python dependencies may need to be added to the bitbucket OS
> image in bitbucket-pipelines.yml. (This is a PITA to get right).
I added ubuntu-22 to the nightly builds, but I notice the build says "no python" and I am not sure what packages I need to install for midas python to work. Ben, can you help me with this?
https://bitbucket.org/tmidas/midas/pipelines/results/1106/steps/%7B9ef2cf97-bd9f-4fd3-9ca2-9c6aa5e20828%7D
K.O.
2792	26 Jul 2024	Lukas Gerritzen	Bug Fix	strlcpy and strlcat added to glibc 2.38
A year ago, these two functions were included in glibc. If trying to compile midas with a recent version of Ubuntu or Fedora, one gets errors like this:
/usr/include/string.h:506:15: error: declaration of ‘size_t strlcpy(char*, const char*, size_t) noexcept’ has a different exception specifier
  506 | extern size_t strlcpy (char *__restrict __dest,
      |               ^~~~~~~
In file included from /home/luk/midas/src/midas.cxx:14:
/home/luk/midas/include/midas.h:2190:17: note: from previous declaration ‘size_t strlcpy(char*, const char*, size_t)’
My proposed solution is a check in midas.h around line 248:
#if (__GLIBC__ > 2) || (__GLIBC__ == 2 && __GLIBC_MINOR__ >= 38)
#ifndef HAVE_STRLCPY
#define HAVE_STRLCPY 1
#endif
#endif
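A sketch of how such a guard can be used: the proposed version check defines HAVE_STRLCPY on glibc 2.38 and later, and the code then either calls the libc function or falls back to its own copy. The wrapper name `bounded_copy` is hypothetical and is not how midas.h is organised; the added `defined(__GLIBC__)` test is a small variation on the proposed check so that non-glibc platforms do not evaluate undefined macros.

```cpp
#include <cstddef>
#include <cstring>

// Version check as proposed in the entry: glibc 2.38 and later already
// provide strlcpy().
#if defined(__GLIBC__) && ((__GLIBC__ > 2) || (__GLIBC__ == 2 && __GLIBC_MINOR__ >= 38))
#ifndef HAVE_STRLCPY
#define HAVE_STRLCPY 1
#endif
#endif

// Hypothetical wrapper with strlcpy() semantics under a name that cannot
// clash with libc: copies at most size-1 bytes, always NUL-terminates when
// size > 0, and returns strlen(src) so that callers can detect truncation.
std::size_t bounded_copy(char* dst, const char* src, std::size_t size)
{
#ifdef HAVE_STRLCPY
   return strlcpy(dst, src, size); // libc version is available
#else
   std::size_t len = std::strlen(src);
   if (size > 0) {
      std::size_t n = (len < size - 1) ? len : size - 1;
      std::memcpy(dst, src, n);
      dst[n] = '\0';
   }
   return len;
#endif
}
```

A return value greater than or equal to the destination size tells the caller that the string was truncated.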
2793	26 Jul 2024	Stefan Ritt	Bug Fix	strlcpy and strlcat added to glibc 2.38
Good catch. I added your code to the current develop branch of MIDAS. Stefan
2839	12 Sep 2024	Konstantin Olchanski	Bug Fix	bitbucket builds repaired
bitbucket builds work again, also added ubuntu-24 and almalinux-9.
two problems fixed:
- cmake file in examples/experiment was replaced by a non-working version
- unannounced change of strlcpy() to mstrlcpy() broke "make remoteonly"
P.S. I should also fix the rootana and the roody bitbucket builds.
K.O.
2840	13 Sep 2024	Konstantin Olchanski	Bug Fix	rootana bitbucket build fixed
rootana bitbucket build is fixed, only a few minor build problems. I am using the root official docker image (which turned out to not work right out of the box because of the missing libvdt-dev package). K.O.
2842	13 Sep 2024	Konstantin Olchanski	Bug Fix	mstrcpy, was: strlcpy and strlcat added to glibc 2.38
for the record, as the ultimate solution, strlcpy() and strlcat() were wholesale replaced by mstrlcpy() and mstrlcat(). this should fix the "missing strlcpy()" problem for good and make midas more consistent across all platforms (including non-linux, non-unix). on my side, I continue replacing these functions with proper std::string operations. K.O.
2967	20 Mar 2025	Konstantin Olchanski	Bug Fix	bitbucket builds fixed
bitbucket automatic builds were broken after mfe.cxx started printing some additional messages added in commit https://bitbucket.org/tmidas/midas/commits/0ae08cd3b96ebd8e4f57bfe00dd45527d82d7a38
this is now fixed.
to check if your changes will break automatic builds, before final push, please do:
make clean
make mini -j
make cmake -j
make test
K.O.
2988	21 Mar 2025	Stefan Ritt	Bug Fix	bitbucket builds fixed
> bitbucket automatic builds were broken after mfe.cxx started printing some additional messages added in commit
> https://bitbucket.org/tmidas/midas/commits/0ae08cd3b96ebd8e4f57bfe00dd45527d82d7a38
>
> this is now fixed. to check if your changes will break automatic builds, before final push, please do:
>
> make clean
> make mini -j
> make cmake -j
> make test
Unfortunately we will break the automatic build each time a program outputs one different character, which might even happen if we add a line of code and a cm_msg() gets produced with a different line number. Is there a standard way to update testexpt.example (like "make testexpt" or so)? Should we trigger the update of testexpt.example before each commit via a hook?
Stefan
2992	21 Mar 2025	Konstantin Olchanski	Bug Fix	bitbucket builds fixed
> > bitbucket automatic builds
>
> Unfortunately we will break the automatic build each time a program outputs one different character, which even might happen if we add a line of code and
> a cm_msg() gets produced with a different line number. Is there a standard way to update testexpt.example (like "make testexpt" or so). Should be trigger
> the update of testexpt.example before each commit via a hook?
Actually line numbers are not logged by messages printed from "make test", so moving code around does not break the test.
Changing what programs output does break the test and this is intentional - somebody must look and confirm that the program output was changed on purpose or because a bug was introduced (or fixed).
Most "make test" things work this way - run programs, compare output to what is expected. Discrepancies are flagged for human examination.
K.O.
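The "run programs, compare output to what is expected" approach can be sketched as a line-by-line comparison. This is a generic illustration with a hypothetical function name, not the way the MIDAS "make test" target is implemented.

```cpp
#include <cstddef>
#include <sstream>
#include <string>

// Compare actual program output with the expected text line by line.
// Returns 0 if they match, otherwise the 1-based number of the first line
// that differs (a missing or extra line counts as a difference).
std::size_t first_difference(const std::string& expected, const std::string& actual)
{
   std::istringstream e(expected), a(actual);
   std::string el, al;
   for (std::size_t line = 1;; line++) {
      bool have_e = static_cast<bool>(std::getline(e, el));
      bool have_a = static_cast<bool>(std::getline(a, al));
      if (!have_e && !have_a)
         return 0; // both ended without a mismatch
      if (have_e != have_a || el != al)
         return line; // flag this line for human examination
   }
}
```

A non-zero result does not say whether the change is a bug or intended; it only points a human at the first line to look at, which is the workflow described above.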
