Mirror/zeek - git.uphillsecurity.com: We code.

mirror of https://github.com/zeek/zeek.git synced 2025-10-05 16:18:19 +00:00

Author	SHA1	Message	Date
Johanna Amann	6b9abe85a7	Add error events to input framework. This change introduces error events for Table and Event readers. Users can now specify an event that is called when an info, warning, or error is emitted by their input reader. This can, e.g., be used to raise notices in case errors occur when reading an important input stream. Example: event error_event(desc: Input::TableDescription, msg: string, level: Reporter::Level) { ... } event bro_init() { Input::add_table([$source="a", $error_ev=error_event, ...]); } For the moment, this converts all errors in the Asciiformatter into warnings (to show that they are non-fatal) - the Reader itself also has to throw an Error to show that a fatal error occurred and processing will be aborted. It might be nicer to change this and require readers to mark fatal errors as such when throwing them. Addresses BIT-1181	2016-07-22 19:45:28 -07:00
Robin Sommer	1ef4daf0a7	Merge remote-tracking branch 'origin/fastpath' * origin/fastpath: Change how input/logging threads set their name. Fix bug when clearing Bloom filter contents.	2014-04-17 17:49:52 -05:00
Jon Siwek	c9b40f1ca7	Change how input/logging threads set their name. Setting the thread name on every heartbeat uses a mild amount of cycles and there's not much benefit to doing it there to get the additional info regarding the number of processed messages since thread names usually get truncated to 16 characters and omit that part anyway.	2014-04-15 16:36:47 -05:00
Robin Sommer	639a6410c6	Merge remote-tracking branch 'origin/topic/bernhard/thread-cleanup' * origin/topic/bernhard/thread-cleanup: and just to be really sure - always make threads go through OnWaitForStop hopefully finally fix last interesting race-condition it is apparently getting a bit late for changes at important code... spoke too soon (forgot to comment in line again). Change thread shutdown again to also work with input framework. Changing semantics of thread stop methods. Support for cleaning up threads that have terminated.	2013-05-15 17:16:41 -07:00
Bernhard Amann	39f1b9e01f	Change thread shutdown again to also work with input framework. Seems to work, tests pass, but not really verified. Major change 1: finished flag in MsgThread was replaced by 2 flags: child_finished and main_finished. child_finished is set by child_thread and means that the processing loop is stopped immediately (no longer needed, no new input messages will be processed, if loop continues running there is an ugly delay on shutdown). (This took me a while to realize...) main_finished is set by a message that is sent back by the child to the main thread when Finished() is called (and child_finished is set). When main_finished is set, processing of output messages stops. But all messages that the child thread pushed in the queue before calling Finish() are still processed. Change 2: Logging terminate call was replaced by a smaller call that just flushes out the cache held by the main thread. This call has to be done before thread shutdown is called - otherwise the threads will be shut down before all messages are pushed on them. (This also took me a while to realize...). Change 3: Input framework actually calls its stop methods correctly (everything was prepared, function call was missing)	2013-05-14 23:45:55 -07:00
Robin Sommer	d11bd56b5d	Changing semantics of thread stop methods. PrepareStop() is now SignalStop() and just signals a thread that it should terminate. After that's called, WaitForStop() (formerly Stop()) waits for it to actually finish processing. When stopping writers during operation, we now no longer wait for them to finish.	2013-03-15 17:57:58 -07:00
Robin Sommer	762c034ec2	Merge remote-tracking branch 'origin/topic/bernhard/input-logging-commmon-functions' * origin/topic/bernhard/input-logging-commmon-functions: add the last of Robin's suggestions (separate info-struct for constructors). port memory leak fix from master harmonize function naming move AsciiInputOutput over to threading and thinking about it, ascii-io doesn't need the separator change constructors and factor stuff out the input framework too. factor out ascii input/output. std::string accessors to escape_sequence functionality intermediate commit - it has been over a month since I touched this... I cleaned up the AsciiInputOutput class somewhat, including renaming it to AsciiFormatter, renaming some of its methods, and turning the static methods into members for consistency. Closes #929.	2013-01-23 16:51:54 -08:00
Bernhard Amann	501328d61a	factor out ascii input/output. First step - factored out everything the logging classes use ( so only output ). Moved the script-level configuration to logging/main, and made the individual writers just refer to it - no idea if this is good design. It works. But I am happy about opinions :) Next step - add support for input...	2012-12-03 12:59:11 -08:00
Robin Sommer	f5862fb014	Preventing writers/readers from receiving further messages after a failure. Once a writer/reader Do* method has returned false, no further ones will be executed anymore. This is primarily a safety mechanism to make it easier for writer/reader authors as otherwise they would often need to track the failure state themselves (because with the now delayed termination from the earlier commit, further messages can now still arrive for a little bit).	2012-07-26 17:27:56 -07:00
Robin Sommer	71fc2a1728	Another small change to MsgThread API. Threads will now reliably get a call to DoFinish() no matter how the thread terminates. This will always be called from within the thread, whereas the destructor is called from the main thread after the child thread has already terminated. Also removing debugging code. However, two problems remain with the ASCII writer (seeing them only on MacOS): - the #start/#end timestamps contain only dummy values right now. The odd thing is that once I enable strftime() to print actual timestamps, I get crashes (even though strftime() is supposed to be thread-safe). - occasionally, there's still output missing in tests. In those cases, the file descriptor apparently goes bad: a write() will suddenly return EBADF for reasons I don't understand yet.	2012-07-22 15:50:12 -07:00
Robin Sommer	87e10b5f97	Further threading and API restructuring for logging and input frameworks. There were a number of cases that weren't thread-safe. In particular, we don't use std::string anymore for anything that's passed between threads (but instead plain old const char*, with manual memory management). This is still a check-point commit, I'll do more testing.	2012-07-19 22:28:30 -07:00
Robin Sommer	f6b883bafc	Further reworking the thread API.	2012-07-19 21:22:28 -07:00
Robin Sommer	f73eb3b086	Reworking thread termination logic. Turns out the finish methods weren't called correctly, caused by a mess up with method names which all sounded too similar and the wrong one ended up being called. I've reworked this by changing the thread/writer/reader interfaces, which actually also simplifies them by getting rid of the requirement for writer backends to call their parent methods (i.e., less opportunity for errors). This commit also includes the following (because I noticed the problem above when working on some of these): - The ASCII log writer now includes "#start <timestamp>" and "#end <timestamp>" lines in each file. The latter supersedes Bernhard's "EOF" patch. This required a number of test updates. The standard canonifier removes the timestamps, but some tests compare files directly, which doesn't work if they aren't printing out the same timestamps (like the comm tests). - The above required yet another change to the writer API to pass network_time to methods. - Renamed ASCII logger "header" options to "meta". - Fixes #763 "Escape # when first character in log file line". All btests pass for me on Linux FC15. Will try MacOS next.	2012-07-19 21:21:53 -07:00
Robin Sommer	61ce9b5412	Checkpoint - all src/ except src/input	2012-05-25 14:05:50 -07:00
Bernhard Amann	3b82d69eb3	Merge remote-tracking branch 'origin/master' into topic/bernhard/input-threads Conflicts: src/CMakeLists.txt testing/btest/Baseline/coverage.bare-load-baseline/canonified_loaded_scripts.log testing/btest/Baseline/coverage.default-load-baseline/canonified_loaded_scripts.log	2012-05-18 15:26:36 -07:00
Robin Sommer	7cc863c5fc	Fix for when not producing local output; that hung. * origin/topic/robin/dataseries: Moving trace for rotation test into traces directory. Fixing a rotation race condition at termination. Portability fixes. Extending DS docs with some examples. Updating doc. Fixing pack_scale and time-as-int. Adding format specifier to DS spec to print out double as %.6f. DataSeries updates and fixes. DataSeries tuning. Tweaking DataSeries support. Extending log post-processor call to include the name of the writer. Removing an unnecessary const cast. DataSeries TODO list with open issues/questions. Starting DataSeries HowTo. Additional test output canonification for ds2txt's timestamps. In threads, an internal error now immediately aborts. DataSeries cleanup. Working on DataSeries support. Merging in DataSeries support from topic/gilbert/logging. Fixing threads' DoFinish() method.	2012-05-17 12:38:47 -07:00
Robin Sommer	99e3c58494	Fixing threads' DoFinish() method. It wasn't called reliably. Now, it's always called before the thread is destroyed (assuming processing has gone normally so far).	2012-04-03 22:12:44 -07:00
Bernhard Amann	fd70560017	Merge remote-tracking branch 'origin/topic/robin/log-threads' into topic/bernhard/input-threads	2012-03-30 11:00:51 -07:00
Bernhard Amann	3405cbdfbd	Introducing - the check if a thread queue might have data. Without locks. Who needs those anyways.	2012-03-30 09:17:16 -07:00
Robin Sommer	d7c9471818	Extending queue statistics.	2012-03-23 15:57:25 -07:00
Robin Sommer	b8ec653ebf	Bugfixes. - Data queued at termination wasn't written out completely. - Fixed some race conditions. - Fixing IOSource integration. - Fixing setting thread names on Linux. - Fixing minor leaks. All tests now pass for me on Linux in debug and non-debug compiles. Remaining TODOs: - Needs leak check. - Test on MacOS and FreeBSD. - More testing: - High volume traffic. - Different platforms.	2012-02-12 13:07:26 -08:00
Robin Sommer	70fe7876a1	Updating thread naming. Also includes experimental code to adapt the thread name as shown by top, but it's untested.	2012-02-03 04:04:38 -08:00
Robin Sommer	4f0fc571ef	Doing bulk writes instead of individual writes now. Also slight change to Writer API, going back to how the rotate methods were before.	2012-02-03 04:04:37 -08:00
Robin Sommer	a428645b2a	Documenting the threading/* classes. Also switching from semaphores to mutexes as the former don't seem to be fully supported on MacOS.	2012-02-03 04:04:37 -08:00
Robin Sommer	e4e770d475	Threaded logging framework. This is based on Gilbert's code but I ended up refactoring it quite a bit. That's why I didn't do a direct merge but started with a new branch and copied things over to adapt. It looks quite a bit different now as I tried to generalize things a bit more to also support the Input Framework. The larger code changes are: - Moved all logging code into subdirectory src/logging/. Code here is in namespace "logging". - Moved all threading code into subdirectory src/threading/. Code here is in namespace "threading". - Introduced a central thread manager that tracks threads and is in charge of termination and (eventually) statistics. - Refactored logging independent threading code into base classes BasicThread and MsgThread. The former encapsulates all the pthread code with simple start/stop methods and provides a single Run() method to override. The latter is derived from BasicThread and adds bi-directional message passing between main and child threads. The hope is that the Input Framework can reuse this part quite directly. - A log writer is now split into a general WriterFrontend (LogEmissary in Gilbert's code) and a type-specific WriterBackend. Specific writers are implemented by deriving from the latter. (The plugin interface is almost unchanged compared to the 2.0 version.) Frontend and backend communicate via MsgThread's message passing. - MsgThread (and thus WriterBackend) has a Heartbeat() method that a thread can override to execute code on a regular basis. It's triggered roughly once a second by the main thread. - Integration into "the rest of Bro". Threads can send messages to the reporter and do debugging output; they are hooked into the I/O loop for sending messages back; and there's a new debugging stream "threading" that logs, well, threading activity. This all seems to work for the most part, but it's not done yet. TODO list: - Not all tests pass yet. In particular, diffs for the external tests seem to indicate some memory problem (no crashes, just an occasional weird character). - Only tested in --enable-debug mode. - Only tested on Linux. - Needs leak check. - Each log write is currently a single inter-thread message. Bring Gilbert's bulk writes back. - Code needs further cleanup. - Document the class API. - Document the internal structure of the logging framework. - Check for robustness: live traffic, aborting, signals, etc. - Add thread statistics to profile.log (most of the code is there). - Customize the OS-visible thread names on platforms that support it.	2012-01-27 17:16:14 -08:00

25 commits