Mirror/zeek - git.uphillsecurity.com: We code.

mirror of https://github.com/zeek/zeek.git synced 2025-10-09 10:08:20 +00:00

Author	SHA1	Message	Date
Arne Welzel	f0e67022fd	logging: Introduce Log::delay() and Log::delay_finish() This is a verbose, opinionated and fairly restrictive version of the log delay idea. Main drivers are explicitly, foot-gun-avoidance and implementation simplicity. Calling the new Log::delay() function is only allowed within the execution of a Log::log_stream_policy() hook for the currently active log write. Conceptually, the delay is placed between the execution of the global stream policy hook and the individual filter policy hooks. A post delay callback can be registered with every Log::delay() invocation. Post delay callbacks can (1) modify a log record as they see fit, (2) veto the forwarding of the log record to the log filters and (3) extend the delay duration by calling Log::delay() again. The last point allows to delay a record by an indefinite amount of time, rather than a fixed maximum amount. This should be rare and is therefore explicit. Log::delay() increases an internal reference count and returns an opaque token value to be passed to Log::delay_finish() to release a delay reference. Once all references are released, the record is forwarded to all filters attached to a stream when the delay completes. This functionality separates Log::log_stream_policy() and individual filter policy hooks. One consequence is that a common use-case of filter policy hooks, removing unproductive log records, may run after a record was delayed. Users can lift their filtering logic to the stream level (or replicate the condition before the delay decision). The main motivation here is that deciding on a stream-level delay in per-filter hooks is too late. Attaching multiple filters to a stream can additionally result in hard to understand behavior. On the flip side, filter policy hooks are guaranteed to run after the delay and can be used for further mangling or filtering of a delayed record.	2023-11-29 11:53:11 +01:00
Arne Welzel	3afd6242c7	logging/Manager: Split Write() If we delay in the stream policy hook, we'll need to resume writing to the attached filters later on. Prepare for that by splitting out the filter processing.	2023-11-29 11:53:11 +01:00
Benjamin Bannier	f5a76c1aed	Reformat Zeek in Spicy style This largely copies over Spicy's `.clang-format` configuration file. The one place where we deviate is header include order since Zeek depends on headers being included in a certain order.	2023-10-30 09:40:55 +01:00
Vern Paxson	4600ca41f6	logging speedup by switching to raw record access	2023-04-10 11:43:19 -07:00
Arne Welzel	69a98e2cbb	logging: Add telemetry for streams and log writers This adds one metric per log stream and one metric per log writer (path based) to track the number of writes on a stream level as well as on a writer level. $ curl -sSf localhost:8181/metrics \| grep Conn zeek_log_writer_writes_total{endpoint="",filter-name="default",module="HTTP",path="http",stream="HTTP::LOG",writer="Log::WRITER_SQLITE"} 1 1677497572770 zeek_log_stream_writes_total{endpoint="",module="HTTP",stream="HTTP::LOG"} 1 1677497572770 The initial version of this change also included metrics around log write vetoes, but given no log policies exist in the default configuration and they are mostly interesting for a few streams/writers only, skip this for now. These can always be added by the script writer, too. The difference between the stream level writes and concrete writers can be used to deduce the number of vetoes (or errors) as a starting point.	2023-02-27 12:51:03 +01:00
Josh Soref	cd201aa24e	Spelling src These are non-functional changes. * accounting * activation * actual * added * addresult * aggregable * aligned * alternatively * ambiguous * analysis * analyzer * anticlimactic * apparently * application * appropriate * arithmetic * assignment * assigns * associated * authentication * authoritative * barrier * boundary * broccoli * buffering * caching * called * canonicalized * capturing * certificates * ciphersuite * columns * communication * comparison * comparisons * compilation * component * concatenating * concatenation * connection * convenience * correctly * corresponding * could * counting * data * declared * decryption * defining * dependent * deprecated * detached * dictionary * directional * directly * directory * discarding * disconnecting * distinguishes * documentation * elsewhere * emitted * empty * endianness * endpoint * enumerator * essentially * evaluated * everything * exactly * execute * explicit * expressions * facilitates * fiddling * filesystem * flag * flagged * for * fragments * guarantee * guaranteed * happen * happening * hemisphere * identifier * identifies * identify * implementation * implemented * implementing * including * inconsistency * indeterminate * indices * individual * information * initial * initialization * initialize * initialized * initializes * instantiate * instantiated * instantiates * interface * internal * interpreted * interpreter * into * it * iterators * length * likely * log * longer * mainly * mark * maximum * message * minimum * module * must * name * namespace * necessary * nonexistent * not * notifications * notifier * number * objects * occurred * operations * original * otherwise * output * overridden * override * overriding * overwriting * ownership * parameters * particular * payload * persistent * potential * precision * preexisting * preservation * preserved * primarily * probably * procedure * proceed * process * processed * processes * processing * propagate * propagated * prototype * provides * publishing * purposes * queue * reached * reason * reassem * reassemble * reassembler * recommend * record * reduction * reference * regularly * representation * request * reserved * retrieve * returning * separate * should * shouldn't * significant * signing * simplified * simultaneously * single * somebody * sources * specific * specification * specified * specifies * specify * statement * subdirectories * succeeded * successful * successfully * supplied * synchronization * tag * temporarily * terminating * that * the * transmitted * true * truncated * try * understand * unescaped * unforwarding * unknown * unknowndata * unspecified * update * usually * which * wildcard Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com>	2022-11-09 12:08:15 -05:00
Vern Paxson	d758585e42	updated Bro->Zeek in comments in the source tree	2022-01-24 14:26:20 -08:00
Tim Wojtulewicz	d50dade24c	GH-1768: Properly cleanup existing log stream when recreated on with the same ID	2021-12-03 13:46:28 -07:00
Tim Wojtulewicz	331161138a	Unify all of the Tag types into one type - Remove tag types for each component type (analyzer, etc) - Add deprecated versions of the old types - Remove unnecessary tag element from templates for TaggedComponent and ComponentManager - Enable TaggedComponent to pass an EnumType when initializing Tag objects - Update some tests that are affected by the tag enum values changing order	2021-11-23 19:36:49 -07:00
Tim Wojtulewicz	b2f171ec69	Reformat the world	2021-09-16 15:35:39 -07:00
Christian Kreibich	795a7ea98e	Add a global log policy hook to the logging framework This addresses the need for a central hook on any log write, which wasn't previously doable without a lot of effort. The log manager invokes the new Log::log_stream_policy hook prior to any filter-specific hooks. Like filter-level hooks, it may veto a log write. Even when it does, filter-level hooks still get invoked, but cannot "un-veto". Includes test cases.	2021-07-02 12:42:45 -07:00
Tim Wojtulewicz	4ad08172d0	Remove obsolete ZEEK_FORWARD_DECLARE_NAMESPACED macros	2021-02-24 14:35:44 -07:00
Tim Wojtulewicz	0618be792f	Remove all of the random single-file deprecations These are the changes that don't require a ton of changes to other files outside of the original removal.	2021-01-27 10:52:40 -07:00
Tim Wojtulewicz	96d9115360	GH-1079: Use full paths starting with zeek/ when including files	2020-11-12 12:15:26 -07:00
Tim Wojtulewicz	fe0c22c789	Base: Clean up explicit uses of namespaces in places where they're not necessary. This commit covers all of the common and base classes.	2020-08-24 12:07:00 -07:00
Tim Wojtulewicz	4b61d60e80	Fix indentation of namespaced aliases	2020-08-20 16:11:46 -07:00
Tim Wojtulewicz	45b5c6e619	Move logging code to zeek namespaces	2020-08-20 15:55:17 -07:00
Tim Wojtulewicz	c9ab1f93e7	Move a few low-use classes to namespaces	2020-07-31 16:25:47 -04:00
Jon Siwek	a06ef66edc	Add Log::rotation_format_func and Log::default_rotation_dir options These may be redefined to customize log rotation path prefixes, including use of a directory. File extensions are still up to individual log writers to add themselves during the actual rotation. These new also allow for some simplication to the default ASCII postprocessor function: it eliminates the need for it doing an extra/awkward rename() operation that only changes the timestamp format. This also teaches the supervisor framework to use these new options to rotate ascii logs into a log-queue/ directory with a specific file name format (intended for an external archiver process to monitor separately).	2020-07-07 18:42:37 -07:00
Jon Siwek	11949ce37a	Implement leftover log rotation/archival for supervised nodes This helps prevent a node from being killed/crashing in the middle of writing a log, restarting, and eventually clobbering that log file that never underwent the rotation/archival process. The old `archive-log` and `post-terminate` scripts as used by ZeekControl previously implemented this behavior, but the new logic is entirely in the ASCII writer. It uses ".shadow" log files stored alongside the real log to help detect such scenarios and rotate them correctly upon the next startup of the Zeek process.	2020-07-07 18:39:23 -07:00
Tim Wojtulewicz	64332ca22c	Move all Val classes to the zeek namespaces	2020-06-30 20:48:09 -07:00
Tim Wojtulewicz	137e416a03	Rename BroType to Type	2020-06-10 14:27:36 -07:00
Tim Wojtulewicz	ed13972924	Move Type types to zeek namespace	2020-06-09 17:20:45 -07:00
Johanna Amann	876c803d75	Merge remote-tracking branch 'origin/topic/timw/776-using-statements' * origin/topic/timw/776-using-statements: Remove 'using namespace std' from SerialTypes.h Remove other using statements from headers GH-776: Remove using statements added by PR 770 Includes small fixes in files that changed since the merge request was made. Also includes a few small indentation fixes.	2020-04-09 13:31:07 -07:00
Tim Wojtulewicz	cb01e098df	iosource/threading/input/logging: Replace nulls with nullptr	2020-04-07 16:08:34 -07:00
Tim Wojtulewicz	d53c1454c0	Remove 'using namespace std' from SerialTypes.h This unfortunately cuases a ton of flow-down changes because a lot of other code was depending on that definition existing. This has a fairly large chance to break builds of external plugins, considering how many internal ones it broke.	2020-04-07 15:59:59 -07:00
Tim Wojtulewicz	5a237d3a3f	Use const-references in lots of places (preformance-unnecessary-value-param)	2020-02-11 14:11:18 -08:00
Max Kellermann	0db61f3094	include cleanup The Zeek code base has very inconsistent #includes. Many sources included a few headers, and those headers included other headers, and in the end, nearly everything is included everywhere, so missing #includes were never noticed. Another side effect was a lot of header bloat which slows down the build. First step to fix it: in each source file, its own header should be included first to verify that each header's includes are correct, and none is missing. After adding the missing #includes, I replaced lots of #includes inside headers with class forward declarations. In most headers, object pointers are never referenced, so declaring the function prototypes with forward-declared classes is just fine. This patch speeds up the build by 19%, because each compilation unit gets smaller. Here are the "time" numbers for a fresh build (with a warm page cache but without ccache): Before this patch: 3144.94user 161.63system 3:02.87elapsed 1808%CPU (0avgtext+0avgdata 2168608maxresident)k 760inputs+12008400outputs (1511major+57747204minor)pagefaults 0swaps After this patch: 2565.17user 141.83system 2:25.46elapsed 1860%CPU (0avgtext+0avgdata 1489076maxresident)k 72576inputs+9130920outputs (1667major+49400430minor)pagefaults 0swaps	2020-02-04 20:51:02 +01:00
Dominik Charousset	c1f3fe7829	Switch from header guards to pragma once	2019-09-17 14:10:30 +02:00
Johanna Amann	dcd6454530	Remove RemoteSerializer and related code/types. Also removes broccoli from the source tree.	2019-05-03 15:00:13 -07:00
Robin Sommer	fe7e1ee7f0	Merge topic/actor-system throug a squashed commit.	2018-05-18 22:39:23 +00:00
Johanna Amann	1f2bf50b49	Remove unimplemented & unused functions from header files. All of these functions were defined in header files without ever being implemented or used.	2018-03-16 18:38:04 -07:00
Robin Sommer	5cf7803e68	Fix some minor issues. From Daniel, thanks!	2017-02-23 17:18:43 -08:00
Robin Sommer	511ca9e043	Adding Broker ifdefs for new remote logging code.	2017-02-17 16:28:20 -08:00
Robin Sommer	a5e9a535a5	Changing semantics of Broker's remote logging to match old communication framework. Broker had changed the semantics of remote logging: it sent over the original Bro record containing the values to be logged, which on the receiving side would then pass through the logging framework normally, including triggering filters and events. The old communication system however special-cases logs: it sends already processed log entries, just as they go into the log files, and without any receiver-side filtering etc. This more efficient as it short-cuts the processing path, and also avoids the more expensive Val serialization. It also lets the sender determine the specifics of what gets logged (and how). This commit changes Broker over to now use the same semantics as the old communication system. TODOs: - The new Broker code doesn't have consistent #ifdefs yet. - Right now, when a new log receiver connects, all existing logs are broadcasted out again to all current clients. That doesn't so any harm, but is unncessary. Need to add a way to send the existing logs to just the new client.	2017-02-10 18:46:45 -08:00
Jon Siwek	b06d82cced	broker integration: add API documentation (broxygen/doxygen) Also changed asynchronous data store query code a bit; trying to make memory management and handling of corner cases a bit clearer (former maybe could still be better, but I need to lookup queries by memory address to associate response cookies to them, and so wrapping pointers kind of just gets in the way).	2015-02-17 10:50:57 -06:00
Jon Siwek	2b598e3d5a	broker integration: add remote logging It now works a bit differently than before: whether to send a remote log write is now a property of the logging stream, not the logging filter and it's now up the the receiver side filters to instantiate the desired writer. i.e. the sender now has no say in what the receiver should use as the log writer backend. Under the new style of remote logging, the "Log::enable_remote_logging" option is repurposed to set the default behavior for new logging streams. There's also "Comm::{enable,disable}_remote_logging()" to explicitly set the desired behavior for a given logging stream. To receive remote logs, one calls "Comm::subscribe_to_logs(<topic>)", where senders implicitly use topics of the form "bro/log/<stream id>".	2015-01-26 14:24:42 -06:00
Robin Sommer	f4cbcb9b03	Converting log writers and input readers to plugins.	2014-07-20 19:17:58 +02:00
Bernhard Amann	65b56479d2	(hopefully) fix mutex lock problem. log writers were removed on shutdown while frontends still had pointers to it. A similar fix will be necessary for the input framework (tomorrow :) )	2013-05-17 14:08:43 -07:00
Robin Sommer	4b86730ef2	Reintroducing the logging::Manager's Terminate() method. It doesn't do anything else than simply forwarding to FlushBuffers(). This is just for consistency in terminate_bro() where components get their Terminate() called so that the main code doesn't need to know anything more specific about what particular action to take at shutdown.	2013-05-15 17:19:52 -07:00
Bernhard Amann	39f1b9e01f	Change thread shutdown again to also work with input framework. Seems to work, tests pass, but not really verified. Major change 1: finished flag in MsgThread was replaced by 2 flags: child_finished and main_finished. child_finished is set by child_thread and means that the processing loop is stopped immediately (no longer needed, no new input messages will be processed, if loop continues running there is an ugly delay on shutdown). (This took me a while to realize...) main_finished is set by a message that is sent back by the child to the main thread when Finished() is called (and child_finished is set). when main_finished is set, processing of output messages stops. But all messages that the child thread pushed in the queue before calling Finish() are still processed. Change 2: Logging terminate call was replaced by a smaller call that just flushes out the cache held by the main thread. This call has to be done before thread shutdown is called - otherwhise the threads will be shut down before all messages are pushed on them. (This also took me a while to realize...). Change 3: Input framework actually calls it stop methods correctly (everything was prepared, function call was missing)	2013-05-14 23:45:55 -07:00
Robin Sommer	38e1dc9ca4	Support for cleaning up threads that have terminated. Once a BasicThread leaves its run() method, a thread is now marked for cleaning up, and the ThreadMgr will soon join it to release the OS resources. Also, adding a function Log::remove_stream() that remove a logging stream, stopping all writer threads that are associated with it. Note, however, that removing a filter from a stream still doesn't clean up any threads. The problem is that because of the output paths potentially being created dynamically it's unclear if the writer thread will still be needed in the future. We could add clean writers up with timeouts, but that doesn't sound great either. So for now, the only way to sure clean up logging threads is to remove the entire stream. Also note that cleanup doesn't work with input threads yet, which don't seem to terminate (at least in the case I tried).	2013-03-14 14:59:05 -07:00
Jon Siwek	7b2c3db488	Improve log filter compatibility with remote logging. If a log filter attempts to write to a path for which a writer is already instantiated due to remote logging, it will re-use the writer as long as the fields of the filter and writer are compatible, else the filter path will be auto-adjusted to not conflict with existing writer's. Conflicts between two local filters are still always auto-adjusted even if field types agree (since they could still be semantically different). Addresses #842.	2012-07-30 13:17:49 -05:00
Robin Sommer	4ba038070f	Tweaking writer API for failed rotations. There are now two FinishedRotation() methods, one that triggers post-processing and one that doesn't. There's also insurance built in against a writer not calling either (or both), in which case we abort with an internal error.	2012-07-28 16:38:22 -07:00
Jon Siwek	4359bf6b42	Fix log manager hanging on waiting for pending file rotations. This changes writer implementations to always respond to rotation messages in their DoRotate() method, even for failure/no-op cases with a new RotationFailedMessage. This informs the manager to decrement its count of pending rotations. Addresses #860.	2012-07-28 16:23:59 -07:00
Jon Siwek	2fafadd930	Fix differing log filters of streams from writing to same writer/path. Since WriterFrontend objects are looked up internally by writer type and path, and they also expect to write consistent field arguments, it could be the case that more than one filter of a given stream attempts to write to the same path (derived either from $path or $path_func fields of the filter) with the same writer type. This won't work, so now WriterFrontend objects are bound to the filter that instantiated them so that we can warn about other filters attempting to write to the conflicting writer/path and the write can be skipped. Remote logs don't appear to suffer the same issue due to pre-filtering. Addresses #842.	2012-07-25 12:20:12 -05:00
Robin Sommer	87e10b5f97	Further threading and API restructuring for logging and input frameworks. There were a number of cases that weren't thread-safe. In particular, we don't use std::string anymore for anything that's passed between threads (but instead plain old const char*, with manual memmory managmenet). This is still a check-point commit, I'll do more testing.	2012-07-19 22:28:30 -07:00
Robin Sommer	b38d1e1ec2	Reworking log writer API to make it easier to pass additional information to a writer's initialization method. However, for now the information provided is still the same.	2012-06-21 11:57:45 -07:00
Robin Sommer	5dae925f67	Fixing a rotation race condition at termination. Noticed with DS, but could just as well happen with ASCII.	2012-05-16 18:24:55 -07:00
Robin Sommer	952b6b293a	Merging in DataSeries support from topic/gilbert/logging. I copied the code over manually, no merging, because (1) it needed to be adapted to the new threading API, and (2) there's more stuff in the branch that I haven't ported yet. The DS output generally seems to work, but it has seen no further testing yet. Not unit tests yet either.	2012-04-03 22:14:56 -07:00

1 2

58 commits