Mirror/zeek - git.uphillsecurity.com: We code.

mirror of https://github.com/zeek/zeek.git synced 2025-10-02 22:58:20 +00:00

Author	SHA1	Message	Date
Tim Wojtulewicz	0ec2161b04	Add options to filter at the stream level as well as globally	2025-08-12 17:31:28 -07:00
Tim Wojtulewicz	339d46ae26	Add a weird that gets emitted when strings/containers are over the limits	2025-08-12 17:31:28 -07:00
Tim Wojtulewicz	837fde1a08	Add metrics to track string and container fields limited by length	2025-08-12 17:31:28 -07:00
Tim Wojtulewicz	e2e7ab28da	Implement string- and container-length filtering at the log record level	2025-08-12 17:31:28 -07:00
Tim Wojtulewicz	e458da944f	Return weird if a log line is over a configurable size limit	2025-07-21 09:14:52 -07:00
Johanna Amann	0c875220e9	Default canonifier change to only remove first timestamp in line In the past, we used a default canonifier, which removes everything that looks like a timestamp from log files. The goal of this is to prevent logs from changing, e.g., due to local system times ending up in log files. This, however, also has the side-effect of removing information that is parsed from protocols which probably should be part of our tests. There is at least one test (1999 certificates) where the entire test output was essentially removed by the canonifier. GH-4521 was similarly masked by this. This commit changes the default canonifier, so that only the first timestamp in a line is removed. This should skip timestamps that are likely to change while keeping timestamps that are parsed from protocol information. A pass has been made over the tests, with some additional adjustments for cases which require the old canonifier. There are some cases in which we probably could go further and not remove timestamps at all - that, however, seems like a follow-up project.	2025-06-18 15:41:48 +01:00
Arne Welzel	bcca7702cd	btest/logging: Fly-by cleanup	2025-06-16 14:56:30 +02:00
Arne Welzel	45f5a4c1b8	logging/Ascii: Fix abort() for non-existing postrotation functions When looking up the postprocessor function from shadow files, id::find_func() would abort() if the function wasn't available instead of falling back to the default postprocessor. Fix by using id::find() and checking the type explicitly and also adding a strict type check while at it. This issue was tickled by loading the json-streaming-logs package, Zeek creating shadow files containing its custom postprocessor function, then restarting Zeek without the package loaded. Closes #4562	2025-06-16 14:55:49 +02:00
Vern Paxson	614eb8d343	minor BTest maintenance updates for -O gen-C++	2025-05-31 12:52:44 -07:00
Arne Welzel	93813a5079	logging/ascii/json: Make TS_MILLIS signed, add TS_MILLIS_UNSIGNED It seems TS_MILLIS is specifically for Elasticsearch and starting with Elasticsearch 8.2 epoch_millis does (again?) support negative epoch_millis, so make Zeek produce that by default. If this breaks a given deployment, they can switch Zeek back to TS_MILLIS_UNSIGNED. https://discuss.elastic.co/t/migration-from-es-6-8-to-7-17-issues-with-negative-date-epoch-timestamp/335259 https://github.com/elastic/elasticsearch/pull/80208 Thanks to @timo-mue for reporting! Closes #4494	2025-05-30 17:23:29 +02:00
Arne Welzel	9365f71965	btest/frameworks/logging: Use generic cluster-layout.zeek	2025-05-20 20:30:01 +02:00
Arne Welzel	0e327a0c12	testing/btest: Fix double commented @TEST- lines sed -i 's/^# # @/# @/g'	2025-05-06 14:06:29 +02:00
Arne Welzel	85b8c8866b	testing/btest/*zeek: Comment all @TEST lines	2025-04-17 16:30:23 +02:00
Johanna Amann	7b582bc345	Merge remote-tracking branch 'origin/topic/johanna/sqlite-pragmas' * origin/topic/johanna/sqlite-pragmas: Options for SQLite log writer, eliminate duplicate definitions Test synchronous/journal mode options for SQLite log writer Added default options for synchronous and journal mode Support for synchronous and journal_mode	2024-11-27 08:32:08 +00:00
Johanna Amann	d592942ccb	Test synchronous/journal mode options for SQLite log writer Also adds some small tweaks and adds the new feature to NEWS.	2024-11-26 12:26:38 +00:00
Tim Wojtulewicz	87717fed0a	Remove prefix column from telemetry.log	2024-06-04 14:14:58 -07:00
Tim Wojtulewicz	00b24b043a	Set running_under_test for scripts.base.frameworks.logging.telemetry test	2024-06-04 14:14:57 -07:00
Tim Wojtulewicz	017ee4509c	Update telemetry log policy due to the fact that unit will not be filled in anymore	2024-05-31 13:30:31 -07:00
Arne Welzel	e2ce929fa4	logging: Better error messages for invalid Log::delay() calls Add a test for Log::delay() usage within filter policy hooks, too.	2023-11-29 11:53:11 +01:00
Arne Welzel	5e046eee58	logging/Manager: Implement DelayTokenType as an actual opaque With a bit of tweaking in the JavaScript plugin to support opaque types, this will allow the delay functionality to work there, too. Making the LogDelayToken an actual opaque seems reasonable, too. It's not supposed to be user inspected.	2023-11-29 11:53:11 +01:00
Arne Welzel	2dbb467ba2	logging: Implement get_delay_queue_size() Primarily for introspection given that re-delaying may exceed queue sizes.	2023-11-29 11:53:11 +01:00
Arne Welzel	f0e67022fd	logging: Introduce Log::delay() and Log::delay_finish() This is a verbose, opinionated and fairly restrictive version of the log delay idea. Main drivers are explicitness, foot-gun-avoidance and implementation simplicity. Calling the new Log::delay() function is only allowed within the execution of a Log::log_stream_policy() hook for the currently active log write. Conceptually, the delay is placed between the execution of the global stream policy hook and the individual filter policy hooks. A post delay callback can be registered with every Log::delay() invocation. Post delay callbacks can (1) modify a log record as they see fit, (2) veto the forwarding of the log record to the log filters and (3) extend the delay duration by calling Log::delay() again. The last point allows delaying a record by an indefinite amount of time, rather than a fixed maximum amount. This should be rare and is therefore explicit. Log::delay() increases an internal reference count and returns an opaque token value to be passed to Log::delay_finish() to release a delay reference. Once all references are released, the record is forwarded to all filters attached to a stream when the delay completes. This functionality separates Log::log_stream_policy() and individual filter policy hooks. One consequence is that a common use-case of filter policy hooks, removing unproductive log records, may run after a record was delayed. Users can lift their filtering logic to the stream level (or replicate the condition before the delay decision). The main motivation here is that deciding on a stream-level delay in per-filter hooks is too late. Attaching multiple filters to a stream can additionally result in hard-to-understand behavior. On the flip side, filter policy hooks are guaranteed to run after the delay and can be used for further mangling or filtering of a delayed record.	2023-11-29 11:53:11 +01:00
Arne Welzel	d88b147ac9	cluster: Deprecate the Cluster::Node$interface field This field isn't required by a worker and it's certainly not used by a worker to listen on that specific interface. It also isn't required to be set consistently and its use in-tree limited to the old load-balancing script. There's a bif called packet_source() which on a worker will provide information about the actually used packet source. Relates to zeek/zeek#2877.	2023-11-07 16:06:16 +01:00
Arne Welzel	54a08a74da	base/frameworks/spicy: Do not load base/misc/version Unsure what it's used for today and also results in the situation that on some platforms we generate a reporter.log in bare mode, while on others where spicy is disabled, we do not. If we want base/frameworks/version loaded by default, should put it into init-bare.zeek and possibly remove the loading of the reporter framework from it - Reporter::error() would still work and be visible on stderr, just not create a reporter.log.	2023-10-24 13:15:21 +02:00
Tim Wojtulewicz	531276cfe0	Remove LogAscii::logdir (6.1 deprecation)	2023-06-14 10:07:22 -07:00
Tim Wojtulewicz	5a3abbe364	Revert "Merge remote-tracking branch 'origin/topic/vern/at-if-analyze'" This reverts commit `4e797ddbbc`, reversing changes made to `3ac28ba5a2`.	2023-05-31 09:20:33 +02:00
Vern Paxson	e749638380	a number of BTests updated with @if ... &analyze	2023-05-19 13:13:26 -07:00
Arne Welzel	d89f16dfc9	logging: Support rotation_postprocessor_command_env This new table provides a mechanism to add environment variables to the postprocessor execution. Use case is from ZeekControl to inject a suffix to be used when running with multiple logger.	2023-04-17 13:10:14 +00:00
Arne Welzel	a5e7faf564	logging/Manager: Fix crash for rotation format function not returning While working on a rotation format function, ran into Zeek crashing when not returning a value from it, fix and recover the same way as for scripting errors.	2023-04-13 09:23:51 +02:00
Christian Kreibich	4281d704c1	Tighten local-nets filtering in the logging framework's path-func-column-demote test With private addresses treated as local ones, this picked up some private-range flows in the test pcap involved.	2023-03-15 17:01:01 -07:00
Arne Welzel	69a98e2cbb	logging: Add telemetry for streams and log writers This adds one metric per log stream and one metric per log writer (path based) to track the number of writes on a stream level as well as on a writer level. $ curl -sSf localhost:8181/metrics | grep Conn zeek_log_writer_writes_total{endpoint="",filter-name="default",module="HTTP",path="http",stream="HTTP::LOG",writer="Log::WRITER_SQLITE"} 1 1677497572770 zeek_log_stream_writes_total{endpoint="",module="HTTP",stream="HTTP::LOG"} 1 1677497572770 The initial version of this change also included metrics around log write vetoes, but given no log policies exist in the default configuration and they are mostly interesting for a few streams/writers only, skip this for now. These can always be added by the script writer, too. The difference between the stream level writes and concrete writers can be used to deduce the number of vetoes (or errors) as a starting point.	2023-02-27 12:51:03 +01:00
Christian Kreibich	f8dbf70e3b	Tighten the scripts.base.frameworks.logging.hooks test This avoids interference from other log streams in the policy hook test cases, which could cause deviations in output vs baselines depending on build configuration.	2023-02-01 15:12:20 -08:00
Christian Kreibich	b5c8421ac2	Fix two btest-diff checks that couldn't fail :-)	2023-02-01 15:12:20 -08:00
Robin Sommer	04a1ead978	Provide infrastructure to migrate legacy analyzers to Spicy. As initial examples, this branch ports the Syslog and Finger analyzers over. We leave the old analyzers in place for now and activate them iff we compile without any Spicy. Needs `zeek-spicy-infra` branches in `spicy/`, `spicy-plugin/`, `CMake/`, and `zeek/zeek-testing-private`. Note that the analyzer events remain associated with the Spicy plugin for now: that's where they will show up with `-NN`, and also inside the Zeekygen documentation. We switch CMake over to linking the runtime library into the plugin, vs. at the top-level through object libraries.	2023-02-01 11:33:48 +01:00
Arne Welzel	a0aa00fa81	logging: Add event_groups to Stream This commit adds an optional event_groups field to the Logging::Stream record to associate event groups with logging streams. This can be used to disable all event groups of a logging stream when it is disabled. It does require making an explicit connection between the logging stream and the involved groups, however.	2022-12-09 16:59:36 +01:00
Tim Wojtulewicz	d442ea1bb9	egrep reported as obsolete by opensuse-tumbleweed builds	2022-10-27 11:48:43 -07:00
Arne Welzel	654fd9c7da	Remove @load base/frameworks/dpd from tests Now that it's loaded in bare mode, no need to load it explicitly. The main thing that tests were relying on seems to be tracking of c$service for conn.log baselines. Very few were actually checking for dpd.log	2022-08-31 17:00:55 +02:00
Christian Kreibich	8d10cbfb36	Fix requirement check in a logging framework / sqlite btest	2022-07-13 17:20:03 -07:00
Arne Welzel	a2bcb1bf28	sqlite default-logdir test: Remove ls ./logs baseline Observed .sqlite-journal files and missing reporter.sqlite files in CI runs. Subsequently reading the ./test.sqlite file is more reliable and should be good enough.	2022-07-06 22:57:14 +02:00
Arne Welzel	93584c7c7f	logging/sqlite: Recognize Log::default_logdir and place files there if set	2022-07-06 18:54:29 +02:00
Arne Welzel	aaa47a709c	logging: Introduce Log::default_logdir, deprecate LogAscii::logdir and per writer logdir Also modify FormatRotationPath to keep rotated logs within Log::default_logdir unless the rotation function explicitly set dir, e.g. when the user redef'ed default_rotation_interval.	2022-07-06 18:54:29 +02:00
Arne Welzel	513ea7e04f	logging/ascii: Fix .shadow paths when using LogAscii::logdir With the introduction of LogAscii::logdir, log filenames can now include parent directories rather than being plain basenames. Enabling log rotation, leftover log rotation and setting LogAscii::logdir broke due to not handling this situation. This change ensures that .shadow files are placed within the directory where the respective .log file is created. Previously, the .shadow. (or .tmp.shadow.) prefix was simply prepended, yielding nonsensical paths such as .tmp.shadow.foo/bar/packet_filter.log for a logdir of foo/bar. Additionally, respect LogAscii::logdir when searching for leftover log files rather than defaulting to the current working directory. The following quirk exists around LogAscii::logdir, but will be addressed in a follow-up. * By default, logs are currently rotated into the working directory of the process, rather than staying confined within LogAscii::logdir. One of the added tests shows this behavior.	2022-07-06 13:21:21 +02:00
Benjamin Bannier	95aff9a1e3	Include spicy in build.	2022-05-16 09:07:11 +02:00
Christian Kreibich	1aaed1cc2e	Add LogAscii::json_include_unset_fields flag to control unset field rendering The flag controls whether JSON rendering includes unset &optional log fields (F, the default), or includes them with a null value (T).	2021-12-08 17:29:07 -08:00
Tim Wojtulewicz	0a0ed65306	Merge remote-tracking branch 'origin/topic/robin/gh-54-sanitize' * origin/topic/robin/gh-54-sanitize: Sanitize log files names before they go into system().	2021-09-22 12:17:05 -07:00
Seth Hall	a4ceb98bf8	Switch the TSV Zeek logs to be UTF8 by default. There is a paired zeek-testing branch for some updates there.	2021-09-07 09:16:53 -07:00
Tim Wojtulewicz	0369ca01bc	Disable the scripts.base.frameworks.logging.sqlite.simultaneous-writes test under TSan Due to a bug (or intentional code) in SQLite, we disabled enabling the shared cache in sqlite3 if running under ThreadSanitizer (see cf1fefbe0b0a6163b389cc92b5a6878c7fc95f1f). Unfortunately, this has the side-effect of breaking the simultaneous-writes test because the shared cache is disabled. This is hopefully a temporary fix until SQLite fixes the issue on their side.	2021-09-03 10:38:15 -07:00
Christian Kreibich	795a7ea98e	Add a global log policy hook to the logging framework This addresses the need for a central hook on any log write, which wasn't previously doable without a lot of effort. The log manager invokes the new Log::log_stream_policy hook prior to any filter-specific hooks. Like filter-level hooks, it may veto a log write. Even when it does, filter-level hooks still get invoked, but cannot "un-veto". Includes test cases.	2021-07-02 12:42:45 -07:00
Christian Kreibich	0b55c55140	Remove unnecessary -B arguments from Zeek invocations in testsuite Now that Zeek no longer silently accepts -B when not compiled in debug mode, these tests were failing.	2021-06-24 17:05:32 -07:00
Johanna Amann	e0d284ec9f	Merge branch 'logging/script-logdir' of https://github.com/kramse/zeek * 'logging/script-logdir' of https://github.com/kramse/zeek: Copy of ascii-empty test, just changed path in the beginning Logdir: Change requested by 0xxon, no problem Introduce script-land variable that can be used to set logdir. Closes GH-772	2021-06-10 12:19:15 +01:00
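
The 2025-06-18 canonifier commit (0c875220e9) describes switching from removing every timestamp-like value in a log line to removing only the first one. As a rough illustration of that difference, here is a minimal Python sketch. It is not Zeek's actual canonifier script: the regular expression, the placeholder string, and the function names `canonify_all` and `canonify_first` are all assumptions made for this example.

```python
import re

# Assumed pattern (not taken from Zeek): a 10-digit epoch second count
# followed by a fractional part of 1 to 6 digits.
TIMESTAMP = re.compile(r"\b\d{10}\.\d{1,6}\b")

# Hypothetical placeholder used to mask a timestamp.
PLACEHOLDER = "XXXXXXXXXX.XXXXXX"


def canonify_all(line: str) -> str:
    """Old behavior as described: mask everything that looks like a timestamp."""
    return TIMESTAMP.sub(PLACEHOLDER, line)


def canonify_first(line: str) -> str:
    """New behavior as described: mask only the first timestamp in the line."""
    return TIMESTAMP.sub(PLACEHOLDER, line, count=1)


if __name__ == "__main__":
    # First field: a local write time that varies between runs.
    # Last field: a timestamp parsed from protocol data that should stay stable.
    sample = "1750257708.123456\tx509\t1000000000.000000"
    print(canonify_all(sample))
    print(canonify_first(sample))
```

With the sample line, `canonify_all` masks both values, while `canonify_first` keeps the second one, which is the kind of protocol-derived timestamp the commit message says should remain in test baselines.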


172 commits