Mirror/zeek - git.uphillsecurity.com: We code.

mirror of https://github.com/zeek/zeek.git synced 2025-10-02 06:38:20 +00:00

Author	SHA1	Message	Date
Arne Welzel	2dbb467ba2	logging: Implement get_delay_queue_size() Primarily for introspection given that re-delaying may exceed queue sizes.	2023-11-29 11:53:11 +01:00
Arne Welzel	f0e67022fd	logging: Introduce Log::delay() and Log::delay_finish() This is a verbose, opinionated and fairly restrictive version of the log delay idea. Main drivers are explicitly, foot-gun-avoidance and implementation simplicity. Calling the new Log::delay() function is only allowed within the execution of a Log::log_stream_policy() hook for the currently active log write. Conceptually, the delay is placed between the execution of the global stream policy hook and the individual filter policy hooks. A post delay callback can be registered with every Log::delay() invocation. Post delay callbacks can (1) modify a log record as they see fit, (2) veto the forwarding of the log record to the log filters and (3) extend the delay duration by calling Log::delay() again. The last point allows to delay a record by an indefinite amount of time, rather than a fixed maximum amount. This should be rare and is therefore explicit. Log::delay() increases an internal reference count and returns an opaque token value to be passed to Log::delay_finish() to release a delay reference. Once all references are released, the record is forwarded to all filters attached to a stream when the delay completes. This functionality separates Log::log_stream_policy() and individual filter policy hooks. One consequence is that a common use-case of filter policy hooks, removing unproductive log records, may run after a record was delayed. Users can lift their filtering logic to the stream level (or replicate the condition before the delay decision). The main motivation here is that deciding on a stream-level delay in per-filter hooks is too late. Attaching multiple filters to a stream can additionally result in hard to understand behavior. On the flip side, filter policy hooks are guaranteed to run after the delay and can be used for further mangling or filtering of a delayed record.	2023-11-29 11:53:11 +01:00
Arne Welzel	37113b4de6	frameworks/software: Fix stale value used for stripping There was some confusion around which value was used subsequent to a strip(), but sub not respecting anchors make it appear to work. Also seems that the `\(?` part seems redundant.	2023-11-17 14:37:28 +01:00
Arne Welzel	cd24acdfc8	time machine: Mark leftovers for removal in v7.1 I suspect we could just drop these directly, but lets follow the deprecation cycle.	2023-11-07 16:06:16 +01:00
Arne Welzel	d88b147ac9	cluster: Deprecate the Cluster::Node$interface field This field isn't required by a worker and it's certainly not used by a worker to listen on that specific interface. It also isn't required to be set consistently and its use in-tree limited to the old load-balancing script. There's a bif called packet_source() which on a worker will provide information about the actually used packet source. Relates to zeek/zeek#2877.	2023-11-07 16:06:16 +01:00
Arne Welzel	54a08a74da	base/frameworks/spicy: Do not load base/misc/version Unsure what it's used for today and also results in the situation that on some platforms we generate a reporter.log in bare mode, while on others where spicy is disabled, we do not. If we want base/frameworks/version loaded by default, should put it into init-bare.zeek and possibly remove the loading of the reporter framework from it - Reporter::error() would still work and be visible on stderr, just not create a reporter.log.	2023-10-24 13:15:21 +02:00
Arne Welzel	af1714853f	http: Prevent request/response de-synchronization and unbounded state growth When http_reply events are received before http_request events, either through faking traffic or possible re-ordering, it is possible to trigger unbounded state growth due to later http_requests never being matched again with responses. Prevent this by synchronizing request/response counters when late requests come in. Also forcefully flush pending requests when http_replies are never observed either due to the analyzer having been disabled or because half-duplex traffic. Fixes #1705	2023-08-28 15:02:58 +02:00
Tim Wojtulewicz	819b79e121	Merge remote-tracking branch 'origin/topic/vern/dyn-sig-actions' * origin/topic/vern/dyn-sig-actions: allow signature actions to be dynamically updated	2023-07-17 16:35:15 -07:00
Vern Paxson	781cc0dcf0	allow signature actions to be dynamically updated	2023-07-13 17:25:32 -07:00
Tim Wojtulewicz	f9904511ab	Merge remote-tracking branch 'origin/topic/awelzel/3145-dcerpc-state-clean' * origin/topic/awelzel/3145-dcerpc-state-clean: dce-rpc: Test cases for unbounded state growth dce-rpc: Handle smb2_close_request() in scripts smb/dce-rpc: Cleanup DCE-RPC analyzers when fid is closed and limit them dce-rpc: Do not repeatedly register removal hooks	2023-07-11 16:17:12 -07:00
Arne Welzel	0d6174a5d6	Remove icmp_conn leftovers Roughly 2.5 years ago all events taking the ``icmp_conn`` parameter were removed with `44ad614094` and the NetVar.cc type not populated anymore. Remove the left-overs in script land, too.	2023-07-04 17:57:20 +02:00
Arne Welzel	6517ed94f2	smb/dce-rpc: Cleanup DCE-RPC analyzers when fid is closed and limit them This patch does two things: 1) For SMB close requests, tear down any associated DCE-RPC analyzer if one exists. 2) Protect from fid_to_analyzer_map growing unbounded by introducing a new SMB::max_dce_rpc_analyzers limit and forcefully wipe the analyzers if exceeded. Propagate this to script land as event smb_discarded_dce_rpc_analyzers() for additional cleanup. This is mostly to fix how the binpac SMB analyzer tracks individual DCE-RPC analyzers per open fid. Connections that re-open the same or different pipe may currently allocate unbounded number of analyzers. Closes #3145.	2023-06-30 15:14:32 +02:00
Arne Welzel	0b317aced3	telemetry: Disable metrics centralization by default Move the telemetry/cluster.zeek file over into policy/frameworks/telemetry/prometheus.zeek. Mention it in local.zeek. Relates to zeek/broker#366.	2023-06-21 20:13:55 +02:00
Tim Wojtulewicz	0d25583049	Remove Supervisor::NodeConfig (6.1 deprecation)	2023-06-14 10:07:22 -07:00
Tim Wojtulewicz	531276cfe0	Remove LogAscii::logdir (6.1 deprecation)	2023-06-14 10:07:22 -07:00
Tim Wojtulewicz	a55e5e3724	Remove full scripts marked as 6.1 deprecations	2023-06-14 10:07:22 -07:00
Tim Wojtulewicz	7a867d52e2	Remove script functions marked as unused (6.1 deprecations)	2023-06-14 10:07:22 -07:00
Tim Wojtulewicz	4229af6820	Remove deprecations tagged for v6.1	2023-06-14 10:07:22 -07:00
Arne Welzel	7a043e5e8f	all: Fix typos identified by typos pre-commit hook	2023-06-13 17:57:32 +02:00
Arne Welzel	f53aefdd5b	Merge branch 'topic/awelzel/3112-log-suffix-left-over-log-rotation' * topic/awelzel/3112-log-suffix-left-over-log-rotation: cluster/logger: Fix leftover-log-rotation in multi-logger setups cluster/logger: Fix global var reference	2023-06-13 17:33:56 +02:00
Arne Welzel	6d1991fb6a	cluster/logger: Fix leftover-log-rotation in multi-logger setups Populating log_metadata during zeek_init() is too late for the leftover-log-rotation functionality, so do it at script parse time. Also, prepend archiver_ to the log_metadata table and encoding function due to being in the global namespace and to align with the archiver_rotation_format_func. This hasn't been in a released version yet, so fine to rename still. Closes #3112	2023-06-13 10:47:20 +02:00
Arne Welzel	27432c457c	cluster/logger: Fix global var reference	2023-06-13 10:47:20 +02:00
Arne Welzel	eef7acc1e9	cluster/main: Remove extra @if ( Cluster::is_enabled() ) These have been discussed in the context of "@if &analyze" [1] and am much in favor for not disabling/removing ~100 lines (more than fits on a single terminal) out from the middle of a file. There's no performance impact for having these handlers enabled unconditionally. Also, any future work on "@if &analyze" will look at them again which we could also skip. This also reverts back to the behavior where the Cluster::LOG stream is created even in non cluster setups like in previous Zeek versions. As long as no one writes to it there's essentially no difference. If someone does write to Cluster::LOG, I'd argue not black holing these messages is better. Schema generators using Log::active_streams will continue to discover Cluster::LOG even if they run in non-cluster mode. https://github.com/zeek/zeek/pull/3062#discussion_r1200498905	2023-06-06 15:20:10 +02:00
Tim Wojtulewicz	5a3abbe364	Revert "Merge remote-tracking branch 'origin/topic/vern/at-if-analyze'" This reverts commit `4e797ddbbc`, reversing changes made to `3ac28ba5a2`.	2023-05-31 09:20:33 +02:00
Vern Paxson	890010915a	change base scripts to use run-time if's or @if ... &analyze	2023-05-19 13:26:27 -07:00
Arne Welzel	d4c99e7c3f	files: Warn once for missing get_file_handle() Repeating the message for every new call to get_file_handle() is not very useful. It's pretty much an analyzer configuration issue so logging it once should be enough.	2023-05-19 09:37:51 -07:00
Arne Welzel	c2a07476cc	Merge remote-tracking branch 'jgras/topic/jgras/cluster-active-node-count-fix' * jgras/topic/jgras/cluster-active-node-count-fix: Fix get_active_node_count for node types not present. Changed over to explicit existence check instead to avoid the set() creation upon missed lookups.	2023-05-17 10:37:00 +02:00
Jan Grashoefer	e4f654c14c	Fix get_active_node_count for node types not present.	2023-05-16 17:47:50 +02:00
Robin Sommer	ecf00295c2	Move `spicy/misc` scripts to policy and clarify purpose.	2023-05-16 10:21:21 +02:00
Robin Sommer	0040111955	Integrate the Spicy plugin into Zeek proper. This reflects the `spicy-plugin` code as of `d8c296b81cc2a11`. In addition to moving the code into Zeek's source tree, this comes with a couple small functional changes: - `spicyz` no longer tries to infer if it's running from the build directory. Instead `ZEEK_SPICY_LIBRARY` can be set to a custom location. `zeek-set-path.sh` does that now. - ZEEK_CONFIG can be set to change what `spicyz -z` print out. This is primarily for backwards compatibility. Some further notes on specifics: - We raise the minimum Spicy version to 1.8 (i.e., current `main` branch). - Renamed the `compiler/` subdirectory to `spicyz` to avoid include-path conflicts with the Spicy headers. - In `cmake/`, the corresponding PR brings a new/extended version of `FindZeek`, which Spicy analyzer packages need. We also now install some of the files that the Spicy plugin used to bring for testing, so that existing packages keep working. - For now, this all remains backwards compatible with the current `zkg` analyzer templates so that they work with both external and integrated Spicy support. Later, once we don't need to support any external Spicy plugin versions anymore, we can clean up the templates as well. - All the plugin's tests have moved into the standard test suite. They are skipped if configure with `--disable-spicy`. This holds off on adapting the new code further to Zeek's coding conventions, so that it remains easier to maintain it in parallel to the (now legacy) external plugin. We'll make a pass over the formatting for (presumable) Zeek 6.1.	2023-05-16 10:17:45 +02:00
Arne Welzel	9330a74fe1	Merge remote-tracking branch 'origin/topic/awelzel/zeek-archiver-multiple-loggers' * origin/topic/awelzel/zeek-archiver-multiple-loggers: cluster/supervisor: Multi-logger awareness Bump zeek-archiver submodule	2023-05-09 15:20:53 +02:00
Arne Welzel	c813872915	cluster/supervisor: Multi-logger awareness When multiple loggers are configured in a Supervisor controlled cluster configuration, encode extra information into the rotated filename to identify which logger produced the log. This is similar to the approach taken for ZeekControl, re-using the log_suffix terminology, but as there's only a single zeek-archiver process and no postprocessors and no other side-channel for additional information, we encode extra metadata into the filename. zeek-archiver is extended to recognize the special metadata part of the filename. This also solves the issue that multiple loggers in a supervisor setup overwrite each others log files within a single log-queue directory.	2023-05-05 12:27:25 +02:00
Arne Welzel	3ac877e20d	scripts/smb2-main: Reset script-level state upon smb2_discarded_messages_state() This is similar to what the external corelight/zeek-smb-clear-state script does, but leverages the smb2_discarded_messages_state() event instead of regularly checking on the state of SMB connections. The pcap was created using the dperson/samba container image and mounting a share with Linux's CIFS filesystem, then copying the content of a directory with 100 files. The test uses a BPF filter to imitate mostly "half-duplex" traffic.	2023-05-03 11:22:01 +02:00
Arne Welzel	004dce2cf2	Merge remote-tracking branch 'origin/topic/awelzel/zeekctl-multiple-loggers' * origin/topic/awelzel/zeekctl-multiple-loggers: NEWS: Add entry for ZeekControl and multi-loggers Bump zeekctl to multi-logger version logging: Support rotation_postprocessor_command_env	2023-04-27 12:17:02 +02:00
Jan Grashoefer	88c86cc7d4	Add hook into cluster connection setup.	2023-04-21 19:04:52 +02:00
Jan Grashoefer	c7626d797f	Add broadcast_topics set. This set contains the topics to reach all cluster nodes. Due to broker's forwarding mechanism, we cannot define a single broadcast topic, as it would create routing loops.	2023-04-21 19:04:52 +02:00
Jan Grashoefer	3db8bb4a44	Generalize Cluster::worker_count.	2023-04-21 19:04:39 +02:00
Arne Welzel	d89f16dfc9	logging: Support rotation_postprocessor_command_env This new table provides a mechanism to add environment variables to the postprocessor execution. Use case is from ZeekControl to inject a suffix to be used when running with multiple logger.	2023-04-17 13:10:14 +00:00
Arne Welzel	a5e7faf564	logging/Manager: Fix crash for rotation format function not returning While working on a rotation format function, ran into Zeek crashing when not returning a value from it, fix and recover the same way as for scripting errors.	2023-04-13 09:23:51 +02:00
Arne Welzel	b8dc6ad120	smtp: Validate mail transaction and disable SMTP analyzer if excessive An invalid mail transaction is determined as * RCPT TO command without a preceding MAIL FROM * a DATA command without a preceding RCPT TO and logged as a weird. The testing pcap for invalid mail transactions was produced with a Python script against a local exim4 configured to accept more errors and unknown commands than 3 by default: # exim4.conf.template smtp_max_synprot_errors = 100 smtp_max_unknown_commands = 100 See also: https://www.rfc-editor.org/rfc/rfc5321#section-3.3	2023-03-27 18:41:47 +02:00
Jan Grashoefer	1882307cf3	Add pcap_file option to supervised nodes. This allows to start Supervised nodes with a pcap_file argument rather than interface. This is based on changes from @J-Gras.	2023-03-21 16:18:02 +01:00
Christian Kreibich	19829765d4	Provide a mechanism to suppress logging of internal config framework activity	2023-03-15 17:01:00 -07:00
Arne Welzel	f56785740c	ftp: Limit user, password, arg and reply_msg column sizes in log The user and password fields are replicated to each of the ftp.log entries. Using a very large username (100s of KBs) allows to bloat the log without actually sending much traffic. Further, limit the arg and reply_msg columns to large, but not unbounded values.	2023-02-21 12:28:07 -07:00
Arne Welzel	e4ab7b2d70	files/main: No empty file_ids When an analyzer calls DataIn(), there's a costly callback construct going through the event queue. If an analyzer does not have a get_file_handle() handler installed, the produced file_id would end up empty and ignored. Consequently, the get_file_handle() callback was invoked for every new DataIn() invocations. This is surprising and costly. Log a warning when this happens and instead set a generically generated file handle value instead to prevent the repeated get_file_handle() invocations.	2023-02-06 18:08:05 +01:00
Arne Welzel	f35cf228dc	broker/store: Extend SQLiteOptions around data safety and performance Add configurability of synchronous and journal_mode for SQLite backed Broker data stores. Setting these to synchronous=normal and journal_mode=wal can significantly improve throughput at the cost of some durability in the presence of power loss or OS crash. In the context of Zeek, this is likely more than acceptable. Additionally, add integrity_check and failure_mode options to support deleting and re-opening a corrupted SQLite database at store creation. Closes #2698	2023-01-30 10:25:37 +01:00
Arne Welzel	8be8c22b3e	smb1: Prevent accessing uninitialized referenced_tree The added pcap was created from an OSS Fuzz test case and is borderline valid SMB traffic, but it triggered a scripting error. Closes #2726	2023-01-27 19:22:13 +01:00
Christian Kreibich	12885c7475	Fix a docstring typo	2023-01-10 18:49:19 -08:00
Arne Welzel	2d852209b0	Merge remote-tracking branch 'origin/topic/awelzel/analyzer-log' * origin/topic/awelzel/analyzer-log: btest/net-control: Use different expiration times for rules analyzer: Add analyzer.log for logging violations/confirmations	2023-01-10 10:22:58 +01:00
Arne Welzel	17d0ade26a	analyzer: Add analyzer.log for logging violations/confirmations By default this only logs all the violations, regardless of the confirmation state (for which there's still dpd.log). It includes packet, protocol and file analyzers. This uses options, change handlers and event groups for toggling the functionality at runtime. Closes #2031	2023-01-09 18:11:49 +01:00
Arne Welzel	4e75d54d49	scripts/analyzer: Introduce Analyzer::requested_analyzers In certain deployment scenarios, all analyzers are disabled by default. However, conditionally/optionally loaded scripts may rely on analyzers functioning and declare a request for them. Add a global set set to the Analyzer module where external scripts can record their requirement/request for a certain analyzer. Analyzers found in this set are enabled at zeek_init() time.	2022-12-13 14:28:16 +01:00

1 2 3 4 5 ...

1208 commits