Mirror/zeek - git.uphillsecurity.com: We code.

mirror of https://github.com/zeek/zeek.git synced 2025-10-02 06:38:20 +00:00

Author	SHA1	Message	Date
Tim Wojtulewicz	c8818d76bd	Remove length limiting on string fields for HTTP	2025-08-12 17:31:29 -07:00
Tim Wojtulewicz	0ec2161b04	Add options to filter at the stream level as well as globally	2025-08-12 17:31:28 -07:00
Tim Wojtulewicz	d95affde4d	Remove deprecations tagged for v8.1	2025-08-12 10:19:03 -07:00
Johanna Amann	8de178d923	Move c$service_violation to deprecated-dpd-log.zeek This moves c$service_violation to the deprecated-dpd-log policy script. This is the only script in the distribution that uses the field, and it is unlikely to be used externally. It is also responsible for a significant amount of memory use by itself. This also restores the field being populated, which was broken in GH-4362	2025-07-30 07:58:36 +01:00
Johanna Amann	83d5243cf6	Optimize Conn::set_conn to minimize operations Now that Conn::set_conn is guaranteed to be called at the beginning and at the end of the connection, we can skip re-setting the elements that we know will not have changed. This prevents repeated lookups, e.g. to check that addresses are in the local networks. During `connection_state_remove`, only the duration, number of packets, service, and history fields are updated. local_orig and local_resp are updated when the connection is flipped. A test was added for that purpose. It uses the already existing http.zeek-image-post-1080-8000-x.pcap, which was slightly rewritten for this, so that one side of the connection has IP addresses different from 127.0.0.1. The existing history-flip test also was updated to have one side being in a local-net, to check that the flipping of local_orig and local_resp works correctly at the beginning of a connection.	2025-07-29 09:11:57 +01:00
Johanna Amann	3e4f67e67c	Move Conn::set_conn() from connection_state_remove to new_connection c$conn is often needed for connection events, but it being established in connection_state_removed can be a problem because event handlers have to remember to call Con::set_conn(). This commit moves to call Conn::set_conn() in new_connection. Addresses GH-4202 update logs	2025-07-29 09:11:57 +01:00
Johanna Amann	e5a434c392	PPPoE: add session id logging This adds a new PacketAnalyzer::PPPoE::session_id bif, which extracts the PPPoE session ID from the current packet. Furthermore, a new policy script is added which adds the pppoe session id to the connection log. Related to GH-4602	2025-07-23 13:43:45 +01:00
Arne Welzel	112e3c1c03	btest: Update tests and baselines after adding ctx to conn_id	2025-07-03 18:19:46 +02:00
Arne Welzel	4c34274a6c	cluster: Introduce telemetry component	2025-06-25 16:59:49 +02:00
Johanna Amann	58613f0313	Introduce new c$failed_analyzers field This field is used internally to trace which analyzers already had a violation. This is mostly used to prevent duplicate logging. In the past, c$service_violation was used for a similar purpose - however it has slightly different semantics. Where c$failed_analyzers tracks analyzers that were removed due to a violation, c$service_violation tracks violations - and doesn't care if an analyzer was actually removed due to it.	2025-06-04 12:07:13 +01:00
Johanna Amann	42ba2fcca0	Settle on analyzer.log for the dpd.log replacement This commit renames analyzer-failed.log to analyzer.log, and updates the respective news entry.	2025-06-03 17:33:36 +01:00
Johanna Amann	130c89a0a7	dpd->analyzer.log change - rename files To address review feedback in GH-4362: rename analyzer-failed-log.zeek to loggig.zeek, analyzer-debug-log.zeek to debug-logging.zeek and dpd-log.zeek to deprecated-dpd-log.zeek. Includes respective test, NEWS, etc updates.	2025-06-03 16:32:52 +01:00
Johanna Amann	af77a7a83b	Analyzer failure logging: tweaks and test fixes The main part of this commit are changes in tests. A lot of the tests that previously relied on analyzer.log or dpd.log now use the new analyzer-failed.log. I verified all the changes and, as far as I can tell, everything behaves as it should. This includes the external test baselines. This change also enables logging of file and packet analyzer to analyzer_failed.log and fixes some small behavior issues. The analyzer_failed event is no longer raised when the removal of an analyzer is vetoed. If an analyzer is no longer active when an analyzer violation is raised, currently the analyzer_failed event is raised. This can, e.g., happen when an analyzer error happens at the very end of the connection. This makes the behavior more similar to what happened in the past, and also intuitively seems to make sense. A bug introduced in the failed service logging was fixed.	2025-06-03 15:56:42 +01:00
Tim Wojtulewicz	58ee8d3c5c	Add Storage::is_connected BIF	2025-05-07 08:13:16 -07:00
Arne Welzel	0e027fa4e3	EventMgr: Add Dispatch() with handler and args Allow users to call event_mgr.Dispatch(handler, args) instead of constructing the Event instance themselves. Deprecate the old API and replace users. There's a subtle change that net_done() may be propagated via auto_publish() now, but that still needs opt-in from script land and likely no one did that, or else they'd expected to have it work anyhow.	2025-04-11 09:58:07 +02:00
Tim Wojtulewicz	43faea880b	Add analyzer registration from VLAN to VNTAG	2025-03-18 11:51:27 -07:00
Tim Wojtulewicz	c7015e8250	Split storage.bif file into events/sync/async, add more comments	2025-03-18 10:20:34 -07:00
Tim Wojtulewicz	2ea0f3e70a	Lay out initial parts for the Storage framework This includes a manager, component manager, BIF and script code, and parts to support new storage backend plugins.	2025-03-18 09:32:34 -07:00
Arne Welzel	6032741868	cluster/websocket: Implement WebSocket server	2025-03-10 17:07:30 +01:00
Arne Welzel	776c003033	PacketAnalyzer::Geneve: Add get_options() Allow to extract Geneve options on-demand, for example during a new_connection() event.	2025-02-22 12:19:42 -08:00
Johanna Amann	2f712c3c24	Allow to track service violations in conn.log. This introduces ian options, DPD::track_removed_services_in_connection. It adds failed services to the services column, prefixed with a "-". Alternatively, this commit also adds policy/protocols/conn/failed-services.zeek, which provides the same information in a new column in conn.log.	2025-01-30 16:59:44 +00:00
Johanna Amann	c72c1cba6f	DPD: change handling of pre-confirmation violations, remove max_violations This commit revamps the handling of analyzer violations that happen before an analyzer confirms the protocol. The current state is that an analyzer is disabled after 5 violations, if it has not been confirmed. If it has been confirmed, it is disabled after a single violation. The reason for this is a historic mistake. In Zeek up to versions 1.5, analyzers were unconditianally removed when they raised the first protocol violation. When this script was ported to the new layout for Zeek 2.0 in `b4b990cfb5`, a logic error was introduced that caused analyzers to no longer be disabled if they were not confirmed. This was the state for ~8 years, till the DPD::max_violations options was added, which instates the current approach of disabling unconfirmed analyzers after 5 violations. Sadly, there is not much discussion about this change - from my hazy memory, I think this was discovered during performance tests and the new behavior was added without checking into the history of previous changes. This commit reinstates the originally intended behavior of DPD. When an analyzer that has not been confirmed raises a protocol violation, it is immediately removed from the connection. This also makes a lot of sense - this allows the analyzer to be in a "tasting" phase at the beginning of the connection, and to error out quickly once it realizes that it was attached to a connection not containing the desired protocol. This change also removes the DPD::max_violations option, as it no longer serves any purpose after this change. (In practice, the option remains with an &deprecated warning, but it is no longer used for anything). There are relatively minimal test-baseline changes due to this; they are mostly triggered by the removal of the data structure and by less analyzer errors being thrown, as unconfirmed analyzers are disabled after the first error.	2025-01-30 16:59:44 +00:00
Tim Wojtulewicz	0fcbc8546e	Update btests for new local-only subnets	2025-01-09 22:16:42 -07:00
Arne Welzel	35c79ab2e3	cluster/backend/zeromq: Add ZeroMQ based cluster backend This is a cluster backend implementation using a central XPUB/XSUB proxy that by default runs on the manager node. Logging is implemented leveraging PUSH/PULL sockets between logger and other nodes, rather than going through XPUB/XSUB. The test-all-policy-cluster baseline changed: Previously, Broker::peer() would be called from setup-connections.zeek, causing the IO loop to be alive. With the ZeroMQ backend, the IO loop is only alive when Cluster::init() is called, but that doesn't happen anymore.	2024-12-10 20:33:02 +01:00
Christian Kreibich	1c42bfc715	Merge branch 'topic/christian/disconnect-slow-peers' * topic/christian/disconnect-slow-peers: Bump cluster testsuite to pull in Broker backpressure tests Expand documentation of Broker events. Add sleep() BiF. Add backpressure disconnect notification to cluster.log and via telemetry Remove unneeded @loads from base/misc/version.zeek Add Cluster::nodeid_to_node() helper function Support re-peering with Broker peers that fall behind Add Zeek-level configurability of Broker slow-peer disconnects Bump Broker to pull in disconnect feature and infinite-loop fix No need to namespace Cluster:: functions in their own namespace	2024-12-09 23:33:35 -08:00
Tim Wojtulewicz	ccefd66d37	Move python signatures to a separate file	2024-12-09 11:08:30 -07:00
Christian Kreibich	0010e65f6d	Support re-peering with Broker peers that fall behind This adds re-peering at the Broker level for peers that Broker decided to unpeer. We keep this at the Broker level since this behavior is specific to it (as opposed to other cluster backends). Includes baseline updates for btests that pick up on the new script's @load.	2024-12-06 15:18:05 -08:00
Arne Welzel	51836d08ae	protocol: Add StreamEvent analyzer This analyzer can be used to transport raw stream data for a given connection to the script layer. For example, adding this analyzer into the HTTP::upgrade_analyzer or using it to configure a child WebSocket analyzer allows to get access to the raw stream data in script land when no more appropriate protocol analyzer is available.	2024-12-06 16:12:40 +01:00
Arne Welzel	ef04a199c8	cluster: Add Cluster scoped bifs ... and a broker based test using Cluster::publish() and Cluster::subscribe().	2024-11-26 12:58:23 +01:00
Tim Wojtulewicz	5e5aceb6f7	Rename protocol_id field to ip_proto and similar renaming for name field	2024-11-13 12:02:00 -07:00
Tim Wojtulewicz	35ec9733c0	Add conn.log entries for connections with unhandled IP protocols	2024-11-13 11:25:40 -07:00
Christian Kreibich	71f7e89974	Telemetry framework: move BIFs to the primary-bif stage This moves the Telemetry framework's BIF-defined functionalit from the secondary-BIFs stage to the primary one. That is, this functionality is now available from the end of init-bare.zeek, not only after the end of init-frameworks-and-bifs.zeek. This allows us to use script-layer telemetry in our Zeek's own code that get pulled in during init-frameworks-and-bifs. This change splits up the BIF features into functions, constants, and types, because that's the granularity most workable in Func.cc and NetVar. It also now defines the Telemetry::MetricsType enum once, not redundantly in BIFs and script layer. Due to subtle load ordering issues between the telemetry and cluster frameworks this pushes the redef stage of Telemetry::metrics_port and address into base/frameworks/telemetry/options.zeek, which is loaded sufficiently late in init-frameworks-and-bifs.zeek to sidestep those issues. (When not doing this, the effect is that the redef in telemetry/main.zeek doesn't yet find the cluster-provided values, and Zeek does not end up listening on these ports.) The need to add basic Zeek headers in script_opt/ZAM/ZBody.cc as a side-effect of this is curious, but looks harmless. Also includes baseline updates for the usual btests and adds a few doc strings.	2024-10-18 09:56:29 -07:00
Vern Paxson	61258587bf	BTest baseline update for more complete function/lambda names	2024-09-27 14:16:10 -07:00
Arne Welzel	cf9fe91705	pop3: Prevent unbounded state growth The cmds list may grow unbounded due to the POP3 analyzer being in multiLine mode after seeing `AUTH` in a Redis connection, but never a `.` terminator. This can easily be provoked by the Redis ping command. This adds two heuristics: 1) Forcefully process the oldest commands in the cmds list and cap it at max_pending_commands. 2) Start raising analyzer violations if the client has been using more than max_unknown_client_commands commands (default 10). Closes #3936	2024-09-18 19:05:39 +02:00
Arne Welzel	a5d93c4dec	btest: Update baselines for removal-hooks addition The removal_hooks field exists in bare mode (seems fine) and moved within the connection record to earlier, so a bunch of baselines changed	2024-09-17 18:15:15 +02:00
Tim Wojtulewicz	7ac7ce1d2b	Process metric callbacks from the main-loop thread This avoids the callbacks from being processed on the worker thread spawned by Civetweb. It fixes data race issues with lookups involving global variables, amongst other threading issues.	2024-08-02 15:30:47 -07:00
Jan Grashoefer	c6c8d078c0	Extend btest for logging of disabled analyzers	2024-07-09 20:15:46 +02:00
Tim Wojtulewicz	a63ea5a04e	Btest updates due to recent changes	2024-05-31 13:30:31 -07:00
Vern Paxson	e84b60762a	added a space when rendering some expressions so they're more readable	2024-05-29 12:40:05 -07:00
Arne Welzel	efc2681152	WebSocket: Introduce new analyzer and log This adds a new WebSocket analyzer that is enabled with the HTTP upgrade mechanism introduced previously. It is a first implementation in BinPac with manual chunking of frame payload. Configuration of the analyzer is sketched via the new websocket_handshake() event and a configuration BiF called WebSocket::__configure_analyzer(). In short, script land collects WebSocket related HTTP headers and can forward these to the analyzer to change its parsing behavior at websocket_handshake() time. For now, however, there's no actual logic that would change behavior based on agreed upon extensions exchanged via HTTP headers (e.g. frame compression). WebSocket::Configure() simply attaches a PIA_TCP analyzer to the WebSocket analyzer for dynamic protocol detection (or a custom analyzer if set). The added pcaps show this in action for tunneled ssh, http and https using wstunnel. One test pcap is Broker's WebSocket traffic from our own test suite, the other is the Jupyter websocket traffic from the ticket/discussion. This commit further adds a basic websocket.log that aggregates the WebSocket specific headers (Sec-WebSocket-*) headers into a single log. Closes #3424	2024-01-22 18:54:38 +01:00
Arne Welzel	2a858d252e	MIME: Cap nested MIME analysis depth to 100 OSS-Fuzz managed to produce a MIME multipart message construction with thousands of nested entities (or that's what Zeek makes out of it anyhow). Prevent such deep analysis by capping at a nesting depth of 100, preventing unnecessary resource usage. A new weird named exceeded_mime_max_depth is reported when this limit is reached. This change reduces the runtime of the OSS-Fuzz reproducer from ~45 seconds to ~2.5 seconds. The test PCAP was produced from a Python script using the email package and sending the rendered version via POST to a HTTP server. Closes #208	2024-01-17 10:18:13 -07:00
Arne Welzel	14949941ce	SMTP: Add BDAT support Closes #3264	2024-01-12 10:18:07 +01:00
Christian Kreibich	4e45a3462b	Update btest baselines to reflect introduction of mmdb.bif	2024-01-10 20:28:41 -08:00
Arne Welzel	e3796894c6	logging: Do not keep delay state persistent If Log::remove_stream() and Log::create_stream() is called for a stream, do not restore the previously used max delay or max queue size.	2023-11-29 11:53:11 +01:00
Arne Welzel	f0e67022fd	logging: Introduce Log::delay() and Log::delay_finish() This is a verbose, opinionated and fairly restrictive version of the log delay idea. Main drivers are explicitly, foot-gun-avoidance and implementation simplicity. Calling the new Log::delay() function is only allowed within the execution of a Log::log_stream_policy() hook for the currently active log write. Conceptually, the delay is placed between the execution of the global stream policy hook and the individual filter policy hooks. A post delay callback can be registered with every Log::delay() invocation. Post delay callbacks can (1) modify a log record as they see fit, (2) veto the forwarding of the log record to the log filters and (3) extend the delay duration by calling Log::delay() again. The last point allows to delay a record by an indefinite amount of time, rather than a fixed maximum amount. This should be rare and is therefore explicit. Log::delay() increases an internal reference count and returns an opaque token value to be passed to Log::delay_finish() to release a delay reference. Once all references are released, the record is forwarded to all filters attached to a stream when the delay completes. This functionality separates Log::log_stream_policy() and individual filter policy hooks. One consequence is that a common use-case of filter policy hooks, removing unproductive log records, may run after a record was delayed. Users can lift their filtering logic to the stream level (or replicate the condition before the delay decision). The main motivation here is that deciding on a stream-level delay in per-filter hooks is too late. Attaching multiple filters to a stream can additionally result in hard to understand behavior. On the flip side, filter policy hooks are guaranteed to run after the delay and can be used for further mangling or filtering of a delayed record.	2023-11-29 11:53:11 +01:00
Vern Paxson	23c08a05de	descriptions of "for" statements now include their "value variable" if present	2023-11-10 09:56:51 +01:00
Arne Welzel	54a08a74da	base/frameworks/spicy: Do not load base/misc/version Unsure what it's used for today and also results in the situation that on some platforms we generate a reporter.log in bare mode, while on others where spicy is disabled, we do not. If we want base/frameworks/version loaded by default, should put it into init-bare.zeek and possibly remove the loading of the reporter framework from it - Reporter::error() would still work and be visible on stderr, just not create a reporter.log.	2023-10-24 13:15:21 +02:00
Tim Wojtulewicz	6d9d4523bc	Add registration for GRE-over-UDP	2023-10-16 11:42:24 -07:00
Arne Welzel	07ac6fa074	btest/plugins/hooks: Run in bare mode Motivation is basically the same as in `88bb527026`. For plugin.hooks, one example is that adding a new option in the default script changes the baseline due registration of change handlers. Also, the connection record is printed in various places, resulting in churn when the default scripts change.	2023-10-09 16:13:59 +02:00
Johanna Amann	e18edfa452	Add extract_limit_includes_missing option for file extraction Setting this option to false does not count missing bytes in files towards the extraction limits, and allows to extract data up to the desired limit, even when partial files are written. When missing bytes are encountered, files are now written as sparse files. Using this option requires the underlying storage and utilities to support sparse files.	2023-09-14 12:11:42 -07:00

1 2 3 4 5 ...

374 commits