This prevents the callbacks from being processed on the worker thread
spawned by Civetweb. It fixes data races in lookups involving
global variables, amongst other threading issues.
This adds a new WebSocket analyzer that is enabled with the HTTP upgrade
mechanism introduced previously. It is a first implementation in BinPac with
manual chunking of frame payload. Configuration of the analyzer is sketched
via the new websocket_handshake() event and a configuration BiF called
WebSocket::__configure_analyzer(). In short, script land collects WebSocket-related
HTTP headers and can forward these to the analyzer to change its
parsing behavior at websocket_handshake() time. For now, however, there's
no actual logic that would change behavior based on agreed-upon extensions
exchanged via HTTP headers (e.g. frame compression). WebSocket::Configure()
simply attaches a PIA_TCP analyzer to the WebSocket analyzer for dynamic
protocol detection (or a custom analyzer if set). The added pcaps show this
in action for tunneled SSH, HTTP and HTTPS using wstunnel. One test pcap is
Broker's WebSocket traffic from our own test suite, the other is the
Jupyter WebSocket traffic from the ticket/discussion.
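As a hedged sketch of the intended flow (the event and function signatures
here are assumptions, not the exact committed API):

event websocket_handshake(c: connection, aid: count)
	{
	# Script land has collected the Sec-WebSocket-* headers by this point
	# and could derive analyzer settings from them. With nothing special
	# negotiated, fall through to the default configuration, which
	# attaches PIA_TCP for dynamic protocol detection.
	WebSocket::Configure(c, aid);
	}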
This commit further adds a basic websocket.log that aggregates the
WebSocket-specific (Sec-WebSocket-*) headers into a single log.
Closes #3424
OSS-Fuzz managed to produce a MIME multipart message construction with
thousands of nested entities (or that's what Zeek makes of it, anyhow).
Cap the nesting depth at 100 to prevent such deep analysis and the
unnecessary resource usage it causes. A new weird named exceeded_mime_max_depth
is reported when this limit is reached.
This change reduces the runtime of the OSS-Fuzz reproducer from ~45 seconds
to ~2.5 seconds.
The test PCAP was produced from a Python script using the email package
and sending the rendered version via POST to an HTTP server.
Closes #208
This is a verbose, opinionated and fairly restrictive version of the log delay idea.
The main drivers are explicitness, foot-gun avoidance and implementation simplicity.
Calling the new Log::delay() function is only allowed within the execution
of a Log::log_stream_policy() hook for the currently active log write.
Conceptually, the delay is placed between the execution of the global stream
policy hook and the individual filter policy hooks. A post delay callback
can be registered with every Log::delay() invocation. Post delay callbacks
can (1) modify a log record as they see fit, (2) veto the forwarding of the
log record to the log filters and (3) extend the delay duration by calling
Log::delay() again. The last point allows delaying a record for an indefinite
amount of time, rather than up to a fixed maximum. This should be rare and
is therefore explicit.
Log::delay() increases an internal reference count and returns an opaque
token value to be passed to Log::delay_finish() to release a delay reference.
Once all references are released, the record is forwarded to all filters
attached to a stream when the delay completes.
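A minimal usage sketch following the semantics described above (the hook and
function signatures are inferred from this description, so treat them as
illustrative):

hook Log::log_stream_policy(rec: any, id: Log::ID)
	{
	if ( id != Conn::LOG )
		return;

	# Delay the record; the post-delay callback may modify it, veto
	# forwarding by returning F, or extend the delay by calling
	# Log::delay() again.
	local token = Log::delay(id, rec, function(rec2: any, id2: Log::ID): bool
		{
		return T;
		});

	# Shown inline for brevity; in practice this would run later, e.g.
	# from an event once enrichment has completed, releasing the
	# reference so the record is forwarded to the filters.
	Log::delay_finish(id, rec, token);
	}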
This functionality separates Log::log_stream_policy() and the individual filter
policy hooks. One consequence is that a common use case of filter policy hooks,
removing unproductive log records, may now run only after a record has been
delayed. Users can lift their filtering logic to the stream level (or replicate
the condition before the delay decision). The main motivation here is that
deciding on a stream-level delay in per-filter hooks is too late. Attaching
multiple filters to a stream can additionally result in hard-to-understand
behavior. On the flip side, filter policy hooks are guaranteed to run after
the delay and can be used for further mangling or filtering of a delayed record.
It's unclear what this is used for today, and it also results in the situation
that on some platforms we generate a reporter.log in bare mode, while on
others, where Spicy is disabled, we do not.
If we want base/frameworks/version loaded by default, we should put it into
init-bare.zeek and possibly remove the loading of the reporter framework
from it - Reporter::error() would still work and be visible on stderr,
just not create a reporter.log.
Motivation is basically the same as in 88bb527026.
For plugin.hooks, one example is that adding a new option in the default scripts
changes the baseline due to the registration of change handlers. Also, the
connection record is printed in various places, resulting in churn when the
default scripts change.
Setting this option to false stops counting missing bytes in files towards
the extraction limits and allows extracting data up to the desired limit,
even when only partial files are written.
When missing bytes are encountered, files are now written as sparse
files.
Using this option requires the underlying storage and utilities to support
sparse files.
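A sketch of opting in, assuming the option in question is
FileExtract::default_limit_includes_missing (the name is our assumption here):

# Assumed option name: stop counting missing (gap) bytes towards the
# per-file extraction limit, producing sparse files on gaps.
redef FileExtract::default_limit_includes_missing = F;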
Currently, loop vars are added to a function scope's inits and
initialized with default values upon entering the function. This
applies to vector, record and table types.
This is unnecessary for variables used in for-loops, as they are
guaranteed to be initialized while iterating.
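For illustration, a function of the affected shape (a hypothetical example):

function count_ids(conns: vector of conn_id): count
	{
	local n = 0;
	# cid (a record) is assigned on each iteration, so
	# default-initializing it at function entry is wasted work.
	for ( i, cid in conns )
		++n;
	return n;
	}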
When http_reply events are received before http_request events, either
through faked traffic or possible re-ordering, it is possible to trigger
unbounded state growth because later http_requests are never matched
with responses again.
Prevent this by synchronizing the request/response counters when late
requests come in.
Also, forcefully flush pending requests when http_replies are never
observed, either because the analyzer has been disabled or because of
half-duplex traffic.
Fixes #1705
Using pcaps from https://interop.seemann.io/ as samples of QUIC protocol
data didn't produce a conn.log for the contained data, even though
`tcpdump -r` and Wireshark do show the contained IP/UDP packets. Teach Zeek
to handle link type DLT_PPP (0x09) using a new PPP analyzer based on the
PPPSerial analyzer code.
Usual update to the files/x509 baseline after adding a new analyzer, due
to enum values changing.
These have been discussed in the context of "@if &analyze" [1], and I
am much in favor of not disabling/removing ~100 lines (more than
fits on a single terminal) from the middle of a file. There's no
performance impact from having these handlers enabled unconditionally.
Also, any future work on "@if &analyze" would need to look at them again,
which we can skip this way.
This also reverts back to the behavior where the Cluster::LOG stream
is created even in non-cluster setups, as in previous Zeek versions.
As long as no one writes to it, there's essentially no difference. If
someone does write to Cluster::LOG, I'd argue that not black-holing these
messages is better. Schema generators using Log::active_streams will
continue to discover Cluster::LOG even if they run in non-cluster
mode.
[1] https://github.com/zeek/zeek/pull/3062#discussion_r1200498905
This reflects the `spicy-plugin` code as of `d8c296b81cc2a11`.
In addition to moving the code into Zeek's source tree, this comes
with a couple of small functional changes:
- `spicyz` no longer tries to infer if it's running from the build
directory. Instead, `ZEEK_SPICY_LIBRARY` can be set to a custom
location. `zeek-set-path.sh` does that now.
- `ZEEK_CONFIG` can be set to change what `spicyz -z` prints out. This is
primarily for backwards compatibility.
Some further notes on specifics:
- We raise the minimum Spicy version to 1.8 (i.e., current `main`
branch).
- Renamed the `compiler/` subdirectory to `spicyz` to avoid
include-path conflicts with the Spicy headers.
- In `cmake/`, the corresponding PR brings a new/extended version of
`FindZeek`, which Spicy analyzer packages need. We also now install
some of the files that the Spicy plugin used to bring for testing,
so that existing packages keep working.
- For now, this all remains backwards compatible with the current
`zkg` analyzer templates so that they work with both external and
integrated Spicy support. Later, once we don't need to support any
external Spicy plugin versions anymore, we can clean up the
templates as well.
- All the plugin's tests have moved into the standard test suite. They
are skipped if configured with `--disable-spicy`.
This holds off on adapting the new code further to Zeek's coding
conventions, so that it remains easier to maintain it in parallel to
the (now legacy) external plugin. We'll make a pass over the
formatting for (presumably) Zeek 6.1.
This is similar to what the external corelight/zeek-smb-clear-state script
does, but leverages the smb2_discarded_messages_state() event instead of
regularly checking on the state of SMB connections.
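For reference, a handler sketch (the event signature and the cleanup shown
are assumptions for illustration):

event smb2_discarded_messages_state(c: connection, state: string)
	{
	# Clear per-connection SMB state that can no longer be matched,
	# mirroring what the external script does on a timer.
	if ( c?$smb_state )
		delete c$smb_state;
	}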
The pcap was created using the dperson/samba container image and mounting
a share with Linux's CIFS filesystem, then copying the content of a
directory with 100 files. The test uses a BPF filter to imitate mostly
"half-duplex" traffic.
* origin/topic/awelzel/zeekctl-multiple-loggers:
NEWS: Add entry for ZeekControl and multi-loggers
Bump zeekctl to multi-logger version
logging: Support rotation_postprocessor_command_env
* origin/topic/awelzel/add-community-id:
testing/external: Bump hashes for community_id addition
NEWS: Add entry for Community ID
policy: Import zeek-community-id scripts into protocols/conn frameworks/notice
Add community_id_v1() based on corelight/zeek-community-id
"Community ID" has become an established flow hash for connection correlation
across different monitoring and storage systems. Other NSMs have had native
and built-in support for Community ID since late 2018. And even though the
roots of "Community ID" are very close to Zeek, Zeek itself has never provided
out-of-the-box support and instead required users to install an external plugin.
While we try to make that installation as easy as possible, an external plugin
always sets the bar higher for an initial setup and can be intimidating.
It also requires a rebuild operation of the plugin during upgrades. Nothing
overly complicated, but somewhat unnecessary for such popular functionality.
This isn't a 1:1 import. The options are now parameters and the "verbose"
functionality has been removed. Further, instead of a `connection`
record, the new BiF works with `conn_id`, allowing computation of the
hash with little effort on the command line:
$ zeek -e 'print community_id_v1([$orig_h=1.2.3.4, $orig_p=1024/tcp, $resp_h=5.6.7.8, $resp_p=80/tcp])'
1:RcCrCS5fwYUeIzgDDx64EN3+okU
Reference: https://github.com/corelight/zeek-community-id/
* origin/topic/vern/record-optimizations.Apr23B:
different fix for MSVC compiler issues
more general approach for addressing MSVC compiler issues with IntrusivePtr
restored RecordType::Create, now marked as deprecated
tidying of namespaces and private class members
simplification of flagging record field initializations that should be skipped
address peculiar MSVC compilation complaint for IntrusivePtr's
clarifications and tidying for record field initializations
optimize record construction by deferring initializations of aggregates
compile-scripts-to-C++ speedups by switching to raw record access
logging speedup by switching to raw record access
remove redundant record coercions
Removed the `#if 0` hunk during merging: Probably could have gone with a
doctest instead.
This new table provides a mechanism for adding environment variables to the
postprocessor execution. The use case is ZeekControl injecting a suffix
to be used when running with multiple loggers.
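A sketch, assuming the table lands as Log::default_rotation_postprocessor_cmd_env
(the name here is illustrative, following the subject above):

# Assumed table name: inject a variable into every rotation
# postprocessor invocation.
redef Log::default_rotation_postprocessor_cmd_env += {
	["ZEEK_LOG_SUFFIX"] = "logger-1",
};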
An invalid mail transaction is determined as
* a RCPT TO command without a preceding MAIL FROM, or
* a DATA command without a preceding RCPT TO,
and logged as a weird.
The testing pcap for invalid mail transactions was produced with a Python
script against a local exim4 configured to accept more errors and unknown
commands than the default of 3:
# exim4.conf.template
smtp_max_synprot_errors = 100
smtp_max_unknown_commands = 100
See also: https://www.rfc-editor.org/rfc/rfc5321#section-3.3
The user and password fields are replicated into each of the ftp.log
entries. Using a very large username (100s of KBs) makes it possible to
bloat the log without actually sending much traffic. Further, limit the
arg and reply_msg columns to large, but not unbounded, values.
After the first 4 bytes, this traffic actually just looks like Ethernet.
Rather than trying to re-implement the Ethernet analyzer, just check the
length, skip 4 bytes, and pass it on.
We previously used the Spicy plugin's `Spicy::available` to test for
Spicy support. However, having Spicy support does not necessarily mean that we
have built Zeek with its in-tree Spicy analyzers: the Spicy plugin
could have been pulled in externally. The new BIF now reliably
tells us whether the Spicy analyzers are available; its result
corresponds to what `zeek-config --have-spicy-analyzers` returns as
well.
We also move the two current checks over to use this BIF.
(Note: I refrained from renaming the CMake-side `USE_SPICY_ANALYZERS`
to `HAVE_SPICY_ANALYZERS`. We should do this eventually for
consistency, but I didn't want to make more changes than necessary
right now.)
As initial examples, this branch ports the Syslog and Finger analyzers
over. We leave the old analyzers in place for now and activate them
only if we compile without Spicy support.
Needs `zeek-spicy-infra` branches in `spicy/`, `spicy-plugin/`,
`CMake/`, and `zeek/zeek-testing-private`.
Note that the analyzer events remain associated with the Spicy plugin
for now: that's where they will show up with `-NN`, and also inside
the Zeekygen documentation.
We switch CMake over to linking the runtime library into the plugin,
vs. at the top-level through object libraries.
By default this logs only the violations, regardless of the
confirmation state (for which there's still dpd.log). It covers
packet, protocol and file analyzers.
This uses options, change handlers and event groups for toggling
the functionality at runtime.
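For instance, toggling at runtime might look like this (the option name is
hypothetical):

# Hypothetical option name: disables the violation logging and, via its
# change handler, the associated event groups.
Option::set("Analyzer::Logging::enable", F);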
Closes #2031
This commit adds an optional event_groups field to the Logging::Stream record
to associate event groups with logging streams.
This can be used to disable all event groups of a logging stream when the
stream is disabled. It does require making an explicit connection between the
logging stream and the involved groups, however.
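A sketch of wiring this up at stream creation time (the module, record type
and group name are made up for illustration):

# Hypothetical stream: tie the handlers in the "my-module" event group
# to the stream, so disabling the stream can also disable them.
Log::create_stream(MyModule::LOG, [$columns=MyModule::Info,
                                   $path="my_module",
                                   $event_groups=set("my-module")]);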