Mirror/zeek - git.uphillsecurity.com: We code.

mirror of https://github.com/zeek/zeek.git synced 2025-10-02 14:48:21 +00:00

Author	SHA1	Message	Date
Tim Wojtulewicz	7e88a2b3fb	Add basic LLC, SNAP, and Novell 802.3 packet analyzers	2023-04-25 12:29:54 -07:00
Tim Wojtulewicz	f62f8e5cc9	Remove workaround for tunnels from IEEE 802.11 analyzer	2023-04-25 09:28:20 -07:00
Tim Wojtulewicz	5b1c6216bd	Fix IEEE 802.11 analyzer to properly forward tunneled packets This mostly happens with Aruba, but could possibly happen with other tunnels too.	2023-04-25 09:28:20 -07:00
Tim Wojtulewicz	69d72f3bbb	Expand support for Aruba protocol types in GRE analyzer This also fixes the GRE analyzer to forward into the IEEE 802.11 analyzer if it encounters Aruba packets with the proper protocol types. This way the QoS header can be handled correctly.	2023-04-25 09:28:20 -07:00
Arne Welzel	1b69b4d26f	Merge branch 'topic/amazingpp/irc-fuid-missing' of github.com:AmazingPP/zeek * 'topic/amazingpp/irc-fuid-missing' of github.com:AmazingPP/zeek: Add irc_dcc_send_ack event and fix missing fields I've moved IRC_Data back into the zeek::analyzer::file namespace, but we did move the declaration from protocol/file/File.h to protocol/irc/IRC.h. But, if someone actually customized IRC_Data and didn't include protocol/irc/IRC.h for other reasons, I'll be surprised (and also just suggest to update the include).	2023-04-24 18:22:50 +02:00
Arne Welzel	ffb73e4de9	Merge remote-tracking branch 'origin/topic/awelzel/add-community-id' * origin/topic/awelzel/add-community-id: testing/external: Bump hashes for community_id addition NEWS: Add entry for Community ID policy: Import zeek-community-id scripts into protocols/conn frameworks/notice Add community_id_v1() based on corelight/zeek-community-id	2023-04-24 10:12:56 +02:00
Fupeng Zhao	161ffb4192	Add irc_dcc_send_ack event and fix missing fields	2023-04-24 07:29:51 +00:00
Christian Kreibich	99de7b7526	Add community_id_v1() based on corelight/zeek-community-id "Community ID" has become an established flow hash for connection correlation across different monitoring and storage systems. Other NSMs have had native and built-in support for Community ID since late 2018. And even though the roots of "Community ID" are very close to Zeek, Zeek itself has never provided out-of-the-box support and instead required users to install an external plugin. While we try to make that installation as easy as possible, an external plugin always sets the bar higher for an initial setup and can be intimidating. It also requires a rebuild operation of the plugin during upgrades. Nothing overly complicated, but somewhat unnecessary for such popular functionality. This isn't a 1:1 import. The options are parameters and the "verbose" functionality has been removed. Further, instead of a `connection` record, the new bif works with `conn_id`, allowing computation of the hash with little effort on the command line: $ zeek -e 'print community_id_v1([$orig_h=1.2.3.4, $orig_p=1024/tcp, $resp_h=5.6.7.8, $resp_p=80/tcp])' 1:RcCrCS5fwYUeIzgDDx64EN3+okU Reference: https://github.com/corelight/zeek-community-id/	2023-04-21 20:44:09 +02:00
Jan Grashoefer	88c86cc7d4	Add hook into cluster connection setup.	2023-04-21 19:04:52 +02:00
Jan Grashoefer	c7626d797f	Add broadcast_topics set. This set contains the topics to reach all cluster nodes. Due to broker's forwarding mechanism, we cannot define a single broadcast topic, as it would create routing loops.	2023-04-21 19:04:52 +02:00
Jan Grashoefer	3db8bb4a44	Generalize Cluster::worker_count.	2023-04-21 19:04:39 +02:00
Arne Welzel	d89f16dfc9	logging: Support rotation_postprocessor_command_env This new table provides a mechanism to add environment variables to the postprocessor execution. Use case is from ZeekControl to inject a suffix to be used when running with multiple logger.	2023-04-17 13:10:14 +00:00
Arne Welzel	a5e7faf564	logging/Manager: Fix crash for rotation format function not returning While working on a rotation format function, ran into Zeek crashing when not returning a value from it, fix and recover the same way as for scripting errors.	2023-04-13 09:23:51 +02:00
Tim Wojtulewicz	f701f1fc94	Merge remote-tracking branch 'security/topic/awelzel/152-smtp-validate-mail-transactions' * security/topic/awelzel/152-smtp-validate-mail-transactions: smtp: Validate mail transaction and disable SMTP analyzer if excessive generic-analyzer-fuzzer: Detect disable_analyzer() from scripts	2023-04-11 15:16:25 -07:00
Tim Wojtulewicz	c670f3fdb2	Merge remote-tracking branch 'security/topic/awelzel/148-ftp-skip-get-pending-commands-multi-line-response' * security/topic/awelzel/148-ftp-skip-get-pending-commands-multi-line-response: ftp/main: Special case for intermediate reply lines ftp/main: Skip get_pending_command() for intermediate reply lines	2023-04-11 14:50:55 -07:00
Tim Wojtulewicz	67802e711a	Report packet statistics via the telemetry framework	2023-04-06 13:41:09 -07:00
Tim Wojtulewicz	ae3d6a4df0	Add optional packet filtered statistics for packet sources	2023-04-06 09:47:04 -07:00
Arne Welzel	7665e808a2	ftp/main: Special case for intermediate reply lines The medium.trace in the private external test suite contains one session/server that violates the multi-line reply protocol and happened to work out fairly well regardless due to how we looked up the pending commands unconditionally before. Continue to match up reply lines that "look like they contain status codes" even if cont_resp = T. This still improves runtime for the OSS-Fuzz generated test case and keeps the external baselines valid. The affected session can be extracted as follows: zcat Traces/medium.trace.gz \| tcpdump -r - 'port 1491 and port 21' We could push this into the analyzer, too, minimally the RFC says: > If an intermediary line begins with a 3-digit number, the Server > must pad the front to avoid confusion.	2023-04-03 14:05:13 +02:00
Arne Welzel	f00d6198af	PktSrc: Introduce Pcap::non_fd_timeout Increasing this value 10x has lowered CPU usage on a Myricom based deployment significantly with reportedly no adverse side-effects. After reviewing the Zeek 3 IO loop, my hunch is that previously when no packets were available, we'd sleep 20usec every loop iteration after calling ->Process() on the packet source. With current master ->Process() is called 10 times on a packet source before going to sleep just once for 20 usec. Likely this explains the increased CPU usage reported. It's probably too risky to increase the current value, so introduce a const &redef value for advanced users to tweak it. A middle ground might be to lower ``io_poll_interval_live`` to 5 and increase the new ``Pcap::non_fd_timeout`` setting to 100usec. While this doesn't really fix #2296, we now have enough knobs for tweaking. Closes #2296.	2023-03-31 18:48:08 +02:00
Arne Welzel	b8dc6ad120	smtp: Validate mail transaction and disable SMTP analyzer if excessive An invalid mail transaction is determined as * RCPT TO command without a preceding MAIL FROM * a DATA command without a preceding RCPT TO and logged as a weird. The testing pcap for invalid mail transactions was produced with a Python script against a local exim4 configured to accept more errors and unknown commands than 3 by default: # exim4.conf.template smtp_max_synprot_errors = 100 smtp_max_unknown_commands = 100 See also: https://www.rfc-editor.org/rfc/rfc5321#section-3.3	2023-03-27 18:41:47 +02:00
Arne Welzel	1b3e8a611e	ftp/main: Skip get_pending_command() for intermediate reply lines Intermediate lines of multiline replies usually do not contain valid status codes (even if servers may opt to include them). Their content may be anything and likely unrelated to the original command. There's little reason for us trying to match them with a corresponding command. OSS-Fuzz generated a large command reply with very many intermediate lines which caused long processing times due to matching every line with all currently pending commands. This is a DoS vector against Zeek. The new ipv6-multiline-reply.trace and ipv6-retr-samba.trace files have been extracted from the external ipv6.trace.	2023-03-23 13:50:36 +01:00
Arne Welzel	d4e31e7d2b	RunState: Implement forward_network_time_if_applicable() Add a central place where the decision when it's okay to update network time to the current time (wallclock) is. It checks for pseudo_realtime and packet source existence as well as packet source idleness. A new const &redef allows to completely disable forwarding of network time.	2023-03-23 12:40:39 +01:00
Jan Grashoefer	1882307cf3	Add pcap_file option to supervised nodes. This allows to start Supervised nodes with a pcap_file argument rather than interface. This is based on changes from @J-Gras.	2023-03-21 16:18:02 +01:00
Arne Welzel	46c432dc8b	iosource: Make poll intervals configurable This probably should not be changed by users, but it's useful for testing and experimentation rather than needing to recompile. Processing 100 packets without checking an FD based IO source can actually mean that FD based sources are never checked during a read of a very small pcap...	2023-03-21 09:15:33 +01:00
Arne Welzel	cf2da5160b	dns: Remove AD and CD flags from log There was a misunderstanding whether to include them by default in the dns.log, so remove them again. There had also been a discussion and quirk that AD of a request would always be overwritten by reply in the dns.log unless the reply is missing. For now, let users extend dns.log themselves for what best fits their requirements, rather than adding these flags by default. Add a btest to print AD and CD flags for smoke testing still.	2023-03-16 10:09:27 +01:00
Christian Kreibich	693d8e9251	Treat private address space as site-local by default This makes Site::private_address_space work like a subset of Site::local_nets, to match many user's intuition of how we should treat site locality out of the box. As config options, changes/redefs to Site::private_address_space propagate to Site::local_nets, while changes to the latter don't affect the former. A new global bit `Site::private_address_space_is_local` controls the behavior. It defaults to true, and redefing to false brings back the original behavior.	2023-03-15 17:01:00 -07:00
Christian Kreibich	19829765d4	Provide a mechanism to suppress logging of internal config framework activity	2023-03-15 17:01:00 -07:00
Arne Welzel	33090d7a27	Merge branch 'dnssec-flag-parse' of github.com:micrictor/zeek-codespace * 'dnssec-flag-parse' of github.com:micrictor/zeek-codespace: Update external testing commit hash for DNS flag changes Parse DNSSEC AD and CD bits Updated dump-events baseline which seemed unrelated.	2023-03-14 10:35:50 +01:00
Michael R. Torres	fe8390c646	Parse DNSSEC AD and CD bits Parse authentic data (AD) and checking disabled (CD) bits according to RFC 2535. Leaves the Z field as-is, in case users are already handling this elsewhere and depend on the value being the integer for all 3 bits. https://www.rfc-editor.org/rfc/rfc2535#section-6.1 Fixes #2672	2023-03-13 14:35:06 -07:00
Arne Welzel	2251c67e56	get_dns_stats: Expose total cache size and cached text entries It wasn't possible from script land to determine the total size of the cache table held by the DNS_Mgr. Add the total and also also the TEXT entries count.	2023-03-10 09:22:45 +01:00
Johanna Amann	989e9c29d2	X.509: expose the signature type inside the tbs certificate This change exposes the signature tyope inside the signed portion of an X.509 certificate. In the past, we only exposed the signature type that is contained inside the signature, which is outside the signed portion of the X.509 certificate. In theory, both signature fields should have the same value; it is, however, possible to encode differing values in both fields. The new field is not logged by default.	2023-02-28 19:24:16 +00:00
Arne Welzel	f56785740c	ftp: Limit user, password, arg and reply_msg column sizes in log The user and password fields are replicated to each of the ftp.log entries. Using a very large username (100s of KBs) allows to bloat the log without actually sending much traffic. Further, limit the arg and reply_msg columns to large, but not unbounded values.	2023-02-21 12:28:07 -07:00
Tim Wojtulewicz	8cf1e51623	Add max_size argument for find_all/find_all_ordered BIFs	2023-02-21 12:27:54 -07:00
Eldon Koyle	32afbae9db	Use a default analyzer Use a default analyzer instead of hardcoding a protocol number.	2023-02-16 19:39:27 -07:00
Eldon Koyle	56aa03031d	Simplify PBB analyzer by using Ethernet analyzer After the first 4 bytes, this traffic actually just looks like Ethernet. Rather than try to re-implement the ethernet analyzer, just check the length, skip 4 bytes, and pass it on.	2023-02-16 08:19:30 -07:00
Eldon Koyle	269cc15888	Cleanup and add customer MAC addresses * Put c-dst/c-src in l2_dst/l2_src * use #define instead of const int and move to PBB.h	2023-02-10 17:42:25 -07:00
Eldon Koyle	28d540483e	Add PBB (802.1ah) support	2023-02-10 15:30:01 -07:00
Arne Welzel	e4ab7b2d70	files/main: No empty file_ids When an analyzer calls DataIn(), there's a costly callback construct going through the event queue. If an analyzer does not have a get_file_handle() handler installed, the produced file_id would end up empty and ignored. Consequently, the get_file_handle() callback was invoked for every new DataIn() invocations. This is surprising and costly. Log a warning when this happens and instead set a generically generated file handle value instead to prevent the repeated get_file_handle() invocations.	2023-02-06 18:08:05 +01:00
Robin Sommer	bc252c63dc	Add BIF `have_spicy_analyzers()`. We previously used the Spicy plugin's `Spicy::available` to test for Spicy support. However, having Spicy support does not necessarily mean that we have built Zeek with its in-tree Spicy analyzers: the Spicy plugin could have been pulled in from external. The new BIF now reliably tells us whether the Spicy analyzers are available; its result corresponds to what `zeek-config --have-spicy-analyzers` returns as well. We also move the two current checks over to use this BIF. (Note: I refrained from renaming the CMake-side `USE_SPICY_ANALYERS` to `HAVE_SPICY_ANALYZERS`. We should do this eventually for consistency, but I didn't want to make more changes than necessary right now.)	2023-02-03 13:47:26 +01:00
Tim Wojtulewicz	0fd335f7f0	Merge remote-tracking branch 'security/topic/timw/131-smb-fscontrol-overflow' * security/topic/timw/131-smb-fscontrol-overflow: Restore/rename field in SMB2::Fscontrol record type	2023-02-01 10:48:16 -07:00
Robin Sommer	04a1ead978	Provide infrastructure to migrate legacy analyzers to Spicy. As initial examples, this branch ports the Syslog and Finger analyzers over. We leave the old analyzers in place for now and activate them iff we compile without any Spicy. Needs `zeek-spicy-infra` branches in `spicy/`, `spicy-plugin/`, `CMake/`, and `zeek/zeek-testing-private`. Note that the analyzer events remain associated with the Spicy plugin for now: that's where they will show up with `-NN`, and also inside the Zeekygen documentation. We switch CMake over to linking the runtime library into the plugin, vs. at the top-level through object libraries.	2023-02-01 11:33:48 +01:00
Arne Welzel	be44c642e1	Merge remote-tracking branch 'origin/topic/awelzel/move-disabling-analyzer-out-of-global' * origin/topic/awelzel/move-disabling-analyzer-out-of-global: analyzer: Move disabling_analyzer() hook into Analyzer module	2023-01-31 14:48:56 +01:00
Arne Welzel	f35cf228dc	broker/store: Extend SQLiteOptions around data safety and performance Add configurability of synchronous and journal_mode for SQLite backed Broker data stores. Setting these to synchronous=normal and journal_mode=wal can significantly improve throughput at the cost of some durability in the presence of power loss or OS crash. In the context of Zeek, this is likely more than acceptable. Additionally, add integrity_check and failure_mode options to support deleting and re-opening a corrupted SQLite database at store creation. Closes #2698	2023-01-30 10:25:37 +01:00
Tim Wojtulewicz	84ac362c67	Restore/rename field in SMB2::Fscontrol record type `b41a4bf06d` removed a field from this record because it had a duplicate name as another field. The field does need to exist, but it needs the correct name.	2023-01-27 17:39:10 -07:00
Arne Welzel	8be8c22b3e	smb1: Prevent accessing uninitialized referenced_tree The added pcap was created from an OSS Fuzz test case and is borderline valid SMB traffic, but it triggered a scripting error. Closes #2726	2023-01-27 19:22:13 +01:00
Arne Welzel	672602dae7	MySQL: Fix endianness, introduce mysql_eof() event We were parsing MySQL using bigendian even though the protocol is specified as with "least significant byte first" [1]. This is most problematic when parsing length encoded strings with 2 byte length fields... Further, I think, the EOF_Packet parsing was borked, either due to testing the CLIENT_DEPRECATE_EOF with the wrong endianness, or due to the workaround in Resultset processing raising mysql_ok(). Introduce a new mysql_eof() that triggers for EOF_Packet's and remove the fake mysql_ok() Resultset invocation to fix. Adapt the mysql script and tests to account for the new event. This is a quite backwards incompatible change on the event level, but due to being quite buggy in general, doubt this matters to many. I think there is more buried, but this fixes the violation of the simple "SHOW ENGINE INNODB STATUS" and the existing tests continue to succeed... [1] https://dev.mysql.com/doc/dev/mysql-server/latest/page_protocol_basic_dt_integers.html	2023-01-27 10:59:23 +01:00
Tim Wojtulewicz	6cfb45d24f	Merge remote-tracking branch 'jeff-bb/patch-2' * jeff-bb/patch-2: Log raw keyboard value on best guess Avoid excessive fmt calls, return default behavior on unknown "Best Guess" unknown keyboard / language variants	2023-01-23 12:50:23 -07:00
jeff-bb	7085104c33	Log raw keyboard value on best guess	2023-01-23 09:12:48 -06:00
Arne Welzel	26b1558cd1	analyzer: Move disabling_analyzer() hook into Analyzer module When disabling_analyzer() was introduced, it was added to the GLOBAL module. The awkward side-effect is that implementing a hook handler in another module requires to prefix it with GLOBAL. Alternatively, one can re-open the GLOBAL module and implement the handler in that scope. Both are not great, and prefixing with GLOBAL is ugly, so move the identifier to the Analyzer module and ask users to prefix with Analyzer.	2023-01-23 12:22:05 +01:00
jeff-bb	04113b13d5	Avoid excessive fmt calls, return default behavior on unknown Using "in" to query the language const. This also handles the case of not having a best guess and continue using the existing behavior. Given keyboard_layout = 1033 (0x0409), "keyboard-English - United States" keyboard_layout = 66569 (0x00010409), "keyboard-English - United States (Best Guess)" keyboard_layout = 12345 (0x3039), "keyboard-12345"	2023-01-20 08:29:55 -06:00

... 3 4 5 6 7 ...

2808 commits