This allows us to handle loss of Broker peerings, updating instance state as we
see instances go away. This also tweaks logging slightly to differentiate
between an instance checking in for the first time, and checking in when the
controller already knows it.
These callbacks are handy for stringing together codepaths separated by event
request/response transactions: when such a transaction completes, the callback
can locate the parent request of the finished one and continue its processing.
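As an illustration of the pattern (the record and field names here are
assumptions, not the framework's actual identifiers), a request can carry an
optional completion callback alongside a link to its parent:

    type Request: record {
        id: string;
        # The request that spawned this one, if any:
        parent_id: string &optional;
        # Invoked once this request's transaction completes:
        finish: function(reqid: string) &optional;
    };

    function finish_request(req: Request)
        {
        if ( req?$finish )
            req$finish(req$id);
        }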
When an agent is already running the configuration it's asked to deploy,
it will now recognize this and by default do nothing. The requester can force
it if needed, via a new argument to the deploy_request event.
The agent's Broker::peer_added handler now recognizes the Supervisor and no
longer triggers a notify_agent_hello event in response to it. The agent might
still send such events repeatedly as other endpoints peer with it.
The controller now knows three states that a cluster configuration can be in:
- STAGED: as uploaded by the client
- READY: with needed tweaks applied, e.g. to fill in ports
- DEPLOYED: as sent off to agents for deployment
These states aren't exclusive; they are checkpoints that a configuration goes
through from upload to deployment. A deployed configuration will also exist in
its STAGED and READY versions, until a client uploads a new configuration,
which overwrites the STAGED and READY ones.
The controller saves all of these in a table, which lets us use Broker to
persist all states to disk. We use &broker_allow_complex_type, since we only
ever store entire configurations.
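A minimal sketch of the approach; the enum and table names are illustrative,
not the controller's actual identifiers:

    type ConfigState: enum { STAGED, READY, DEPLOYED };

    # Broker persists this table to disk. &broker_allow_complex_type is
    # required because the stored values are complete configuration
    # records, not atomic types.
    global g_configs: table[ConfigState] of Management::Configuration
        &broker_allow_complex_type &backend=Broker::SQLITE;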
This splits uploading a configuration and deploying it to the instances into
separate event transactions. set_configuration_request/response remains, but
now only validates the new configuration and, upon success, stores it (not yet
persisted to disk). The response event indicates success or the list of
validation errors. A successful upload now returns the configuration's ID in
the result record's data field.
The new deploy_request/response event takes a previously uploaded configuration
and deploys it to the agents.
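Sketched out, the controller-side API now splits roughly as follows (the
signatures are assumptions for illustration, not the authoritative
definitions):

    module Management::Controller::API;

    export {
        # Validate and store a configuration; nothing gets deployed here.
        global set_configuration_request: event(reqid: string,
            config: Management::Configuration);
        global set_configuration_response: event(reqid: string,
            result: Management::Result);

        # Deploy a previously uploaded configuration to the agents.
        global deploy_request: event(reqid: string);
        global deploy_response: event(reqid: string,
            results: Management::ResultVec);
    }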
The controller now tracks uploaded and deployed configurations
separately. Uploading assigns g_config_staged; deployment assigns
g_config_deployed. Deployment does not affect g_config_staged.
The get_config_request/response event pair now allows selecting the
configuration the caller would like to retrieve.
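For example, the selection could be a flag on the request (argument name
assumed):

    # T retrieves the deployed configuration, F the staged one.
    global get_config_request: event(reqid: string, deployed: bool);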
This renames the agent's functionality for setting a configuration to reflect
the controller's upcoming separation of set_configuration and deployment.
The instance and error fields are now optional instead of defaulting to empty
strings; the empty-string defaults caused minor output deviations in the client.
Agents now ensure that any Result record they create has the instance field
filled in.
During `set_configuration_request` handling the controller now validates
received configurations, checking for a few common gotchas around naming and
port use. Validation does not stop at the first problem it finds; the result is
a list summarizing all identified problems.
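The collect-all-problems approach boils down to something like the following
(the configuration's field names are assumptions):

    function validate_config(config: Management::Configuration): vector of string
        {
        local problems: vector of string = vector();
        local names_seen: set[string];
        local ports_seen: set[port];

        for ( node in config$nodes )
            {
            if ( node$name in names_seen )
                problems[|problems|] = fmt("node '%s' defined more than once", node$name);
            add names_seen[node$name];

            if ( node?$p )
                {
                if ( node$p in ports_seen )
                    problems[|problems|] = fmt("port %s assigned more than once", node$p);
                add ports_seen[node$p];
                }
            }

        # No early bail-out: the caller receives the full list.
        return problems;
        }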
The port numbering process now accounts for possible collisions with the
agent's port, as well as with ports explicitly assigned in the configuration.
It also avoids the nondeterminism that could result from set traversal order.
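Conceptually, the enumeration just skips anything already taken, from a
deterministic starting point (helper name assumed):

    function next_free_port(p: port, taken: set[port]): port
        {
        while ( p in taken )
            p = count_to_port(port_to_count(p) + 1, tcp);
        return p;
        }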
It helps during testing to be able to control whether the Supervisor process
also routes node output to the console, in addition to writing it to output
files. Since the Supervisor runs as the main process in Docker containers, its
output becomes visible in "docker logs" that way, simplifying diagnostics.
When the controller receives a configuration with no instances (and thus no
nodes), it needs no roundtrip to the agents and can send the response right away.
This makes agents handle log archival automatically. By default, they invoke
zeek-archiver once every log rotation interval to archive rotated files from the
log-queue spool directory into the installation's log directory. The user can
disable the feature, customize the command to invoke, and adjust the rotation
interval.
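Configuration reduces to a few redefs along these lines (the option names are
assumptions; consult the agent's configuration for the real ones):

    redef Management::Agent::archive_logs = F;           # disable archival
    redef Management::Agent::archive_cmd = "/usr/local/zeek/bin/zeek-archiver";
    redef Management::Agent::archive_interval = 30 min;  # custom interval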
Up to now, agents and controllers listened locally only, and the Supervisor
(which listens when we run an agent) listened globally. It's now the other way
around: controllers and agents listen globally and the Supervisor, when
listening, does so locally.
This enables the controller to assign listening ports to managers, loggers, and
proxies. (We don't currently make the workers listen.) The feature is controlled
by the Management::Controller::auto_assign_ports flag. When enabled (the
default), enumeration starts from Management::Controller::auto_assign_start_port,
beginning with the manager, then the logger(s), then proxy(s). When the feature
is disabled and a node that requires a port lacks one, the controller rejects
the configuration.
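For example, to shift the enumeration range (the start port value is
arbitrary):

    redef Management::Controller::auto_assign_ports = T;
    redef Management::Controller::auto_assign_start_port = 10000/tcp;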
The get-nodes command also benefits from showing state for all connected agents
(as opposed to just the agents relevant to the current configuration).
Also a bugfix: ensure we use an agent's IP address as seen by the
controller. This avoids reporting "0.0.0.0" in some cases.
This response so far contained only the connected instances that are relevant to
the current configuration, but this isn't very helpful when troubleshooting
instance connectivity. It now reports all currently connected instances, with
network addresses & ports as known to Broker.
This swaps the host event argument for the Broker ID. The latter is more useful,
since the sending agent doesn't necessarily know its IP address as visible to
the controller, and the controller can pull up the full Broker context via the
ID.
It also adds an explicit argument to the event to indicate whether the agent
connected to the controller or vice versa. This simplifies the controller's
internal logic.
Also minor tweaks to logging to show Broker IDs.
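Assuming this concerns notify_agent_hello, its shape now looks roughly like
this (argument names are assumptions):

    global notify_agent_hello: event(instance: string,
        # The agent's Broker ID, replacing the previous host argument:
        id: string,
        # T if the agent connected to the controller, F if the controller
        # connected to the agent:
        connecting: bool);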
* zeek-as-org/as-org:
Mark lookup_asn() BIF as deprecated in v6.1
Define geo_autonomous_system record type
Add lookup_autonomous_system() BIF that returns AS number and org
* topic/christian/gh-2134-fix-intel-test-races:
Expand scripts.base.frameworks.intel.cluster-transparency test
Fix races in scripts.base.frameworks.intel.cluster-transparency-with-proxy test
Add Intel::send_store_on_node_up boolean to control min_data_store delivery
This exposes Broker's new WebSocket support in Zeek. To enable it,
call `Broker::listen_websocket()`. Zeek will then start listening on
port 9997 for incoming WebSocket connections.
See the Broker documentation for a description of the message format
expected over these WebSocket connections.
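A minimal way to turn it on:

    event zeek_init()
        {
        # Starts the WebSocket server on the default port (9997/tcp).
        Broker::listen_websocket();
        }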
This adds a redefinable const to the internals of the Intel framework, to allow
suppressing the manager's delivery of its current min_data_store when a worker
connects. That delivery is desirable for nodes that check in "late", to bring
them up to speed, but during testing it introduces nondeterminism.
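Test setups can switch the delivery off like so:

    redef Intel::send_store_on_node_up = F;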
This uses the new frameworks/management/supervisor functionality to maintain
stdout/stderr files, and hooks output context into set_configuration error
results.
This improves the framework's handling of Zeek node stdout and stderr by
extending the (script-layer) Supervisor functionality.
- The Supervisor _either_ directs Zeek nodes' stdout/stderr to files _or_ lets
you hook into the streams at the script level. We'd like both: files allow
inspection outside of the framework, and the framework benefits from tapping
into the streams, e.g. for error context. The Supervisor now provides the file
redirection functionality in addition to the hook mechanism. The hook mechanism
also builds up rolling windows (up to 100 lines each, configurable) of each
node's stdout and stderr.
- The new Management::Supervisor::API::notify_node_exit event notifies
subscribers (agents, really) that a particular node has exited (and is possibly
being restarted by the Supervisor). The event includes the name of the node,
plus its recent stdout/stderr context (a handler sketch follows below).
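The layout and name of the record carrying the output context are assumptions
here:

    event Management::Supervisor::API::notify_node_exit(node: string,
        outputs: Management::Supervisor::NodeOutputs)
        {
        # Surface the node's recent stderr context for troubleshooting.
        Reporter::warning(fmt("node %s exited, stderr context: %s",
            node, outputs$stderr));
        }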
During Zeekygen's doc generation both the agent's and controller's main.zeek get
loaded. This just happened to not throw errors so far because the redefs either
matched perfectly or used different field names.
We so far reported one result record per agent, which made it hard to report
per-node outcomes for the new configuration. Agents now report one result record
per node they're responsible for.
When the controller relays requests to agents, we want agents to time out more
quickly than the corresponding controller requests. This allows agents to
respond with more meaningful errors, while the controller's timeout acts mostly
as a last resort to ensure a response to the client actually happens.
This dials down the table_expire_interval to 2 seconds in both agent and
controller, for more predictable timeout behavior. It also dials the agent-side
request expiration interval down to 5 seconds, compared to the controller's 10
seconds.
We may have to revisit this to allow custom expiration intervals per
request/response message type.
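In redef terms, the tuning amounts to the following (the request-timeout option
name is an assumption):

    # Agent and controller both shorten table expiration checks:
    redef table_expire_interval = 2 sec;

    # Agent-side request expiration; the controller keeps 10 seconds:
    redef Management::Request::timeout_interval = 5 sec;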
We so far hoped for the best when an agent asked the Supervisor to launch a
node. Since the Management::Node::API::notify_node_hello events arriving from
new nodes signal that such nodes are up and running, we can use those events to
track when (and whether) all launched nodes have checked in, and respond
accordingly.
This delays the set_configuration_response event until these check-ins have
occurred or a timeout kicks in. In case of error, the agent's response to the
controller is in an error state and carries the remaining, unresponsive or
failed nodes in its data member.
The Supervisor generates this event every time it receives a status update from
the stem, meaning a node got created or re-created. A corresponding
SupervisorControl::node_status event relays the same information for users
interacting with the Supervisor over Broker.
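Reacting to it could look like this; the signature shown (node name plus PID)
is an assumption:

    event Supervisor::node_status(node: string, pid: count)
        {
        print fmt("Supervisor (re)created node %s, PID %d", node, pid);
        }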
* topic/christian/management-cluster-dirs:
Management framework: bump zeek-client to pull in instance serialization fixes
Management framework: bump external cluster testsuite
Management framework: update agent-checkin test to reflect recent changes
Management framework: place each Zeek process in its own working dir
Management framework: set defaults for log rotation and persistent state
Management framework: add spool and state directory config settings
Management framework: establish stdout/stderr files also for cluster nodes
Management framework: default to having agents check in with the (local) controller
Management framework: move role variable from logging into framework-wide config
Management framework: distinguish supervisor/supervisee when loading agent/controller
Management framework: simplify agent and controller stdout/stderr files
Management framework: prefix the management logs with "management-"
Management framework: comment and layouting tweaks, no functional change
Management framework: rename env var that labels agents/controllers
Management framework: increase robustness of agent/controller naming
This establishes a directory "nodes" in Management::state_dir and places each
Zeek process into a subdirectory in it, named after the Zeek process. For
example, node "worker-01" runs with cwd <state_dir>/nodes/worker-01/.
Explicitly configured directories can override the naming logic, and also ignore
the state directory if they're absolute paths. One exception remains: the
Supervisor itself -- we'd have to use LogAscii::logdir to automatically place it
too in its own directory, but that feature currently does not interoperate with
log rotation.
This adds management/persistence.zeek to establish common configuration for log
rotation and persistent variable state. Log-writing Zeek processes initially
write locally in their working directory, and rotate into subdirectory
"log-queue" of the spool. Since agent and controller have no logger,
persistence.zeek puts in place compatible configurations for them.
Storage folders for Broker-backed tables and clusterized stores default to
subdirectories of the new Zeek-level state folder.
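The relevant knobs reduce to directory settings along these lines (the paths
are examples; the spool option name is an assumption):

    redef Management::state_dir = "/var/lib/zeek/management";
    redef Management::spool_dir = "/var/spool/zeek/management";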
When setting the ZEEK_MANAGEMENT_TESTING environment variable, persistent state
is kept in the local directory, and log rotation remains disabled.
This also tweaks @loads a bit in favor of simply loading frameworks/management,
which is easier to keep track of.