Mirror/zeek - git.uphillsecurity.com: We code.

mirror of https://github.com/zeek/zeek.git synced 2025-10-03 15:18:20 +00:00

Author	SHA1	Message	Date
Josh Soref	21e0d777b3	Spelling fixes: scripts * accessing * across * adding * additional * addresses * afterwards * analyzer * ancillary * answer * associated * attempts * because * belonging * buffer * cleanup * committed * connects * database * destination * destroy * distinguished * encoded * entries * entry * hopefully * image * include * incorrect * information * initial * initiate * interval * into * java * negotiation * nodes * nonexistent * ntlm * occasional * omitted * otherwise * ourselves * paragraphs * particular * perform * received * receiver * referring * release * repetitions * request * responded * retrieval * running * search * separate * separator * should * synchronization * target * that * the * threshold * timeout * transaction * transferred * transmission * triggered * vetoes * virtual Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com>	2022-11-02 17:36:39 -04:00
Christian Kreibich	e73b561dca	Update Management framework to new Supervisor::NodeConfig script fields	2022-09-02 12:12:19 -07:00
Christian Kreibich	fb733eb664	Management framework: log node set in dispatch requests cleanly Converting to a (sorted) vector both renders the empty set cleanly (without whitespace) and ensures consistent ordering.	2022-08-09 15:12:39 -07:00
Christian Kreibich	7d4dd22aba	Management framework: log additional node events	2022-08-09 15:12:10 -07:00
Christian Kreibich	63291ba2df	Management framework: upon deployment, make agent log multiple node results This erroneously only logged the result of the last node iterated over.	2022-08-09 15:11:31 -07:00
Christian Kreibich	e947e1d1c2	Management framework: additional context in a few log messages This adds request IDs in a few places that didn't mention them, and makes requests to the Supervisor that act on all current nodes explicit.	2022-07-11 13:00:35 -07:00
Christian Kreibich	f6597ffabf	Management framework: await Supervisor peering before sending agent's hello Failing to do so could open a race condition in which a quickly connecting controller could send instructions whose resulting Supervisor interactions got lost.	2022-07-11 13:00:35 -07:00
Christian Kreibich	a505a7814f	Management framework: remove outdated comment The agent has a request_expired timeout handler at this point.	2022-07-11 13:00:35 -07:00
Christian Kreibich	3aa0409792	Management framework: edit pass over docstrings This expands cross-referencing in the doc strings and adds a bit more explanation.	2022-06-22 23:26:11 -07:00
Christian Kreibich	b9879a50a0	Management framework: node restart support This adds restart request/response event pairs that restart nodes in the running Zeek cluster. The implementation is very similar to get_id_value, which also involves distributing a list of nodes to agents and aggregating the responses.	2022-06-22 23:26:11 -07:00
Christian Kreibich	bd39207772	Management framework: more consistent Supervisor interaction in the agent This declares our helper functions for sending events to the Supervisor, and makes them return the created request objects to enable the caller to modify them. It also adds a helper for restart and status requests, uses the helpers throughout the module, and makes all handlers more resilient in case Supervisor events other than the agent's arrive.	2022-06-22 23:26:11 -07:00
Christian Kreibich	1af9bba76e	Management framework: minor timeout bugfix The timeout result wasn't actually stored in requests timing out in the agent. (So far that's for deployment requests.) Also log the timing out of any request state, similar to the controller.	2022-06-22 23:25:15 -07:00
Christian Kreibich	b2f9e29bae	Management framework: make "result" argument plural in multi-result response events No functional change, just a consistency tweak. Since agent and controller send response events via Broker::publish(), the arguments aren't named and so this only affects the API definition.	2022-06-22 23:25:15 -07:00
Christian Kreibich	a622e28eab	Management framework: more resilient node shutdown upon deployment When agents had to terminate existing Zeek cluster nodes at the beginning of a new deployment, they so far used their internal state to look up the nodes and fired off requests to the Supervisor to shut these down. This has a problem: when an agent restarts unexpectedly, it has no internal state, and when it then tries to create nodes that already exist, the Supervisor complains with error messages. To avoid this, the agent now tears down all Supervised nodes other than agents and controllers. In order to do so, it first needs to query the Supervisor for the current node status, which means there are now two such status requests: one upon deployment, and one during get_nodes requests. In order to disambiguate these contexts in the SupervisorControl::status_request/response transactions, we use the finish() callback in the corresponding request state to continue execution as needed.	2022-06-21 17:22:45 -07:00
Christian Kreibich	633535d8da	Management framework: tweak Supervisor event logging We now log Supervisor event interaction just like we do transmission/receipt of other Management framework events.	2022-06-21 17:22:45 -07:00
Christian Kreibich	d367f1bad9	Management framework: agents now skip re-deployment of current config When an agent is already running the configuration it's asked to deploy, it will now recognize this and by default do nothing. The requester can force it if needed, via a new argument to the deploy_request event.	2022-06-21 17:22:45 -07:00
Christian Kreibich	a68ee13939	Management framework: suppress notify_agent_hello upon Supervisor peering The agent's Broker::peer_added handler now recognizes the Supervisor and does not trigger a notify_agent_hello event upon it. It might still send such events repeatedly as other things peer with the agent.	2022-06-21 17:22:45 -07:00
Christian Kreibich	0480b5f39c	Management framework: rename agent "set_configuration" to "deploy" This renames the agent's functionality for setting a configuration to reflect the controller's upcoming separation of set_configuration and deployment.	2022-06-21 17:22:45 -07:00
Christian Kreibich	f353ac22a5	Management framework: consistency fixes to the Result record The instance and error fields are now optional instead of defaulting to empty strings, which caused minor output deviations in the client. Agents now ensure that any Result record they create has the instance field filled in.	2022-06-21 17:22:45 -07:00
Christian Kreibich	a3fcd1462d	Management framework: make agents support zeek-archiver invocations This makes agents handle log archival automatically. By default, they invoke zeek-archiver once every log rotation interval to archive rotated files from the log-queue spool directory into the installation's log directory. The user can disable the feature, customize the command to invoke, and adjust the rotation interval.	2022-06-14 12:32:17 -07:00
Christian Kreibich	4c0543d0ed	Management framework: fix module naming typo This had no effect since this module name wasn't used anywhere else.	2022-06-14 12:32:17 -07:00
Christian Kreibich	64741b571e	Management framework: switch default network visibilities Up to now, agents and controllers listened locally only, and the Supervisor (which listens when we run an agent) listened globally. It's now the other way around: controllers and agents listen globally and the Supervisor, when listening, does so locally.	2022-06-08 15:00:19 -07:00
Christian Kreibich	72acf24f52	Management framework: expand notify_agent_hello event arguments This swaps the host event argument for the Broker ID. The latter is more useful, since the sending agent doesn't necessarily know its IP address as visible to the controller, and the controller can pull up the full Broker context via the ID. It also adds an explicit argument to the event to indicate whether the agent connected to the controller or vice versa. This simplifies the controller's internal logic. Also minor tweaks to logging to show Broker IDs.	2022-06-03 02:12:19 -07:00
Christian Kreibich	aa689807fa	Management framework: comment-only tweaks and typo fixes	2022-06-03 02:12:12 -07:00
Christian Kreibich	f10b94de39	Management framework: enable stdout/stderr reporting This uses the new frameworks/management/supervisor functionality to maintain stdout/stderr files, and hooks output context into set_configuration error results.	2022-05-31 12:55:21 -07:00
Christian Kreibich	f74f21767a	Management framework: disambiguate redef field names in agent and controller During Zeekygen's doc generation both the agent's and controller's main.zeek get loaded. This just happened to not throw errors so far because the redefs either matched perfectly or used different field names.	2022-05-31 12:55:21 -07:00
Christian Kreibich	49b9f1669c	Management framework: move to ResultVec in agent's set_configuration response We so far reported one result record per agent, which made it hard to report per-node outcomes for the new configuration. Agents now report one result record per node they're responsible for.	2022-05-31 12:55:21 -07:00
Christian Kreibich	83c60fd8ac	Management framework: tune request timeout granularity and interval When the controller relays requests to agents, we want agents to time out more quickly than the corresponding controller requests. This allows agents to respond with more meaningful errors, while the controller's timeout acts mostly as a last resort to ensure a response to the client actually happens. This dials down the table_expire_interval to 2 seconds in both agent and controller, for more predictable timeout behavior. It also dials the agent-side request expiration interval down to 5 seconds, compared to the agent's 10 seconds. We may have to revisit this to allow custom expiration intervals per request/response message type.	2022-05-31 12:55:21 -07:00
Christian Kreibich	4371c17d4c	Management framework: verify node starts when deploying a configuration We so far hoped for the best when an agent asked the Supervisor to launch a node. Since the Management::Node::API::notify_node_hello events arriving from new nodes signal when such nodes are up and running, we can use those events to track once/whether all launched nodes have checked in, and respond accordingly. This delays the set_configuration_response event until these checkins have occurred, or a timeout kicks in. In case of error, the agent's response to the controller is in error state and has the remaining, unresponsive/failed set of nodes as its data member.	2022-05-31 12:55:21 -07:00
Christian Kreibich	93ea03a081	Management framework: place each Zeek process in its own working dir This establishes a directory "nodes" in Management::state_dir and places each Zeek process into a subdirectory in it, named after the Zeek process. For example, node "worker-01" runs with cwd <state_dir>/nodes/worker-01/. Explicitly configured directories can override the naming logic, and also ignore the state directory if they're absolute paths. One exception remains: the Supervisor itself -- we'd have to use LogAscii::logdir to automatically place it too in its own directory, but that feature currently does not interoperate with log rotation.	2022-05-26 12:56:02 -07:00
Christian Kreibich	d1cd409e59	Management framework: set defaults for log rotation and persistent state This adds management/persistence.zeek to establish common configuration for log rotation and persistent variable state. Log-writing Zeek processes initially write locally in their working directory, and rotate into subdirectory "log-queue" of the spool. Since agent and controller have no logger, persistence.zeek puts in place compatible configurations for them. Storage folders for Broker-backed tables and clusterized stores default to subdirectories of the new Zeek-level state folder. When setting the ZEEK_MANAGEMENT_TESTING environment variable, persistent state is kept in the local directory, and log rotation remains disabled. This also tweaks @loads a bit in favor of simply loading frameworks/management, which is easier to keep track of.	2022-05-26 12:55:10 -07:00
Christian Kreibich	e305d9c613	Management framework: establish stdout/stderr files also for cluster nodes	2022-05-25 13:56:23 -07:00
Christian Kreibich	da016b8a68	Management framework: default to having agents check in with the (local) controller This allows single-machine settings to work out of the box when agent and cluster are loaded in Supervisor mode.	2022-05-25 13:56:23 -07:00
Christian Kreibich	b96a4276eb	Management framework: move role variable from logging into framework-wide config The role isn't just about logging, it can also act as a general indicator to key in on in role-specific code elsewhere, such as @if.	2022-05-25 13:56:23 -07:00
Christian Kreibich	e78fdc39e4	Management framework: distinguish supervisor/supervisee when loading agent/controller Load the agent/controller bootstrapping code only from the Supervisor, and the basic config only from a supervisee. When we're neither (which is likely a mistake), we do nothing.	2022-05-25 13:56:23 -07:00
Christian Kreibich	d40bb6e85f	Management framework: simplify agent and controller stdout/stderr files Moving to a model in which every Zeek process runs out of its own working directory simplifies the handling of those files.	2022-05-25 13:56:23 -07:00
Christian Kreibich	bd6c1683a2	Management framework: comment and layouting tweaks, no functional change Also remove additional instances of the term "data cluster".	2022-05-25 13:56:23 -07:00
Christian Kreibich	d4d6f10299	Management framework: rename env var that labels agents/controllers Just a consistency tweak to avoid confusion with "cluster".	2022-05-25 13:56:23 -07:00
Christian Kreibich	d2903bb645	Management framework: increase robustness of agent/controller naming The fallback mechanism when no explicit agent/controller names are configured didn't work properly, because many places in the code relied on accessing the name via the variables meant for explicit configuration, such as Management::Agent::name. Agent and controller now offer functions for computing the correct effective name, and we use that throughout.	2022-05-25 13:56:23 -07:00
Christian Kreibich	b23d292410	Management framework: consistency fixes around event() vs Broker::publish() Switch to using Broker::publish() for any event we only send to a peered entity, and not to drive local processing. Also minor indentation cleanup.	2022-04-26 23:23:58 -07:00
Christian Kreibich	7edd1a2651	Management framework: allow selecting cluster nodes in get_id_value This adds an optional set of cluster node names to narrow the querying to. It similarly expands the dispatch mechanism, since it likely most sense for any such request to apply only to a subset of nodes. Requests for invalid nodes trigger Response records in error state.	2022-04-18 12:38:54 -07:00
Christian Kreibich	fcef7f4925	Management framework: improve handling of node run states When agents receive a configuration, we don't currently honor requested run states (there's no such thing as registering a node but not running it, for example). To reflect this, we now start off nodes in state PENDING as we launch them via the Supervisor, and move them to RUNNING when they check in with us via Management::Node::API::notify_node_hello.	2022-04-15 18:51:56 -07:00
Christian Kreibich	497b2723d7	Management framework: add get_id_value dispatch This adds support for retrieving the value of a global identifier from any subset of cluster nodes. It relies on the lookup_ID() BiF to retrieve the val, and to_json() to render the value to an easily parsed string. Ideally we'd send the val directly, but this hits several roadblocks, including the fact that Broker won't serialize arbitrary values.	2022-04-15 18:51:56 -07:00
Christian Kreibich	788348f9d6	Management framework: allow dispatching "actions" on cluster nodes. This adds request/response event pairs to enable the controller to dispatch "actions" (pre-implemented Zeek script actions) on subsets of Zeek cluster nodes and collect the results. Using generic events to carry multiple such "run X on the nodes" scenarios simplifies adding these in the future.	2022-04-15 18:51:56 -07:00
Christian Kreibich	0020cc4af0	Management framework: some renaming to avoid the term "data cluster"	2022-04-15 18:51:56 -07:00
Christian Kreibich	337c7267e0	Management framework: allow agents to communicate with cluster nodes This provides Broker-level plumbing that allows agents to reach out to their managed Zeek nodes and collect responses. As a first event, it establishes Management::Node::API::notify_agent_hello, to notify the agent when the cluster node is ready to communicate. Also a bit of comment rewording to replace use of "data cluster" with simply "cluster", to avoid ambiguity with data nodes in SumStats, and expansion of test-all-policy.zeek and related/dependent tests, since we're introducing new scripts.	2022-04-15 18:51:54 -07:00
Christian Kreibich	54aaf3a623	Reorg of the cluster controller to new "Management framework" layout - This gives the cluster controller and agent the common name "Management framework" and changes the start directory of the sources from "policy/frameworks/cluster" to "policy/frameworks/management". This avoids ambiguity with the existing cluster framework. - It renames the "ClusterController" and "ClusterAgent" script modules to "Management::Controller" and "Management::Agent", respectively. This allows us to anchor tooling common to both controller and agent at the "Management" module. - It moves common configuration settings, logging, requests, types, and utilities to the common "Management" module. - It removes the explicit "::Types" submodule (so a request/response result is now a Management::Result, not a Management::Types::Result), which makes typenames more readable. - It updates tests that depend on module naming and full set of scripts.	2022-02-09 18:09:42 -08:00

47 commits