Mirror/zeek - git.uphillsecurity.com: We code.

mirror of https://github.com/zeek/zeek.git synced 2025-10-03 23:28:20 +00:00

Author	SHA1	Message	Date
Dominik Charousset	647fdf7737	Add facade types to avoid using raw Broker types By avoiding to use `broker::data` directly, we gain a degree of freedom that allows us to swap out `broker::data` for something else (e.g., `broker::variant`) in the future. Furthermore, it also helps us to keep Broker types "local" to the Broker manager and gives us a nicer interface. Also replaces uses of `broker::expected` with `std::optional`. While an `expected `can carry additional information as to why a value is not present, nothing in Zeek ever cared about that. Hence, using `std::optional` removes an unnecessary dependency on a Broker detail while also being more efficient (no extra heap allocation when no value is present).	2023-12-04 15:23:28 +01:00
Benjamin Bannier	f5a76c1aed	Reformat Zeek in Spicy style This largely copies over Spicy's `.clang-format` configuration file. The one place where we deviate is header include order since Zeek depends on headers being included in a certain order.	2023-10-30 09:40:55 +01:00
Tim Wojtulewicz	90d0bc64fa	Replace empty destructor bodies with =default definitions	2023-07-07 09:17:05 -07:00
Arne Welzel	7a043e5e8f	all: Fix typos identified by typos pre-commit hook	2023-06-13 17:57:32 +02:00
Tim Wojtulewicz	3b0e8ee6f1	Fix a bunch of missing class member initializations	2023-01-27 13:03:18 -07:00
Josh Soref	cd201aa24e	Spelling src These are non-functional changes. * accounting * activation * actual * added * addresult * aggregable * aligned * alternatively * ambiguous * analysis * analyzer * anticlimactic * apparently * application * appropriate * arithmetic * assignment * assigns * associated * authentication * authoritative * barrier * boundary * broccoli * buffering * caching * called * canonicalized * capturing * certificates * ciphersuite * columns * communication * comparison * comparisons * compilation * component * concatenating * concatenation * connection * convenience * correctly * corresponding * could * counting * data * declared * decryption * defining * dependent * deprecated * detached * dictionary * directional * directly * directory * discarding * disconnecting * distinguishes * documentation * elsewhere * emitted * empty * endianness * endpoint * enumerator * essentially * evaluated * everything * exactly * execute * explicit * expressions * facilitates * fiddling * filesystem * flag * flagged * for * fragments * guarantee * guaranteed * happen * happening * hemisphere * identifier * identifies * identify * implementation * implemented * implementing * including * inconsistency * indeterminate * indices * individual * information * initial * initialization * initialize * initialized * initializes * instantiate * instantiated * instantiates * interface * internal * interpreted * interpreter * into * it * iterators * length * likely * log * longer * mainly * mark * maximum * message * minimum * module * must * name * namespace * necessary * nonexistent * not * notifications * notifier * number * objects * occurred * operations * original * otherwise * output * overridden * override * overriding * overwriting * ownership * parameters * particular * payload * persistent * potential * precision * preexisting * preservation * preserved * primarily * probably * procedure * proceed * process * processed * processes * processing * propagate * propagated * prototype * provides * publishing * purposes * queue * reached * reason * reassem * reassemble * reassembler * recommend * record * reduction * reference * regularly * representation * request * reserved * retrieve * returning * separate * should * shouldn't * significant * signing * simplified * simultaneously * single * somebody * sources * specific * specification * specified * specifies * specify * statement * subdirectories * succeeded * successful * successfully * supplied * synchronization * tag * temporarily * terminating * that * the * transmitted * true * truncated * try * understand * unescaped * unforwarding * unknown * unknowndata * unspecified * update * usually * which * wildcard Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com>	2022-11-09 12:08:15 -05:00
Tim Wojtulewicz	7c4fd382d9	Code modernization: Convert from deprecated C standard library headers	2022-06-27 09:47:31 -07:00
Tim Wojtulewicz	b2f171ec69	Reformat the world	2021-09-16 15:35:39 -07:00
Tim Wojtulewicz	0618be792f	Remove all of the random single-file deprecations These are the changes that don't require a ton of changes to other files outside of the original removal.	2021-01-27 10:52:40 -07:00
Tim Wojtulewicz	96d9115360	GH-1079: Use full paths starting with zeek/ when including files	2020-11-12 12:15:26 -07:00
Tim Wojtulewicz	4b61d60e80	Fix indentation of namespaced aliases	2020-08-20 16:11:46 -07:00
Tim Wojtulewicz	f310795d79	Move probabilistic code into zeek namespaces	2020-08-20 15:55:17 -07:00
Tim Wojtulewicz	95d2af4501	Move constructors/operators should be marked noexcept to avoid the compiler picking the copy constructor instead (performance-noexcept-move-constructor)	2020-02-11 11:02:08 -08:00
Max Kellermann	0db61f3094	include cleanup The Zeek code base has very inconsistent #includes. Many sources included a few headers, and those headers included other headers, and in the end, nearly everything is included everywhere, so missing #includes were never noticed. Another side effect was a lot of header bloat which slows down the build. First step to fix it: in each source file, its own header should be included first to verify that each header's includes are correct, and none is missing. After adding the missing #includes, I replaced lots of #includes inside headers with class forward declarations. In most headers, object pointers are never referenced, so declaring the function prototypes with forward-declared classes is just fine. This patch speeds up the build by 19%, because each compilation unit gets smaller. Here are the "time" numbers for a fresh build (with a warm page cache but without ccache): Before this patch: 3144.94user 161.63system 3:02.87elapsed 1808%CPU (0avgtext+0avgdata 2168608maxresident)k 760inputs+12008400outputs (1511major+57747204minor)pagefaults 0swaps After this patch: 2565.17user 141.83system 2:25.46elapsed 1860%CPU (0avgtext+0avgdata 1489076maxresident)k 72576inputs+9130920outputs (1667major+49400430minor)pagefaults 0swaps	2020-02-04 20:51:02 +01:00
Dominik Charousset	c1f3fe7829	Switch from header guards to pragma once	2019-09-17 14:10:30 +02:00
Jon Siwek	8f19bbe589	Improve C++ header includes to improve build time Recent changes ended up including all the Broker headers more places than necessary, causing compile time to increase 2x.	2019-06-20 19:50:23 -07:00
Robin Sommer	01e662b3e0	Reimplement serialization infrastructure for OpaqueVals. We need this to sender through Broker, and we also leverage it for cloning opaques. The serialization methods now produce Broker data instances directly, and no longer go through the binary formatter. Summary of the new API for types derived from OpaqueVal: - Add DECLARE_OPAQUE_VALUE(<class>) to the class declaration - Add IMPLEMENT_OPAQUE_VALUE(<class>) to the class' implementation file - Implement these two methods (which are declated by the 1st macro): - broker::data DoSerialize() const - bool DoUnserialize(const broker::data& data) This machinery should work correctly from dynamic plugins as well. OpaqueVal provides a default implementation of DoClone() as well that goes through serialization. Derived classes can provide a more efficient version if they want. The declaration of the "OpaqueVal" class has moved into the header file "OpaqueVal.h", along with the new serialization infrastructure. This is breaking existing code that relies on the location, but because the API is changing anyways that seems fine. This adds an internal BiF "Broker::__opaque_clone_through_serialization" that does what the name says: deep-copying an opaque by serializing, then-deserializing. That can be used to tests the new functionality from btests. Not quite done yet. TODO: - Not all tests pass yet: [ 0%] language.named-set-ctors ... failed [ 16%] language.copy-all-opaques ... failed [ 33%] language.set-type-checking ... failed [ 50%] language.table-init-container-ctors ... failed [ 66%] coverage.sphinx-zeekygen-docs ... failed [ 83%] scripts.base.frameworks.sumstats.basic-cluster ... failed (Some of the serialization may still be buggy.) - Clean up the code a bit more.	2019-06-17 16:13:54 +00:00
Johanna Amann	474efe9e69	Remove value serialization. Note - this compiles, but you cannot run Bro anymore - it crashes immediately with a 0-pointer access. The reason behind it is that the required clone functionality does not work anymore.	2019-05-09 11:54:38 -07:00
Robin Sommer	4d84ee82da	Merge remote-tracking branch 'origin/topic/johanna/bit-1612' Addig a new random seed for external tests. I added a wrapper around the siphash() function to make calling it a little bit safer at least. BIT-1612 #merged * origin/topic/johanna/bit-1612: HLL: Fix missing typecast in test case. Remove the -K/-J options for setting keys. Add test checking the quality of HLL by adding a lot of elements. Fix serializing probabilistic hashers. Baseline updates after hash function change. Also switch BloomFilters from H3 to siphash. Change Hashing from H3 to Siphash. HLL: Remove unnecessary comparison. Hyperloglog: change calculation of Rho	2016-07-14 16:26:17 -07:00
Johanna Amann	f1bae871e9	Also switch BloomFilters from H3 to siphash. This removes all dependencies on H3 in our source tree.	2016-07-13 09:04:10 -07:00
Johanna Amann	e1218cc7fa	Change Hashing from H3 to Siphash. This commit mostly changes the hash function that is used for Internal hashing of data < 36 bytes from H3 to Siphash. This change is motivated by the fact that it turns out that H3 apparently does not deliver a very good source of data uniqueness; running HLL with H3 as a hashing function results in quite poor results (up to of 75% off in my tests). In difference, running HLL with Siphash (or HMAC-MD5) changes this factor to ~2%. This also fixes a long-standing bug in Hash.h which truncated our hash values to 32 bit on most machines. Furthermore, it once again fixes a problem with the Rank function in HLL.	2016-07-13 06:44:51 -07:00
Johanna Amann	3aabe83ec6	Hyperloglog: change calculation of Rho This commit changes the calculation of the rho-value to be in line with the implementation of the original research paper, counting the number of zero bits before the data. This also fixes an infinite loop in case the hash value is 0. I also cleaned up the code a bit, converting the raw pointers that were used to a STL vector. Addresses BIT-1612	2016-06-13 15:18:44 -07:00
Robin Sommer	c6de23ebe1	Merge remote-tracking branch 'origin/topic/bernhard/ticket1072' * origin/topic/bernhard/ticket1072: and const 2 more functions update hll documentation, make a few functions private and create a new copy constructor. fix case where hll_error_margin could be undefined (thanks John) BIT-1072 #merged	2013-09-18 15:00:06 -07:00
Bernhard Amann	ecc20b932a	and const 2 more functions	2013-09-16 11:00:54 -07:00
Bernhard Amann	c0f780c728	update hll documentation, make a few functions private and create a new copy constructor.	2013-09-16 10:40:25 -07:00
Robin Sommer	6f9d28cc18	Merge branch 'topic/robin/hyperloglog-merge' * topic/robin/hyperloglog-merge: (35 commits) Making the confidence configurable. Renaming HyperLogLog->CardinalityCounter. Fixing bug introduced during merging. add clustered leak test for hll. No issues. make gcc happy (hopefully) fix refcounting problem in hll/bloom-filter opaque vals. Thanks Robin. re-use same hash class for all add operations get hll ready for merging and forgot a file... adapt to new structure fix opaqueval-related memleak. make it compile on case-sensitive file systems and fix warnings make error rate configureable add persistence test not using predetermined random seeds. update cluster test to also use hll persistence really works. well, with this commit synchronizing the data structure should work.. ...if we had consistent hashing. and also serialize the other things we need ok, this bug was hard to find. serialization compiles. ...	2013-08-31 10:42:42 -07:00
Robin Sommer	295987c8d0	Making the confidence configurable.	2013-08-31 10:34:50 -07:00
Robin Sommer	fb3ceae6d5	Renaming HyperLogLog->CardinalityCounter. For consistency with the class' name.	2013-08-31 10:22:27 -07:00

28 commits