summary refs log tree commit diff stats
path: root/lib/pure/collections/sets.nim
Commit message (Collapse)AuthorAgeFilesLines
* assignment -> shallowCopy for efficiency.Charles Blake2015-02-131-1/+1
|
* Update doc comments to mention rightSize.Charles Blake2015-02-131-6/+6
|
* Address Andreas' complaint about code duplication.Charles Blake2015-02-071-2/+3
|
* Fix unnecessarily slow set building from openArray.Charles Blake2015-02-071-3/+17
| | | | | | | | | | | | | | | | | | | The estimation of the initialSize as simply array len + 10 was too small for for all but the smallest sets. It would not elide/skip one final enlarge(). That last one is actually always the most expensive enlarge(). Indeed, in a series where one to start from tiny and build up the table..that last one is about 50% of all the enlarging time in general. So, this simple and reasonable optimization (compared to just starting at 64) was only helping about half as much as it could. Introduce a rightSize() proc to be the inverse to mustRehash(). Export it to clients since pre-sizing is externally useful in set construction and the current mustRehash rules are opaque and beyond the control of clients. Also add test module logic to check that rightSize() and mustRehash() are inverses in the appropriate sense..not really in a block/assertion throwing unit test since this is a peformance nice-to-have issue rather than about basic correctness. (Also, fix a too vs. two typo in doc comment.)
* Add hcode. Re-factor rawGet. Fix infinite loop.Charles Blake2015-02-061-41/+94
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Replace state enum with a cached hash code which has the same memory overhead and locality as the enum, but can really speed things up with non-integer-like keys (keys for which either hash() or == take more than couple cycles, or where the key data is "indirect" and might incur another cache miss). To function as both empty/filled state and a hash code cache, it only needs to be ensured that hash codes are non-zero for any real key. That is done at the one place in the whole file hash() is called. Keep convention clear via isFilled() & isEmpty(). An isDeleted state will no longer be necessary as per below excl/inf loop fix. Since some use sites know hc and some do not, re-factor rawGet into two forms - one with known hash code and one with an unknown HC that returns it. Both forms still return <0 on missing, but returns the much more informative "-1 - index". That return can be quickly inverted by -1 - result to recover the index where insert should happen, provided no modifications are made to the table in the meantime. This protocol retains the prior <0 interface and also makes it easy to avoid unnecessary duplicate search work in procs like containsOrInclImpl (which formerly searched in the initial get and AGAIN in rawInsert). Strip the searching part out of rawInsert to "make it even more raw". swap(s.data, n) a bit earlier so rawGet and rawGetKnownHC can have similar parameter lists and integrate well with rawInsert/code sharing between Set and OrderedSet impls. This PR also fixes infinite looping upon too many deletes. [ The deleted state (aka "tombstone") approach is vulnerable to the table filling up with deleted items which forces giant scans for missing keys which could be anywhere. In the version prior to this PR, table wraparound wasn't even detected yielding infinite loops. ] This PR changes excl() from marking slots as deleted to Knuth algo 6.4R, "local/incremental moveback rehashing" - adapted from Knuth's h->h-1 to the cache-friendlier h->h+1 probe sequence and adapted from "gotos" to a new doWhile template. This method restores the table to a state that would have resulted from pure inserts (in some order). Update nextTry accordingly. Since linear probing can degrade a little faster, 50% rather than 66% may be a better default growth threshold, but users should be able to adjust threshold anyway. Old unit tests all pass. More extensive testing in this module is probably warranted before taking similar enhancements over to collections.tables.
* documented new C++ supportAraq2015-01-281-1/+2
|
* Update sets.nimdumndummer2015-01-271-1/+1
| | | corrected misspelled word in doc comment
* fixes #1444Araq2014-08-311-1/+1
|
* big renameAraq2014-08-281-12/+12
|
* big renameAraq2014-08-281-48/+48
|
* big renameAraq2014-08-281-5/+5
|
* big renameAraq2014-08-271-22/+25
|
* renamefestAraq2014-08-231-1/+1
|
* fixes #1413Araq2014-08-131-1/+19
|
* Adds definition of card term to sets module.Grzegorz Adam Hankiewicz2014-07-271-0/+6
|
* Adds more docstrings to the sets module.Grzegorz Adam Hankiewicz2014-07-271-61/+374
|
* Adds TSet.init(), wraps initSet around it.Grzegorz Adam Hankiewicz2014-07-261-6/+46
|
* Adds TOrderedSet.init(), wraps initOrderedSet around it.Grzegorz Adam Hankiewicz2014-07-261-8/+49
|
* Adds test cases for remaining TSet procs.Grzegorz Adam Hankiewicz2014-07-261-0/+9
|
* Moves TSet procs to their code block.Grzegorz Adam Hankiewicz2014-07-261-21/+21
|
* Adds TOrderedSet.isValid().Grzegorz Adam Hankiewicz2014-07-261-0/+58
|
* Adds TSet.isValid().Grzegorz Adam Hankiewicz2014-07-261-0/+101
|
* More effificent TSet differencedef2014-07-141-3/+4
|
* Add missing difference and to sets moduledef2014-07-141-0/+10
|
* Fixed TSet proc names to conform with setFelix Krause2014-06-261-3/+3
|
* Fixed doc comments in sets.nimFelix Krause2014-06-261-4/+4
|
* Added logical set operations to TSetFelix Krause2014-06-251-0/+49
|
* added mget for TSetAndreas Rumpf2014-04-131-0/+9
|
* fixed a bug in `map` for setsSimon Hafner2014-02-101-2/+2
|
* added `map` to setsSimon Hafner2014-02-101-0/+4
|
* corrected docs and testsSimon Hafner2014-02-011-2/+2
|
* sets equivalenceSimon Hafner2014-01-311-0/+17
|
* case consistency part 7Araq2013-12-281-21/+19
|
* Removes executable bit for text files.Grzegorz Adam Hankiewicz2013-03-161-0/+0
|
* made more tests green; fixes #201Araq2012-09-121-1/+1
|
* documented hygienic templates; made tests green; fixed system.clampAraq2012-08-221-6/+8
|
* bugfix: collection/sets only worked by chance ...Araq2012-07-251-4/+4
|
* year 2012 for most copyright headersAraq2012-01-021-1/+1
|
* modifyable results for generics; teventemitter worksAraq2011-08-091-7/+2
|
* preparations for 0.8.12Araq2011-07-101-3/+3
|
* implemented tables.addAraq2011-06-111-2/+2
|
* basic generic collections implemented and testedAraq2011-06-071-0/+231