docs for muutils v0.8.7

Contents


muutils, stylized as "μutils" or "muutils", is a collection of miscellaneous python utilities, meant to be small and with no dependencies outside of standard python.

installation

PyPI: muutils

pip install muutils

Note that for using mlutils, tensor_utils, nbutils.configure_notebook, or the array serialization features of json_serialize, you will need to install with optional array dependencies:

pip install muutils[array]

documentation

hosted html docs: https://miv.name/muutils

modules

statcounter

an extension of collections.Counter that provides “smart” computation of stats (mean, variance, median, other percentiles) from the counter object without using Counter.elements()
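The core idea can be sketched in plain stdlib Python (this is an illustration of the concept, not the StatCounter API itself): stats can be computed directly from a Counter's (value, count) pairs, without expanding every element via Counter.elements():

```python
from collections import Counter

def counter_mean(c: Counter) -> float:
    # weighted mean over (value, count) pairs -- no need to
    # materialize every element with Counter.elements()
    total = sum(c.values())
    return sum(value * count for value, count in c.items()) / total
```
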

dictmagic

has utilities for working with dictionaries, like:

kappa

Anonymous __getitem__, so you can do things like

>>> k = Kappa(lambda x: x**2)
>>> k[2]
4
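A minimal sketch of the idea behind such a wrapper (names here are illustrative, not the actual implementation):

```python
from typing import Callable, TypeVar

K = TypeVar("K")
V = TypeVar("V")

class KappaSketch:
    # wraps a function so that indexing (obj[key]) calls it --
    # an "anonymous __getitem__"
    def __init__(self, func: Callable[[K], V]) -> None:
        self._func = func

    def __getitem__(self, key: K) -> V:
        return self._func(key)
```
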

sysinfo

utility for getting a bunch of system information. useful for logging.

misc:

contains a few utilities:

- stable_hash() uses hashlib.sha256 to compute a hash of an object that is stable across runs of python
- list_join and list_split, which behave like str.join and str.split but for lists
- sanitize_fname and dict_to_filename, for simplifying the creation of unique filenames
- shorten_numerical_to_str() and str_to_numeric, which turn numbers like 123456789 into "123M" and back
- freeze, which prevents an object from being modified. Also see gelidum
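The motivation behind stable_hash can be shown with a short sketch (not the actual implementation): Python's builtin hash() is salted per process, so a hash that is stable across runs has to go through something like hashlib.sha256:

```python
import hashlib
import json

def stable_hash_sketch(obj) -> int:
    # serialize deterministically, then hash with sha256; unlike the
    # builtin hash(), the result does not change between Python runs
    data = json.dumps(obj, sort_keys=True).encode("utf-8")
    return int.from_bytes(hashlib.sha256(data).digest()[:8], "big")
```
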

nbutils

contains utilities for working with jupyter notebooks, such as:

json_serialize

a tool for serializing and loading arbitrary python objects into json. plays nicely with ZANJ

tensor_utils

contains minor utilities for working with pytorch tensors and numpy arrays, mostly for making type conversions easier

group_equiv

groups elements from a sequence according to a given equivalence relation, without assuming that the equivalence relation obeys the transitive property

jsonlines

an extremely simple utility for reading/writing jsonl files

ZANJ

is a human-readable and simple format for ML models, datasets, and arbitrary objects. It's built around having a zip file with json and npy files, and has been spun off into its own project.

There are a couple work-in-progress utilities in _wip that aren’t ready for anything, but nothing in this repo is suitable for production. Use at your own risk!

Submodules

View Source on GitHub

muutils


View Source on GitHub


API Documentation

View Source on GitHub

muutils.console_unicode

View Source on GitHub

def get_console_safe_str

(default: str, fallback: str) -> str

View Source on GitHub

Determine a console-safe string based on the preferred encoding.

This function attempts to encode a given default string using the system’s preferred encoding. If encoding is successful, it returns the default string; otherwise, it returns a fallback string.

Parameters:

Returns:

Usage:

>>> get_console_safe_str("café", "cafe")
"café"  # This result may vary based on the system's preferred encoding.
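The logic amounts to a try/except around an encode call, roughly like this sketch:

```python
import locale

def get_console_safe_str_sketch(default: str, fallback: str) -> str:
    # return `default` if it survives the system's preferred encoding,
    # otherwise fall back to the plain-ASCII alternative
    try:
        default.encode(locale.getpreferredencoding())
        return default
    except UnicodeEncodeError:
        return fallback
```
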


Contents

this code is based on an implementation of the Rust builtin dbg! for Python, originally from https://github.com/tylerwince/pydbg/blob/master/pydbg.py although it has been significantly modified

licensed under MIT:

Copyright (c) 2019 Tyler Wince

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the “Software”), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

API Documentation

View Source on GitHub

muutils.dbg


View Source on GitHub

def dbg

(
    exp: Union[~_ExpType, muutils.dbg._NoExpPassedSentinel] = <muutils.dbg._NoExpPassedSentinel object>,
    formatter: Optional[Callable[[Any], str]] = None,
    val_joiner: str = ' = '
) -> Union[~_ExpType, muutils.dbg._NoExpPassedSentinel]

View Source on GitHub

Call dbg with any variable or expression.

Calling dbg will print to stderr the current filename and lineno, as well as the passed expression and what the expression evaluates to:

    from muutils.dbg import dbg

    a = 2
    b = 5

    dbg(a+b)

    def square(x: int) -> int:
        return x * x

    dbg(square(a))
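A stripped-down sketch of what dbg does (the real version also recovers the source text of the passed expression):

```python
import inspect
import sys
from typing import TypeVar

T = TypeVar("T")

def dbg_sketch(exp: T) -> T:
    # print the caller's file and line plus the value to stderr, then
    # return the value unchanged so it can be dropped into any expression
    frame = inspect.currentframe().f_back
    print(f"[{frame.f_code.co_filename}:{frame.f_lineno}] {exp!r}", file=sys.stderr)
    return exp
```

Because the value is returned unchanged, `dbg(a+b)` can wrap any subexpression without altering program behavior.
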

def tensor_info

(tensor: Any) -> str

View Source on GitHub


Contents

making working with dictionaries easier

API Documentation

View Source on GitHub

muutils.dictmagic

making working with dictionaries easier

View Source on GitHub

class DefaulterDict(typing.Dict[~_KT, ~_VT], typing.Generic[~_KT, ~_VT]):

View Source on GitHub

like a defaultdict, but default_factory is passed the key as an argument

Inherited Members

def defaultdict_to_dict_recursive

(
    dd: Union[collections.defaultdict, muutils.dictmagic.DefaulterDict]
) -> dict

View Source on GitHub

Convert a defaultdict or DefaulterDict to a normal dict, recursively

def dotlist_to_nested_dict

(dot_dict: Dict[str, Any], sep: str = '.') -> Dict[str, Any]

View Source on GitHub

Convert a dict with dot-separated keys to a nested dict

Example:

>>> dotlist_to_nested_dict({'a.b.c': 1, 'a.b.d': 2, 'a.e': 3})
{'a': {'b': {'c': 1, 'd': 2}, 'e': 3}}
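A sketch of the conversion (illustrative, not the actual implementation):

```python
from typing import Any, Dict

def dotlist_to_nested_dict_sketch(dot_dict: Dict[str, Any], sep: str = ".") -> Dict[str, Any]:
    # walk each dotted key, creating intermediate dicts as needed
    out: Dict[str, Any] = {}
    for key, value in dot_dict.items():
        *parents, last = key.split(sep)
        node = out
        for part in parents:
            node = node.setdefault(part, {})
        node[last] = value
    return out
```
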

def nested_dict_to_dotlist

(
    nested_dict: Dict[str, Any],
    sep: str = '.',
    allow_lists: bool = False
) -> dict[str, typing.Any]

View Source on GitHub

def update_with_nested_dict

(
    original: dict[str, typing.Any],
    update: dict[str, typing.Any]
) -> dict[str, typing.Any]

View Source on GitHub

Update a dict with a nested dict

Example:

>>> update_with_nested_dict({'a': {'b': 1}, 'c': -1}, {'a': {'b': 2}})
{'a': {'b': 2}, 'c': -1}

Arguments

Returns

def kwargs_to_nested_dict

(
    kwargs_dict: dict[str, typing.Any],
    sep: str = '.',
    strip_prefix: Optional[str] = None,
    when_unknown_prefix: Union[muutils.errormode.ErrorMode, str] = ErrorMode.Warn,
    transform_key: Optional[Callable[[str], str]] = None
) -> dict[str, typing.Any]

View Source on GitHub

given kwargs from fire, convert them to a nested dict

if strip_prefix is not None, then all keys must start with the prefix. By default, an unknown prefix triggers a warning; set when_unknown_prefix (an ErrorMode) to raise an error or ignore it instead.

Example:

def main(**kwargs):
    print(kwargs_to_nested_dict(kwargs))
fire.Fire(main)

running the above script will give:

$ python test.py --a.b.c=1 --a.b.d=2 --a.e=3
{'a': {'b': {'c': 1, 'd': 2}, 'e': 3}}

Arguments

def is_numeric_consecutive

(lst: list[str]) -> bool

View Source on GitHub

Check if the list of keys is numeric and consecutive.

def condense_nested_dicts_numeric_keys

(data: dict[str, typing.Any]) -> dict[str, typing.Any]

View Source on GitHub

condense a nested dict, by condensing numeric keys with matching values to ranges

Examples:

>>> condense_nested_dicts_numeric_keys({'1': 1, '2': 1, '3': 1, '4': 2, '5': 2, '6': 2})
{'[1-3]': 1, '[4-6]': 2}
>>> condense_nested_dicts_numeric_keys({'1': {'1': 'a', '2': 'a'}, '2': 'b'})
{'1': {'[1-2]': 'a'}, '2': 'b'}
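The core of the numeric-key condensing can be sketched for a flat dict (the real function also recurses into nested dicts):

```python
from typing import Any, Dict

def condense_numeric_keys_flat(data: Dict[str, Any]) -> Dict[str, Any]:
    # collapse runs of consecutive numeric keys with equal values
    # into a single '[lo-hi]' key
    items = sorted(data.items(), key=lambda kv: int(kv[0]))
    out: Dict[str, Any] = {}
    i = 0
    while i < len(items):
        j = i
        while (
            j + 1 < len(items)
            and items[j + 1][1] == items[i][1]
            and int(items[j + 1][0]) == int(items[j][0]) + 1
        ):
            j += 1
        if j > i:
            out[f"[{items[i][0]}-{items[j][0]}]"] = items[i][1]
        else:
            out[items[i][0]] = items[i][1]
        i = j + 1
    return out
```
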

def condense_nested_dicts_matching_values

(
    data: dict[str, typing.Any],
    val_condense_fallback_mapping: Optional[Callable[[Any], Hashable]] = None
) -> dict[str, typing.Any]

View Source on GitHub

condense a nested dict, by condensing keys with matching values

Examples: TODO

Parameters:

def condense_nested_dicts

(
    data: dict[str, typing.Any],
    condense_numeric_keys: bool = True,
    condense_matching_values: bool = True,
    val_condense_fallback_mapping: Optional[Callable[[Any], Hashable]] = None
) -> dict[str, typing.Any]

View Source on GitHub

condense a nested dict, by condensing numeric or matching keys with matching values to ranges

combines the functionality of condense_nested_dicts_numeric_keys() and condense_nested_dicts_matching_values()

NOTE: this process is not meant to be reversible, and is intended for pretty-printing and visualization purposes

it’s not reversible because types are lost to make the printing pretty

Parameters:

def tuple_dims_replace

(
    t: tuple[int, ...],
    dims_names_map: Optional[dict[int, str]] = None
) -> tuple[typing.Union[int, str], ...]

View Source on GitHub

def condense_tensor_dict

(
    data: 'TensorDict | TensorIterable',
    fmt: Literal['dict', 'json', 'yaml', 'yml'] = 'dict',
    *args,
    shapes_convert: Callable[[tuple], Any] = <function _default_shapes_convert>,
    drop_batch_dims: int = 0,
    sep: str = '.',
    dims_names_map: Optional[dict[int, str]] = None,
    condense_numeric_keys: bool = True,
    condense_matching_values: bool = True,
    val_condense_fallback_mapping: Optional[Callable[[Any], Hashable]] = None,
    return_format: Optional[Literal['dict', 'json', 'yaml', 'yml']] = None
) -> Union[str, dict[str, str | tuple[int, ...]]]

View Source on GitHub

Convert a dictionary of tensors to a dictionary of shapes.

by default, values are converted to strings of their shapes (for nice printing). If you want the actual shapes, set shapes_convert = lambda x: x or shapes_convert = None.

Parameters:

Returns:

Examples:

>>> model = transformer_lens.HookedTransformer.from_pretrained("gpt2")
>>> print(condense_tensor_dict(model.named_parameters(), return_format='yaml'))
embed:
  W_E: (50257, 768)
pos_embed:
  W_pos: (1024, 768)
blocks:
  '[0-11]':
    attn:
      '[W_Q, W_K, W_V]': (12, 768, 64)
      W_O: (12, 64, 768)
      '[b_Q, b_K, b_V]': (12, 64)
      b_O: (768,)
    mlp:
      W_in: (768, 3072)
      b_in: (3072,)
      W_out: (3072, 768)
      b_out: (768,)
unembed:
  W_U: (768, 50257)
  b_U: (50257,)

Raises:


Contents

provides ErrorMode enum for handling errors consistently

pass an error_mode: ErrorMode to a function to specify how to handle a certain kind of exception. That function then, instead of raising an exception or calling warnings.warn directly, calls error_mode.process with the message and the exception.

you can also specify the exception class to raise, the warning class to use, and the source of the exception/warning.
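The dispatch that process performs amounts to something like this sketch (mode names simplified; not the actual enum):

```python
import warnings

def process_sketch(
    mode: str,
    msg: str,
    except_cls: type = ValueError,
    warn_cls: type = UserWarning,
) -> None:
    # raise, warn, or ignore, depending on the selected mode
    if mode == "except":
        raise except_cls(msg)
    elif mode == "warn":
        warnings.warn(msg, warn_cls)
    elif mode == "ignore":
        pass
    else:
        raise KeyError(f"unknown error mode: {mode!r}")
```
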

API Documentation

View Source on GitHub

muutils.errormode


View Source on GitHub

class WarningFunc(typing.Protocol):

View Source on GitHub

Base class for protocol classes.

Protocol classes are defined as::

class Proto(Protocol):
    def meth(self) -> int:
        ...

Such classes are primarily used with static type checkers that recognize structural subtyping (static duck-typing).

For example::

class C:
    def meth(self) -> int:
        return 0

def func(x: Proto) -> int:
    return x.meth()

func(C())  # Passes static type check

See PEP 544 for details. Protocol classes decorated with @typing.runtime_checkable act as simple-minded runtime protocols that check only the presence of given attributes, ignoring their type signatures. Protocol classes can be generic, they are defined as::

class GenProto[T](Protocol):
    def meth(self) -> T:
        ...

WarningFunc

(*args, **kwargs)

View Source on GitHub

def GLOBAL_WARN_FUNC

(unknown)

Issue a warning, or maybe ignore it or raise an exception.

message: Text of the warning message.
category: The Warning category subclass. Defaults to UserWarning.
stacklevel: How far up the call stack to make this warning appear. A value of 2 for example attributes the warning to the caller of the code calling warn().
source: If supplied, the destroyed object which emitted a ResourceWarning.
skip_file_prefixes: An optional tuple of module filename prefixes indicating frames to skip during stacklevel computations for stack frame attribution.

def GLOBAL_LOG_FUNC

(*args, sep=' ', end='\n', file=None, flush=False)

Prints the values to a stream, or to sys.stdout by default.

sep: string inserted between values, default a space.
end: string appended after the last value, default a newline.
file: a file-like object (stream); defaults to the current sys.stdout.
flush: whether to forcibly flush the stream.

def custom_showwarning

(
    message: Warning | str,
    category: Optional[Type[Warning]] = None,
    filename: str | None = None,
    lineno: int | None = None,
    file: Optional[TextIO] = None,
    line: Optional[str] = None
) -> None

View Source on GitHub

class ErrorMode(enum.Enum):

View Source on GitHub

Enum for handling errors consistently

pass one of the instances of this enum to a function to specify how to handle a certain kind of exception.

That function then, instead of raising an exception or calling warnings.warn directly, calls error_mode.process with the message and the exception.

def process

(
    self,
    msg: str,
    except_cls: Type[Exception] = <class 'ValueError'>,
    warn_cls: Type[Warning] = <class 'UserWarning'>,
    except_from: Optional[Exception] = None,
    warn_func: muutils.errormode.WarningFunc | None = None,
    log_func: Optional[Callable[[str], NoneType]] = None
)

View Source on GitHub

process an exception or warning according to the error mode

Parameters:

Raises:

def from_any

(
    cls,
    mode: str | muutils.errormode.ErrorMode,
    allow_aliases: bool = True,
    allow_prefix: bool = True
) -> muutils.errormode.ErrorMode

View Source on GitHub

initialize an ErrorMode from a string or an ErrorMode instance

def serialize

(self) -> str

View Source on GitHub

def load

(cls, data: str) -> muutils.errormode.ErrorMode

View Source on GitHub

Inherited Members

map of string aliases to ErrorMode instances


Contents

group items by assuming that eq_func defines an equivalence relation

API Documentation

View Source on GitHub

muutils.group_equiv


View Source on GitHub

def group_by_equivalence

(
    items_in: Sequence[~T],
    eq_func: Callable[[~T, ~T], bool]
) -> list[list[~T]]

View Source on GitHub

group items by assuming that eq_func implies an equivalence relation but might not be transitive

so, if f(a,b) and f(b,c) then f(a,c) might be false, but we still want to put [a,b,c] in the same class

note that lists are used to avoid the need for hashable items, and to allow for duplicates
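A sketch of the merging strategy: each new item is fused with every existing group it matches, so transitivity of eq_func is never assumed:

```python
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")

def group_by_equivalence_sketch(
    items: Sequence[T],
    eq_func: Callable[[T, T], bool],
) -> List[List[T]]:
    groups: List[List[T]] = []
    for item in items:
        # merge every group containing something equivalent to `item`
        merged: List[T] = [item]
        rest: List[List[T]] = []
        for group in groups:
            if any(eq_func(item, other) for other in group):
                merged.extend(group)
            else:
                rest.append(group)
        rest.append(merged)
        groups = rest
    return groups
```

With eq_func = lambda a, b: abs(a - b) <= 1, the items [1, 2, 3, 10] collapse to the groups [1, 2, 3] and [10], even though 1 and 3 are not directly equivalent.
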

Arguments


Contents

represents a mathematical Interval over the real numbers

API Documentation

View Source on GitHub

muutils.interval


View Source on GitHub

class Interval:

View Source on GitHub

Represents a mathematical interval, open by default.

The Interval class can represent both open and closed intervals, as well as half-open intervals. It supports various initialization methods and provides containment checks.

Examples:

>>> i1 = Interval(1, 5)  # Default open interval (1, 5)
>>> 3 in i1
True
>>> 1 in i1
False
>>> i2 = Interval([1, 5])  # Closed interval [1, 5]
>>> 1 in i2
True
>>> i3 = Interval(1, 5, closed_L=True)  # Half-open interval [1, 5)
>>> str(i3)
'[1, 5)'
>>> i4 = ClosedInterval(1, 5)  # Closed interval [1, 5]
>>> i5 = OpenInterval(1, 5)  # Open interval (1, 5)

Interval

(
    *args: Union[Sequence[Union[float, int]], float, int],
    is_closed: Optional[bool] = None,
    closed_L: Optional[bool] = None,
    closed_R: Optional[bool] = None
)

View Source on GitHub


def get_empty

() -> muutils.interval.Interval

View Source on GitHub

def get_singleton

(value: Union[float, int]) -> muutils.interval.Interval

View Source on GitHub

def numerical_contained

(self, item: Union[float, int]) -> bool

View Source on GitHub

def interval_contained

(self, item: muutils.interval.Interval) -> bool

View Source on GitHub

def from_str

(cls, input_str: str) -> muutils.interval.Interval

View Source on GitHub

def copy

(self) -> muutils.interval.Interval

View Source on GitHub

def size

(self) -> float

View Source on GitHub

Returns the size of the interval.

Returns:

def clamp

(self, value: Union[int, float], epsilon: float = 1e-10) -> float

View Source on GitHub

Clamp the given value to the interval bounds.

For open bounds, the clamped value will be slightly inside the interval (by epsilon).
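A sketch of that behavior, assuming plain numeric bounds lo and hi (the actual method works on the Interval's own bounds):

```python
def clamp_sketch(
    value: float,
    lo: float,
    hi: float,
    closed_L: bool = False,
    closed_R: bool = False,
    epsilon: float = 1e-10,
) -> float:
    # for an open bound, land epsilon inside the interval rather than
    # exactly on the excluded endpoint
    if value < lo or (value == lo and not closed_L):
        return lo if closed_L else lo + epsilon
    if value > hi or (value == hi and not closed_R):
        return hi if closed_R else hi - epsilon
    return float(value)
```
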

Parameters:

Returns:

Raises:

def intersection

(self, other: muutils.interval.Interval) -> muutils.interval.Interval

View Source on GitHub

def union

(self, other: muutils.interval.Interval) -> muutils.interval.Interval

View Source on GitHub

class ClosedInterval(Interval):

View Source on GitHub

A fully closed interval [a, b]: both endpoints are contained. See Interval above for initialization options and examples.

ClosedInterval

(*args: Union[Sequence[float], float], **kwargs: Any)

View Source on GitHub

Inherited Members

class OpenInterval(Interval):

View Source on GitHub

A fully open interval (a, b): neither endpoint is contained. See Interval above for initialization options and examples.

OpenInterval

(*args: Union[Sequence[float], float], **kwargs: Any)

View Source on GitHub

Inherited Members


Contents

submodule for serializing things to json in a recoverable way

you can throw any object into muutils.json_serialize.json_serialize and it will return a JSONitem, meaning a bool, int, float, str, None, list of JSONitems, or a dict mapping str to JSONitem.

The goal: if you just want to store something as relatively human-readable JSON and don't care much about recovering it, you can throw it into json_serialize and it will just work. If you want to do so in a recoverable way, check out ZANJ.

it will do so by looking in DEFAULT_HANDLERS, which will keep the object as-is if it's already valid, then try to find a .serialize() method on the object, and then fall back to a number of special cases. You can add handlers by initializing a JsonSerializer object and passing a sequence of them to handlers_pre

additionally, SerializableDataclass is a special kind of dataclass where you specify how to serialize each field, and a .serialize() method is automatically added to the class. This is done by using the serializable_dataclass decorator, inheriting from SerializableDataclass, and using serializable_field in place of dataclasses.field when defining non-standard fields.

This module plays nicely with and is a dependency of the ZANJ library, which extends this to support saving things to disk in a more efficient way than just plain json (arrays are saved as npy files, for example), and automatically detecting how to load saved objects into their original classes.
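The handler chain described above can be sketched like this miniature (illustrative only, not the actual DEFAULT_HANDLERS):

```python
import dataclasses
from pathlib import Path
from typing import Any

def json_serialize_sketch(obj: Any) -> Any:
    # pass through already-valid JSON values
    if obj is None or isinstance(obj, (bool, int, float, str)):
        return obj
    # prefer an object's own .serialize() method
    if hasattr(obj, "serialize") and callable(obj.serialize):
        return json_serialize_sketch(obj.serialize())
    # special cases: containers, dataclasses, paths
    if isinstance(obj, dict):
        return {str(k): json_serialize_sketch(v) for k, v in obj.items()}
    if isinstance(obj, (list, tuple, set)):
        return [json_serialize_sketch(x) for x in obj]
    if dataclasses.is_dataclass(obj):
        return json_serialize_sketch(dataclasses.asdict(obj))
    if isinstance(obj, Path):
        return obj.as_posix()
    # fallback: stringify
    return str(obj)
```
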

Submodules

API Documentation

View Source on GitHub

muutils.json_serialize


View Source on GitHub

def json_serialize

(
    obj: Any,
    path: tuple[typing.Union[str, int], ...] = ()
) -> Union[bool, int, float, str, NoneType, List[Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]], Dict[str, Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]]]

View Source on GitHub

serialize object to json-serializable object with default config

def serializable_dataclass

(
    _cls=None,
    *,
    init: bool = True,
    repr: bool = True,
    eq: bool = True,
    order: bool = False,
    unsafe_hash: bool = False,
    frozen: bool = False,
    properties_to_serialize: Optional[list[str]] = None,
    register_handler: bool = True,
    on_typecheck_error: muutils.errormode.ErrorMode = ErrorMode.Except,
    on_typecheck_mismatch: muutils.errormode.ErrorMode = ErrorMode.Warn,
    methods_no_override: list[str] | None = None,
    **kwargs
)

View Source on GitHub

decorator to make a dataclass serializable. must also make it inherit from SerializableDataclass!!

types will be validated (like pydantic) unless on_typecheck_mismatch is set to ErrorMode.IGNORE

behavior of most kwargs matches that of dataclasses.dataclass, but with some additional kwargs. any kwargs not listed here are passed to dataclasses.dataclass

Returns the same class as was passed in, with dunder methods added based on the fields defined in the class.

Examines PEP 526 __annotations__ to determine fields.

If init is true, an __init__() method is added to the class. If repr is true, a __repr__() method is added. If order is true, rich comparison dunder methods are added. If unsafe_hash is true, a __hash__() method function is added. If frozen is true, fields may not be assigned to after instance creation.

@serializable_dataclass(kw_only=True)
class Myclass(SerializableDataclass):
    a: int
    b: str
>>> Myclass(a=1, b="q").serialize()
{_FORMAT_KEY: 'Myclass(SerializableDataclass)', 'a': 1, 'b': 'q'}

Parameters:

Returns:

Raises:

def serializable_field

(
    *_args,
    default: Union[Any, dataclasses._MISSING_TYPE] = <dataclasses._MISSING_TYPE object>,
    default_factory: Union[Any, dataclasses._MISSING_TYPE] = <dataclasses._MISSING_TYPE object>,
    init: bool = True,
    repr: bool = True,
    hash: Optional[bool] = None,
    compare: bool = True,
    metadata: Optional[mappingproxy] = None,
    kw_only: Union[bool, dataclasses._MISSING_TYPE] = <dataclasses._MISSING_TYPE object>,
    serialize: bool = True,
    serialization_fn: Optional[Callable[[Any], Any]] = None,
    deserialize_fn: Optional[Callable[[Any], Any]] = None,
    assert_type: bool = True,
    custom_typecheck_fn: Optional[Callable[[type], bool]] = None,
    **kwargs: Any
) -> Any

View Source on GitHub

Create a new SerializableField

default: Sfield_T | dataclasses._MISSING_TYPE = dataclasses.MISSING,
default_factory: Callable[[], Sfield_T]
| dataclasses._MISSING_TYPE = dataclasses.MISSING,
init: bool = True,
repr: bool = True,
hash: Optional[bool] = None,
compare: bool = True,
metadata: types.MappingProxyType | None = None,
kw_only: bool | dataclasses._MISSING_TYPE = dataclasses.MISSING,
### ----------------------------------------------------------------------
### new in `SerializableField`, not in `dataclasses.Field`
serialize: bool = True,
serialization_fn: Optional[Callable[[Any], Any]] = None,
loading_fn: Optional[Callable[[Any], Any]] = None,
deserialize_fn: Optional[Callable[[Any], Any]] = None,
assert_type: bool = True,
custom_typecheck_fn: Optional[Callable[[type], bool]] = None,

new Parameters:

Gotchas:

class MyClass:
    my_field: int = serializable_field(
        serialization_fn=lambda x: str(x),
        loading_fn=lambda x: int(x["my_field"])
    )

using deserialize_fn instead:

class MyClass:
    my_field: int = serializable_field(
        serialization_fn=lambda x: str(x),
        deserialize_fn=lambda x: int(x)
    )

In the above code, my_field is an int but will be serialized as a string.

note that if not using ZANJ, and you have a class inside a container, you MUST provide serialization_fn and loading_fn to serialize and load the container. ZANJ will automatically do this for you.

TODO: custom_value_check_fn: function taking the value of the field and returning whether the value itself is valid. if not provided, any value is valid as long as it passes the type test

def arr_metadata

(arr) -> dict[str, list[int] | str | int]

View Source on GitHub

get metadata for a numpy array

def load_array

(
    arr: Union[bool, int, float, str, NoneType, List[Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]], Dict[str, Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]]],
    array_mode: Optional[Literal['list', 'array_list_meta', 'array_hex_meta', 'array_b64_meta', 'external', 'zero_dim']] = None
) -> Any

View Source on GitHub

load a json-serialized array, infer the mode if not specified

class JsonSerializer:

View Source on GitHub

Json serialization class (holds configs)

Parameters:

Raises:

JsonSerializer

(
    *args,
    array_mode: Literal['list', 'array_list_meta', 'array_hex_meta', 'array_b64_meta', 'external', 'zero_dim'] = 'array_list_meta',
    error_mode: muutils.errormode.ErrorMode = ErrorMode.Except,
    handlers_pre: None = (),
    handlers_default: None = (SerializerHandler(check=<function <lambda>>, serialize_func=<function <lambda>>, uid='base types', desc='base types (bool, int, float, str, None)'), SerializerHandler(check=<function <lambda>>, serialize_func=<function <lambda>>, uid='dictionaries', desc='dictionaries'), SerializerHandler(check=<function <lambda>>, serialize_func=<function <lambda>>, uid='(list, tuple) -> list', desc='lists and tuples as lists'), SerializerHandler(check=<function <lambda>>, serialize_func=<function _serialize_override_serialize_func>, uid='.serialize override', desc='objects with .serialize method'), SerializerHandler(check=<function <lambda>>, serialize_func=<function <lambda>>, uid='namedtuple -> dict', desc='namedtuples as dicts'), SerializerHandler(check=<function <lambda>>, serialize_func=<function <lambda>>, uid='dataclass -> dict', desc='dataclasses as dicts'), SerializerHandler(check=<function <lambda>>, serialize_func=<function <lambda>>, uid='path -> str', desc='Path objects as posix strings'), SerializerHandler(check=<function <lambda>>, serialize_func=<function <lambda>>, uid='obj -> str(obj)', desc='directly serialize objects in `SERIALIZE_DIRECT_AS_STR` to strings'), SerializerHandler(check=<function <lambda>>, serialize_func=<function <lambda>>, uid='numpy.ndarray', desc='numpy arrays'), SerializerHandler(check=<function <lambda>>, serialize_func=<function <lambda>>, uid='torch.Tensor', desc='pytorch tensors'), SerializerHandler(check=<function <lambda>>, serialize_func=<function <lambda>>, uid='pandas.DataFrame', desc='pandas DataFrames'), SerializerHandler(check=<function <lambda>>, serialize_func=<function <lambda>>, uid='(set, list, tuple, Iterable) -> list', desc='sets, lists, tuples, and Iterables as lists'), SerializerHandler(check=<function <lambda>>, serialize_func=<function <lambda>>, uid='fallback', desc='fallback handler -- serialize object attributes and special functions as strings')),
    write_only_format: bool = False
)

View Source on GitHub

def json_serialize

(
    self,
    obj: Any,
    path: tuple[typing.Union[str, int], ...] = ()
) -> Union[bool, int, float, str, NoneType, List[Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]], Dict[str, Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]]]

View Source on GitHub

def hashify

(
    self,
    obj: Any,
    path: tuple[typing.Union[str, int], ...] = (),
    force: bool = True
) -> Union[bool, int, float, str, tuple]

View Source on GitHub

try to turn any object into something hashable
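The approach can be sketched in plain Python (`hashify_sketch` is a hypothetical, simplified stand-in for the real implementation, which also routes through the serializer's handler chain):

```python
def hashify_sketch(obj):
    """Simplified stand-in for JsonSerializer.hashify: make obj hashable."""
    try:
        hash(obj)
        return obj  # already hashable (int, str, tuple of hashables, ...)
    except TypeError:
        pass
    if isinstance(obj, dict):
        # sort by key so equal dicts hashify to equal tuples
        return tuple(sorted((k, hashify_sketch(v)) for k, v in obj.items()))
    if isinstance(obj, (list, tuple, set)):
        return tuple(hashify_sketch(x) for x in obj)
    return str(obj)  # force=True behavior: fall back to the string repr

print(hashify_sketch({"a": [1, 2]}))  # → (('a', (1, 2)),)
```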

def try_catch

(func: Callable)

View Source on GitHub

wraps the function to catch exceptions, returns serialized error message on exception

returned func will return normal result on success, or error message on exception
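The behavior described can be sketched as a small decorator (an illustrative re-implementation; the real one returns a serialized error structure rather than a bare string):

```python
import functools

def try_catch_sketch(func):
    """Wrap func so exceptions become an error-message string instead of propagating."""
    @functools.wraps(func)
    def wrapped(*args, **kwargs):
        try:
            return func(*args, **kwargs)
        except Exception as e:
            return f"{e.__class__.__name__}: {e}"
    return wrapped

@try_catch_sketch
def divide(a, b):
    return a / b

print(divide(6, 2))  # 3.0
print(divide(1, 0))  # ZeroDivisionError: division by zero
```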

def dc_eq

(
    dc1,
    dc2,
    except_when_class_mismatch: bool = False,
    false_when_class_mismatch: bool = True,
    except_when_field_mismatch: bool = False
) -> bool

View Source on GitHub

checks if two dataclasses which (might) hold numpy arrays are equal

Parameters:

Returns:

Raises:

TODO: after “except when class mismatch” is False, shouldn’t we then go to “field keys match”?

          [START]
             ▼
       ┌───────────┐  ┌─────────┐
       │dc1 is dc2?├─►│ classes │
       └──┬────────┘No│ match?  │
  ────    │           ├─────────┤
 (True)◄──┘Yes        │No       │Yes
  ────                ▼         ▼
      ┌────────────────┐ ┌────────────┐
      │ except when    │ │ field keys │
      │ class mismatch?│ │ match?     │
      ├───────────┬────┘ ├───────┬────┘
      │Yes        │No    │No     │Yes
      ▼           ▼      ▼       ▼
 ───────────  ┌──────────┐  ┌────────┐
{ raise     } │ except   │  │ field  │
{ TypeError } │ when     │  │ values │
 ───────────  │ field    │  │ match? │
              │ mismatch?│  ├────┬───┘
              ├───────┬──┘  │    │Yes
              │Yes    │No   │No  ▼
              ▼       ▼     │   ────
 ───────────────     ─────  │  (True)
{ raise         }   (False)◄┘   ────
{ AttributeError}    ─────
 ───────────────
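The decision flow above can be sketched for plain dataclasses (a simplified, hypothetical stand-in that skips the numpy/torch-aware value comparison and follows the TODO's reading of checking field keys after a class mismatch):

```python
import dataclasses

def dc_eq_sketch(
    dc1, dc2,
    except_when_class_mismatch: bool = False,
    false_when_class_mismatch: bool = True,
    except_when_field_mismatch: bool = False,
) -> bool:
    if dc1 is dc2:
        return True
    if dc1.__class__ is not dc2.__class__:
        if except_when_class_mismatch:
            raise TypeError(f"class mismatch: {type(dc1)} != {type(dc2)}")
        fields1 = {f.name for f in dataclasses.fields(dc1)}
        fields2 = {f.name for f in dataclasses.fields(dc2)}
        if fields1 != fields2:
            if except_when_field_mismatch:
                raise AttributeError(f"field mismatch: {fields1} != {fields2}")
            return False
        if false_when_class_mismatch:
            return False
    # classes (or at least field keys) match: compare field values
    return all(
        getattr(dc1, f.name) == getattr(dc2, f.name)
        for f in dataclasses.fields(dc1)
    )

@dataclasses.dataclass
class Point:
    x: int
    y: int

assert dc_eq_sketch(Point(1, 2), Point(1, 2))
assert not dc_eq_sketch(Point(1, 2), Point(1, 3))
```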

class SerializableDataclass(abc.ABC):

View Source on GitHub

Base class for serializable dataclasses

only for linting and type checking, still need to call serializable_dataclass decorator

Usage:

@serializable_dataclass
class MyClass(SerializableDataclass):
    a: int
    b: str

and then you can call my_obj.serialize() to get a dict that can be serialized to json. So, you can do:

>>> my_obj = MyClass(a=1, b="q")
>>> s = json.dumps(my_obj.serialize())
>>> s
'{_FORMAT_KEY: "MyClass(SerializableDataclass)", "a": 1, "b": "q"}'
>>> read_obj = MyClass.load(json.loads(s))
>>> read_obj == my_obj
True

This isn’t too impressive on its own, but it gets more useful when you have nested classes, or fields that are not json-serializable by default:

@serializable_dataclass
class NestedClass(SerializableDataclass):
    x: str
    y: MyClass
    act_fun: torch.nn.Module = serializable_field(
        default=torch.nn.ReLU(),
        serialization_fn=lambda x: str(x),
        deserialize_fn=lambda x: getattr(torch.nn, x)(),
    )

which gives us:

>>> nc = NestedClass(x="q", y=MyClass(a=1, b="q"), act_fun=torch.nn.Sigmoid())
>>> s = json.dumps(nc.serialize())
>>> s
'{_FORMAT_KEY: "NestedClass(SerializableDataclass)", "x": "q", "y": {_FORMAT_KEY: "MyClass(SerializableDataclass)", "a": 1, "b": "q"}, "act_fun": "Sigmoid"}'
>>> read_nc = NestedClass.load(json.loads(s))
>>> read_nc == nc
True

def serialize

(self) -> dict[str, typing.Any]

View Source on GitHub

returns the class as a dict, implemented by using @serializable_dataclass decorator

def load

(cls: Type[~T], data: Union[dict[str, Any], ~T]) -> ~T

View Source on GitHub

takes in an appropriately structured dict and returns an instance of the class, implemented by using @serializable_dataclass decorator

def validate_fields_types

(
    self,
    on_typecheck_error: muutils.errormode.ErrorMode = ErrorMode.Except
) -> bool

View Source on GitHub

validate the types of all the fields on a SerializableDataclass. calls SerializableDataclass__validate_field_type for each field

def validate_field_type

(
    self,
    field: muutils.json_serialize.serializable_field.SerializableField | str,
    on_typecheck_error: muutils.errormode.ErrorMode = ErrorMode.Except
) -> bool

View Source on GitHub

given a dataclass, check the field matches the type hint

def diff

(
    self,
    other: muutils.json_serialize.serializable_dataclass.SerializableDataclass,
    of_serialized: bool = False
) -> dict[str, typing.Any]

View Source on GitHub

get a rich and recursive diff between two instances of a serializable dataclass

>>> Myclass(a=1, b=2).diff(Myclass(a=1, b=3))
{'b': {'self': 2, 'other': 3}}
>>> NestedClass(x="q1", y=Myclass(a=1, b=2)).diff(NestedClass(x="q2", y=Myclass(a=1, b=3)))
{'x': {'self': 'q1', 'other': 'q2'}, 'y': {'b': {'self': 2, 'other': 3}}}

Parameters:

Returns:

Raises:

def update_from_nested_dict

(self, nested_dict: dict[str, typing.Any])

View Source on GitHub

update the instance from a nested dict, useful for configuration from command line args

Parameters:

- `nested_dict : dict[str, Any]`
    nested dict to update the instance with
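The nested-update behavior can be sketched as a recursive helper (`nested_update_sketch` is an illustrative stand-in operating on plain dicts rather than dataclass instances):

```python
def nested_update_sketch(target: dict, updates: dict) -> dict:
    """Recursively merge `updates` into `target`, descending into sub-dicts."""
    for key, value in updates.items():
        if isinstance(value, dict) and isinstance(target.get(key), dict):
            nested_update_sketch(target[key], value)
        else:
            target[key] = value
    return target

cfg = {"model": {"layers": 4, "width": 128}, "lr": 1e-3}
nested_update_sketch(cfg, {"model": {"width": 256}})
print(cfg)  # → {'model': {'layers': 4, 'width': 256}, 'lr': 0.001}
```

This is why it is handy for command-line overrides: only the leaves you mention change, and sibling keys are left intact.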

docs for muutils v0.8.7

Contents

this utilities module handles serialization and loading of numpy and torch arrays as json

API Documentation

View Source on GitHub

muutils.json_serialize.array

this utilities module handles serialization and loading of numpy and torch arrays as json

View Source on GitHub

def array_n_elements

(arr) -> int

View Source on GitHub

get the number of elements in an array

def arr_metadata

(arr) -> dict[str, list[int] | str | int]

View Source on GitHub

get metadata for a numpy array

def serialize_array

(
    jser: "'JsonSerializer'",
    arr: numpy.ndarray,
    path: Union[str, Sequence[str | int]],
    array_mode: Optional[Literal['list', 'array_list_meta', 'array_hex_meta', 'array_b64_meta', 'external', 'zero_dim']] = None
) -> Union[bool, int, float, str, NoneType, List[Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]], Dict[str, Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]]]

View Source on GitHub

serialize a numpy or pytorch array in one of several modes

if the object is zero-dimensional, simply get the unique item

array_mode: ArrayMode can be one of:

- `list`: serialize as a list of values, no metadata (equivalent to `arr.tolist()`)
- `array_list_meta`: serialize dict with metadata, actual list under the key `data`
- `array_hex_meta`: serialize dict with metadata, actual hex string under the key `data`
- `array_b64_meta`: serialize dict with metadata, actual base64 string under the key `data`

for array_list_meta, array_hex_meta, and array_b64_meta, the serialized object is:

{
    _FORMAT_KEY: <array_list_meta|array_hex_meta>,
    "shape": arr.shape,
    "dtype": str(arr.dtype),
    "data": <arr.tolist()|arr.tobytes().hex()|base64.b64encode(arr.tobytes()).decode()>,
}
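The difference between the `data` encodings can be shown with the stdlib alone, using `array.array` as a stand-in for a numpy array (illustrative only; the real serializer also records `shape` and `dtype` as above):

```python
import array
import base64

arr = array.array("f", [1.0, 2.0, 3.0])  # stand-in for a float32 numpy array
raw = arr.tobytes()

list_data = arr.tolist()                   # array_list_meta "data"
hex_data = raw.hex()                       # array_hex_meta "data"
b64_data = base64.b64encode(raw).decode()  # array_b64_meta "data"

# both binary encodings recover the identical bytes
assert bytes.fromhex(hex_data) == raw
assert base64.b64decode(b64_data) == raw
# base64 is ~2/3 the size of hex for the same payload
print(len(hex_data), len(b64_data))  # 24 16
```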

Parameters:

Returns:

Raises:

def infer_array_mode

(
    arr: Union[bool, int, float, str, NoneType, List[Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]], Dict[str, Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]]]
) -> Literal['list', 'array_list_meta', 'array_hex_meta', 'array_b64_meta', 'external', 'zero_dim']

View Source on GitHub

given a serialized array, infer the mode

assumes the array was serialized via serialize_array()

def load_array

(
    arr: Union[bool, int, float, str, NoneType, List[Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]], Dict[str, Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]]],
    array_mode: Optional[Literal['list', 'array_list_meta', 'array_hex_meta', 'array_b64_meta', 'external', 'zero_dim']] = None
) -> Any

View Source on GitHub

load a json-serialized array, infer the mode if not specified


Contents

provides the basic framework for json serialization of objects

notably:

API Documentation

View Source on GitHub

muutils.json_serialize.json_serialize

provides the basic framework for json serialization of objects

notably:

View Source on GitHub

class SerializerHandler:

View Source on GitHub

a handler for a specific type of object

Parameters:

- `check : Callable[[JsonSerializer, Any], bool]` takes a JsonSerializer and an object, returns whether to use this handler
- `serialize : Callable[[JsonSerializer, Any, ObjectPath], JSONitem]` takes a JsonSerializer, an object, and the current path, returns the serialized object
- `desc : str` description of the handler (optional)
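The handler mechanism amounts to a first-match dispatch over (check, serialize) pairs. A minimal, hypothetical stand-in (not the actual JsonSerializer, which also threads the serializer instance and object path through each call):

```python
# minimal first-match handler chain, mimicking SerializerHandler dispatch
handlers = [
    (lambda obj: isinstance(obj, (bool, int, float, str, type(None))),
     lambda obj: obj),
    (lambda obj: isinstance(obj, dict),
     lambda obj: {str(k): serialize_sketch(v) for k, v in obj.items()}),
    (lambda obj: isinstance(obj, (list, tuple, set)),
     lambda obj: [serialize_sketch(x) for x in obj]),
    (lambda obj: True,  # fallback: stringify anything else
     lambda obj: str(obj)),
]

def serialize_sketch(obj):
    """Return the result of the first handler whose check accepts obj."""
    for check, serialize_func in handlers:
        if check(obj):
            return serialize_func(obj)

print(serialize_sketch({"a": (1, 2), "b": None}))  # → {'a': [1, 2], 'b': None}
```

Because dispatch is first-match, `handlers_pre` entries (checked before the defaults) can override any default behavior.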

SerializerHandler

(
    check: Callable[[muutils.json_serialize.json_serialize.JsonSerializer, Any, tuple[Union[str, int], ...]], bool],
    serialize_func: Callable[[muutils.json_serialize.json_serialize.JsonSerializer, Any, tuple[Union[str, int], ...]], Union[bool, int, float, str, NoneType, List[Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]], Dict[str, Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]]]],
    uid: str,
    desc: str
)

def serialize

(self) -> dict

View Source on GitHub

serialize the handler info

class JsonSerializer:

View Source on GitHub

Json serialization class (holds configs)

Parameters:

Raises:

JsonSerializer

(
    *args,
    array_mode: Literal['list', 'array_list_meta', 'array_hex_meta', 'array_b64_meta', 'external', 'zero_dim'] = 'array_list_meta',
    error_mode: muutils.errormode.ErrorMode = ErrorMode.Except,
    handlers_pre: None = (),
    handlers_default: None = (
        SerializerHandler(uid='base types', desc='base types (bool, int, float, str, None)'),
        SerializerHandler(uid='dictionaries', desc='dictionaries'),
        SerializerHandler(uid='(list, tuple) -> list', desc='lists and tuples as lists'),
        SerializerHandler(uid='.serialize override', desc='objects with .serialize method'),
        SerializerHandler(uid='namedtuple -> dict', desc='namedtuples as dicts'),
        SerializerHandler(uid='dataclass -> dict', desc='dataclasses as dicts'),
        SerializerHandler(uid='path -> str', desc='Path objects as posix strings'),
        SerializerHandler(uid='obj -> str(obj)', desc='directly serialize objects in `SERIALIZE_DIRECT_AS_STR` to strings'),
        SerializerHandler(uid='numpy.ndarray', desc='numpy arrays'),
        SerializerHandler(uid='torch.Tensor', desc='pytorch tensors'),
        SerializerHandler(uid='pandas.DataFrame', desc='pandas DataFrames'),
        SerializerHandler(uid='(set, list, tuple, Iterable) -> list', desc='sets, lists, tuples, and Iterables as lists'),
        SerializerHandler(uid='fallback', desc='fallback handler -- serialize object attributes and special functions as strings'),
    ),
    write_only_format: bool = False
)

View Source on GitHub

def json_serialize

(
    self,
    obj: Any,
    path: tuple[typing.Union[str, int], ...] = ()
) -> Union[bool, int, float, str, NoneType, List[Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]], Dict[str, Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]]]

View Source on GitHub

def hashify

(
    self,
    obj: Any,
    path: tuple[typing.Union[str, int], ...] = (),
    force: bool = True
) -> Union[bool, int, float, str, tuple]

View Source on GitHub

try to turn any object into something hashable

def json_serialize

(
    obj: Any,
    path: tuple[typing.Union[str, int], ...] = ()
) -> Union[bool, int, float, str, NoneType, List[Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]], Dict[str, Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]]]

View Source on GitHub

serialize object to json-serializable object with default config


Contents

save and load objects to and from json or compatible formats in a recoverable way

d = dataclasses.asdict(my_obj) will give you a dict, but if some fields are not json-serializable, you will get an error when you call json.dumps(d). This module provides a way around that.

Instead, you define your class:

@serializable_dataclass
class MyClass(SerializableDataclass):
    a: int
    b: str

and then you can call my_obj.serialize() to get a dict that can be serialized to json. So, you can do:

>>> my_obj = MyClass(a=1, b="q")
>>> s = json.dumps(my_obj.serialize())
>>> s
'{_FORMAT_KEY: "MyClass(SerializableDataclass)", "a": 1, "b": "q"}'
>>> read_obj = MyClass.load(json.loads(s))
>>> read_obj == my_obj
True

This isn’t too impressive on its own, but it gets more useful when you have nested classes, or fields that are not json-serializable by default:

@serializable_dataclass
class NestedClass(SerializableDataclass):
    x: str
    y: MyClass
    act_fun: torch.nn.Module = serializable_field(
        default=torch.nn.ReLU(),
        serialization_fn=lambda x: str(x),
        deserialize_fn=lambda x: getattr(torch.nn, x)(),
    )

which gives us:

>>> nc = NestedClass(x="q", y=MyClass(a=1, b="q"), act_fun=torch.nn.Sigmoid())
>>> s = json.dumps(nc.serialize())
>>> s
'{_FORMAT_KEY: "NestedClass(SerializableDataclass)", "x": "q", "y": {_FORMAT_KEY: "MyClass(SerializableDataclass)", "a": 1, "b": "q"}, "act_fun": "Sigmoid"}'
>>> read_nc = NestedClass.load(json.loads(s))
>>> read_nc == nc
True

API Documentation

View Source on GitHub

muutils.json_serialize.serializable_dataclass

save and load objects to and from json or compatible formats in a recoverable way

d = dataclasses.asdict(my_obj) will give you a dict, but if some fields are not json-serializable, you will get an error when you call json.dumps(d). This module provides a way around that.

Instead, you define your class:

@serializable_dataclass
class MyClass(SerializableDataclass):
    a: int
    b: str

and then you can call my_obj.serialize() to get a dict that can be serialized to json. So, you can do:

>>> my_obj = MyClass(a=1, b="q")
>>> s = json.dumps(my_obj.serialize())
>>> s
'{_FORMAT_KEY: "MyClass(SerializableDataclass)", "a": 1, "b": "q"}'
>>> read_obj = MyClass.load(json.loads(s))
>>> read_obj == my_obj
True

This isn’t too impressive on its own, but it gets more useful when you have nested classes, or fields that are not json-serializable by default:

@serializable_dataclass
class NestedClass(SerializableDataclass):
    x: str
    y: MyClass
    act_fun: torch.nn.Module = serializable_field(
        default=torch.nn.ReLU(),
        serialization_fn=lambda x: str(x),
        deserialize_fn=lambda x: getattr(torch.nn, x)(),
    )

which gives us:

>>> nc = NestedClass(x="q", y=MyClass(a=1, b="q"), act_fun=torch.nn.Sigmoid())
>>> s = json.dumps(nc.serialize())
>>> s
'{_FORMAT_KEY: "NestedClass(SerializableDataclass)", "x": "q", "y": {_FORMAT_KEY: "MyClass(SerializableDataclass)", "a": 1, "b": "q"}, "act_fun": "Sigmoid"}'
>>> read_nc = NestedClass.load(json.loads(s))
>>> read_nc == nc
True

View Source on GitHub

class CantGetTypeHintsWarning(builtins.UserWarning):

View Source on GitHub

special warning for when we can’t get type hints

Inherited Members

class ZanjMissingWarning(builtins.UserWarning):

View Source on GitHub

special warning for when ZANJ is missing – register_loader_serializable_dataclass will not work

Inherited Members

def zanj_register_loader_serializable_dataclass

(cls: Type[~T])

View Source on GitHub

Register a serializable dataclass with the ZANJ import

this allows ZANJ().read() to load the class and not just return plain dicts

TODO: there is some duplication here with register_loader_handler

class FieldIsNotInitOrSerializeWarning(builtins.UserWarning):

View Source on GitHub

warning for when a field is neither `init` nor `serialize`

Inherited Members

def SerializableDataclass__validate_field_type

(
    self: muutils.json_serialize.serializable_dataclass.SerializableDataclass,
    field: muutils.json_serialize.serializable_field.SerializableField | str,
    on_typecheck_error: muutils.errormode.ErrorMode = ErrorMode.Except
) -> bool

View Source on GitHub

given a dataclass, check the field matches the type hint

this function is used as <a href="#SerializableDataclass.validate_field_type">SerializableDataclass.validate_field_type</a>

Parameters:

Returns:

def SerializableDataclass__validate_fields_types__dict

(
    self: muutils.json_serialize.serializable_dataclass.SerializableDataclass,
    on_typecheck_error: muutils.errormode.ErrorMode = ErrorMode.Except
) -> dict[str, bool]

View Source on GitHub

validate the types of all the fields on a SerializableDataclass. calls SerializableDataclass__validate_field_type for each field

returns a dict of field names to bools, where the bool is if the field type is valid

def SerializableDataclass__validate_fields_types

(
    self: muutils.json_serialize.serializable_dataclass.SerializableDataclass,
    on_typecheck_error: muutils.errormode.ErrorMode = ErrorMode.Except
) -> bool

View Source on GitHub

validate the types of all the fields on a SerializableDataclass. calls SerializableDataclass__validate_field_type for each field

class SerializableDataclass(abc.ABC):

View Source on GitHub

Base class for serializable dataclasses

only for linting and type checking, still need to call serializable_dataclass decorator

Usage:

@serializable_dataclass
class MyClass(SerializableDataclass):
    a: int
    b: str

and then you can call my_obj.serialize() to get a dict that can be serialized to json. So, you can do:

>>> my_obj = MyClass(a=1, b="q")
>>> s = json.dumps(my_obj.serialize())
>>> s
'{_FORMAT_KEY: "MyClass(SerializableDataclass)", "a": 1, "b": "q"}'
>>> read_obj = MyClass.load(json.loads(s))
>>> read_obj == my_obj
True

This isn’t too impressive on its own, but it gets more useful when you have nested classes, or fields that are not json-serializable by default:

@serializable_dataclass
class NestedClass(SerializableDataclass):
    x: str
    y: MyClass
    act_fun: torch.nn.Module = serializable_field(
        default=torch.nn.ReLU(),
        serialization_fn=lambda x: str(x),
        deserialize_fn=lambda x: getattr(torch.nn, x)(),
    )

which gives us:

>>> nc = NestedClass(x="q", y=MyClass(a=1, b="q"), act_fun=torch.nn.Sigmoid())
>>> s = json.dumps(nc.serialize())
>>> s
'{_FORMAT_KEY: "NestedClass(SerializableDataclass)", "x": "q", "y": {_FORMAT_KEY: "MyClass(SerializableDataclass)", "a": 1, "b": "q"}, "act_fun": "Sigmoid"}'
>>> read_nc = NestedClass.load(json.loads(s))
>>> read_nc == nc
True

def serialize

(self) -> dict[str, typing.Any]

View Source on GitHub

returns the class as a dict, implemented by using @serializable_dataclass decorator

def load

(cls: Type[~T], data: Union[dict[str, Any], ~T]) -> ~T

View Source on GitHub

takes in an appropriately structured dict and returns an instance of the class, implemented by using @serializable_dataclass decorator

def validate_fields_types

(
    self,
    on_typecheck_error: muutils.errormode.ErrorMode = ErrorMode.Except
) -> bool

View Source on GitHub

validate the types of all the fields on a SerializableDataclass. calls SerializableDataclass__validate_field_type for each field

def validate_field_type

(
    self,
    field: muutils.json_serialize.serializable_field.SerializableField | str,
    on_typecheck_error: muutils.errormode.ErrorMode = ErrorMode.Except
) -> bool

View Source on GitHub

given a dataclass, check the field matches the type hint

def diff

(
    self,
    other: muutils.json_serialize.serializable_dataclass.SerializableDataclass,
    of_serialized: bool = False
) -> dict[str, typing.Any]

View Source on GitHub

get a rich and recursive diff between two instances of a serializable dataclass

>>> Myclass(a=1, b=2).diff(Myclass(a=1, b=3))
{'b': {'self': 2, 'other': 3}}
>>> NestedClass(x="q1", y=Myclass(a=1, b=2)).diff(NestedClass(x="q2", y=Myclass(a=1, b=3)))
{'x': {'self': 'q1', 'other': 'q2'}, 'y': {'b': {'self': 2, 'other': 3}}}

Parameters:

Returns:

Raises:

def update_from_nested_dict

(self, nested_dict: dict[str, typing.Any])

View Source on GitHub

update the instance from a nested dict, useful for configuration from command line args

Parameters:

- `nested_dict : dict[str, Any]`
    nested dict to update the instance with

def get_cls_type_hints_cached

(cls: Type[~T]) -> dict[str, typing.Any]

View Source on GitHub

cached typing.get_type_hints for a class

def get_cls_type_hints

(cls: Type[~T]) -> dict[str, typing.Any]

View Source on GitHub

helper function to get type hints for a class

class KWOnlyError(builtins.NotImplementedError):

View Source on GitHub

kw-only dataclasses are not supported in python < 3.10

Inherited Members

class FieldError(builtins.ValueError):

View Source on GitHub

base class for field errors

Inherited Members

class NotSerializableFieldException(FieldError):

View Source on GitHub

field is not a SerializableField

Inherited Members

class FieldSerializationError(FieldError):

View Source on GitHub

error while serializing a field

Inherited Members

class FieldLoadingError(FieldError):

View Source on GitHub

error while loading a field

Inherited Members

class FieldTypeMismatchError(FieldError, builtins.TypeError):

View Source on GitHub

error when a field type does not match the type hint

Inherited Members

def serializable_dataclass

(
    _cls=None,
    *,
    init: bool = True,
    repr: bool = True,
    eq: bool = True,
    order: bool = False,
    unsafe_hash: bool = False,
    frozen: bool = False,
    properties_to_serialize: Optional[list[str]] = None,
    register_handler: bool = True,
    on_typecheck_error: muutils.errormode.ErrorMode = ErrorMode.Except,
    on_typecheck_mismatch: muutils.errormode.ErrorMode = ErrorMode.Warn,
    methods_no_override: list[str] | None = None,
    **kwargs
)

View Source on GitHub

decorator to make a dataclass serializable. must also make it inherit from SerializableDataclass!!

types will be validated (like pydantic) unless on_typecheck_mismatch is set to ErrorMode.IGNORE

behavior of most kwargs matches that of dataclasses.dataclass, but with some additional kwargs. any kwargs not listed here are passed to dataclasses.dataclass

Returns the same class as was passed in, with dunder methods added based on the fields defined in the class.

Examines PEP 526 __annotations__ to determine fields.

If init is true, an __init__() method is added to the class. If repr is true, a __repr__() method is added. If order is true, rich comparison dunder methods are added. If unsafe_hash is true, a __hash__() method function is added. If frozen is true, fields may not be assigned to after instance creation.

@serializable_dataclass(kw_only=True)
class Myclass(SerializableDataclass):
    a: int
    b: str
>>> Myclass(a=1, b="q").serialize()
{_FORMAT_KEY: 'Myclass(SerializableDataclass)', 'a': 1, 'b': 'q'}

Parameters:

Returns:

Raises:


Contents

extends dataclasses.Field for use with SerializableDataclass

In particular, instead of using dataclasses.field, use serializable_field to define fields in a SerializableDataclass. You provide information on how the field should be serialized and loaded (as well as anything that goes into dataclasses.field) when you define the field, and the SerializableDataclass will automatically use those functions.

API Documentation

View Source on GitHub

muutils.json_serialize.serializable_field

extends dataclasses.Field for use with SerializableDataclass

In particular, instead of using dataclasses.field, use serializable_field to define fields in a SerializableDataclass. You provide information on how the field should be serialized and loaded (as well as anything that goes into dataclasses.field) when you define the field, and the SerializableDataclass will automatically use those functions.

View Source on GitHub

class SerializableField(dataclasses.Field):

View Source on GitHub

extension of dataclasses.Field with additional serialization properties

SerializableField

(
    default: Union[Any, dataclasses._MISSING_TYPE] = dataclasses.MISSING,
    default_factory: Union[Callable[[], Any], dataclasses._MISSING_TYPE] = dataclasses.MISSING,
    init: bool = True,
    repr: bool = True,
    hash: Optional[bool] = None,
    compare: bool = True,
    metadata: Optional[mappingproxy] = None,
    kw_only: Union[bool, dataclasses._MISSING_TYPE] = dataclasses.MISSING,
    serialize: bool = True,
    serialization_fn: Optional[Callable[[Any], Any]] = None,
    loading_fn: Optional[Callable[[Any], Any]] = None,
    deserialize_fn: Optional[Callable[[Any], Any]] = None,
    assert_type: bool = True,
    custom_typecheck_fn: Optional[Callable[[type], bool]] = None
)

View Source on GitHub

def from_Field

(
    cls,
    field: dataclasses.Field
) -> muutils.json_serialize.serializable_field.SerializableField

View Source on GitHub

copy all values from a dataclasses.Field to new SerializableField

def serializable_field

(
    *_args,
    default: Union[Any, dataclasses._MISSING_TYPE] = dataclasses.MISSING,
    default_factory: Union[Any, dataclasses._MISSING_TYPE] = dataclasses.MISSING,
    init: bool = True,
    repr: bool = True,
    hash: Optional[bool] = None,
    compare: bool = True,
    metadata: Optional[mappingproxy] = None,
    kw_only: Union[bool, dataclasses._MISSING_TYPE] = dataclasses.MISSING,
    serialize: bool = True,
    serialization_fn: Optional[Callable[[Any], Any]] = None,
    deserialize_fn: Optional[Callable[[Any], Any]] = None,
    assert_type: bool = True,
    custom_typecheck_fn: Optional[Callable[[type], bool]] = None,
    **kwargs: Any
) -> Any

View Source on GitHub

Create a new SerializableField

default: Sfield_T | dataclasses._MISSING_TYPE = dataclasses.MISSING,
default_factory: Callable[[], Sfield_T]
| dataclasses._MISSING_TYPE = dataclasses.MISSING,
init: bool = True,
repr: bool = True,
hash: Optional[bool] = None,
compare: bool = True,
metadata: types.MappingProxyType | None = None,
kw_only: bool | dataclasses._MISSING_TYPE = dataclasses.MISSING,
### ----------------------------------------------------------------------
### new in `SerializableField`, not in `dataclasses.Field`
serialize: bool = True,
serialization_fn: Optional[Callable[[Any], Any]] = None,
loading_fn: Optional[Callable[[Any], Any]] = None,
deserialize_fn: Optional[Callable[[Any], Any]] = None,
assert_type: bool = True,
custom_typecheck_fn: Optional[Callable[[type], bool]] = None,

new Parameters:

Gotchas:

`loading_fn` is passed the serialized dict for the whole class, not just this field's value, so it must index into that dict itself:

class MyClass:
    my_field: int = serializable_field(
        serialization_fn=lambda x: str(x),
        loading_fn=lambda x: int(x["my_field"])
    )

using deserialize_fn instead:

class MyClass:
    my_field: int = serializable_field(
        serialization_fn=lambda x: str(x),
        deserialize_fn=lambda x: int(x)
    )

In the above code, my_field is an int but will be serialized as a string.

note that if not using ZANJ, and you have a class inside a container, you MUST provide serialization_fn and loading_fn to serialize and load the container. ZANJ will automatically do this for you.

TODO: custom_value_check_fn: function taking the value of the field and returning whether the value itself is valid. if not provided, any value is valid as long as it passes the type test


Contents

utilities for json_serialize

API Documentation

View Source on GitHub

muutils.json_serialize.util

utilities for json_serialize

View Source on GitHub

class UniversalContainer:

View Source on GitHub

contains everything – x in UniversalContainer() is always True
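A class with this behavior needs only a `__contains__` that always succeeds (sketch of the idea, with a hypothetical name):

```python
class UniversalContainerSketch:
    """Membership test always succeeds: `x in c` is True for any x."""
    def __contains__(self, item) -> bool:
        return True

c = UniversalContainerSketch()
print(42 in c, "anything" in c, None in c)  # True True True
```

This is useful as a sentinel "allow everything" value wherever an API expects a container to test membership against.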

def isinstance_namedtuple

(x: Any) -> bool

View Source on GitHub

checks if x is a namedtuple

credit to https://stackoverflow.com/questions/2166818/how-to-check-if-an-object-is-an-instance-of-a-namedtuple
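The usual check, as in the linked Stack Overflow answer, is that a namedtuple is a tuple subclass carrying a `_fields` tuple of strings (sketch with a hypothetical name):

```python
from collections import namedtuple

def isinstance_namedtuple_sketch(x) -> bool:
    """True if x looks like a namedtuple instance."""
    t = type(x)
    if not (isinstance(x, tuple) and hasattr(t, "_fields")):
        return False
    return all(isinstance(name, str) for name in t._fields)

Point = namedtuple("Point", ["x", "y"])
print(isinstance_namedtuple_sketch(Point(1, 2)))  # True
print(isinstance_namedtuple_sketch((1, 2)))       # False
```

This is duck typing by necessity: namedtuples share no common base class beyond `tuple`, so `isinstance` alone cannot distinguish them.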

def try_catch

(func: Callable)

View Source on GitHub

wraps the function to catch exceptions, returns serialized error message on exception

returned func will return normal result on success, or error message on exception

class SerializationException(builtins.Exception):

View Source on GitHub

exception raised when serialization fails

Inherited Members

def string_as_lines

(s: str | None) -> list[str]

View Source on GitHub

for easier reading of long strings in json, split up by newlines

sort of like how jupyter notebooks do it

def safe_getsource

(func) -> list[str]

View Source on GitHub

def array_safe_eq

(a: Any, b: Any) -> bool

View Source on GitHub

check if two objects are equal, accounting for numpy arrays and torch tensors

def dc_eq

(
    dc1,
    dc2,
    except_when_class_mismatch: bool = False,
    false_when_class_mismatch: bool = True,
    except_when_field_mismatch: bool = False
) -> bool

View Source on GitHub

checks if two dataclasses which (might) hold numpy arrays are equal

Parameters:

Returns:

Raises:

TODO: after “except when class mismatch” is False, shouldn’t we then go to “field keys match”?

          [START]
             ▼
       ┌───────────┐  ┌─────────┐
       │dc1 is dc2?├─►│ classes │
       └──┬────────┘No│ match?  │
  ────    │           ├─────────┤
 (True)◄──┘Yes        │No       │Yes
  ────                ▼         ▼
      ┌────────────────┐ ┌────────────┐
      │ except when    │ │ field keys │
      │ class mismatch?│ │ match?     │
      ├───────────┬────┘ ├───────┬────┘
      │Yes        │No    │No     │Yes
      ▼           ▼      ▼       ▼
 ───────────  ┌──────────┐  ┌────────┐
{ raise     } │ except   │  │ field  │
{ TypeError } │ when     │  │ values │
 ───────────  │ field    │  │ match? │
              │ mismatch?│  ├────┬───┘
              ├───────┬──┘  │    │Yes
              │Yes    │No   │No  ▼
              ▼       ▼     │   ────
 ───────────────     ─────  │  (True)
{ raise         }   (False)◄┘   ────
{ AttributeError}    ─────
 ───────────────

class MonoTuple:

View Source on GitHub

tuple type hint, but for a tuple of any length with all the same type


Contents

utilities for reading and writing jsonlines files, including gzip support

API Documentation

View Source on GitHub

muutils.jsonlines

utilities for reading and writing jsonlines files, including gzip support

View Source on GitHub

def jsonl_load

(
    path: str,
    /,
    *,
    use_gzip: bool | None = None
) -> list[typing.Union[bool, int, float, str, NoneType, typing.List[typing.Union[bool, int, float, str, NoneType, typing.List[typing.Any], typing.Dict[str, typing.Any]]], typing.Dict[str, typing.Union[bool, int, float, str, NoneType, typing.List[typing.Any], typing.Dict[str, typing.Any]]]]]

View Source on GitHub

def jsonl_load_log

(path: str, /, *, use_gzip: bool | None = None) -> list[dict]

View Source on GitHub

def jsonl_write

(
    path: str,
    items: Sequence[Union[bool, int, float, str, NoneType, List[Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]], Dict[str, Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]]]],
    use_gzip: bool | None = None,
    gzip_compresslevel: int = 2
) -> None

View Source on GitHub
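The read/write behavior these functions provide can be sketched with the stdlib alone (hypothetical stand-ins, including the gzip path):

```python
import gzip
import json
import tempfile
from pathlib import Path

def jsonl_write_sketch(path: str, items, use_gzip: bool = False) -> None:
    """Write one JSON object per line, optionally gzip-compressed."""
    opener = gzip.open if use_gzip else open
    with opener(path, "wt") as f:
        for item in items:
            f.write(json.dumps(item) + "\n")

def jsonl_load_sketch(path: str, use_gzip: bool = False) -> list:
    """Read a jsonlines file back into a list of objects."""
    opener = gzip.open if use_gzip else open
    with opener(path, "rt") as f:
        return [json.loads(line) for line in f if line.strip()]

with tempfile.TemporaryDirectory() as d:
    path = str(Path(d) / "log.jsonl.gz")
    data = [{"step": 1, "loss": 0.5}, {"step": 2, "loss": 0.25}]
    jsonl_write_sketch(path, data, use_gzip=True)
    loaded = jsonl_load_sketch(path, use_gzip=True)

assert loaded == data
```

One object per line means files can be appended to and streamed line-by-line, which is why the format suits logs.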


Contents

anonymous getitem class

util for constructing a class which has a getitem method which just calls a function

a lambda is an anonymous function: kappa is the letter before lambda in the greek alphabet, hence the name of this class

API Documentation

View Source on GitHub

muutils.kappa

anonymous getitem class

util for constructing a class which has a getitem method which just calls a function

a lambda is an anonymous function: kappa is the letter before lambda in the greek alphabet, hence the name of this class

View Source on GitHub

class Kappa(typing.Mapping[~_kappa_K, ~_kappa_V]):

View Source on GitHub

A Mapping is a generic container for associating key/value pairs.

This class provides concrete generic implementations of all methods except for getitem, iter, and len.

Kappa

(func_getitem: Callable[[~_kappa_K], ~_kappa_V])

View Source on GitHub
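Since `Kappa` only needs `__getitem__` to defer to the wrapped function, the idea can be sketched as follows (a hypothetical minimal re-implementation, not the library's exact code; iteration and length are undefined because there is no concrete key set):

```python
from typing import Callable, Iterator, Mapping, TypeVar

K = TypeVar("K")
V = TypeVar("V")

class MiniKappa(Mapping[K, V]):
    # anonymous getitem: __getitem__ just calls the wrapped function
    def __init__(self, func_getitem: Callable[[K], V]) -> None:
        self.func_getitem = func_getitem

    def __getitem__(self, key: K) -> V:
        return self.func_getitem(key)

    def __iter__(self) -> Iterator[K]:
        raise NotImplementedError("Kappa-like objects have no concrete keys")

    def __len__(self) -> int:
        raise NotImplementedError("Kappa-like objects have no concrete keys")

k = MiniKappa(lambda x: x ** 2)
assert k[4] == 16
```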

Inherited Members

Contents

(deprecated) experimenting with logging utilities

Submodules

API Documentation

View Source on GitHub

muutils.logger

(deprecated) experimenting with logging utilities

View Source on GitHub

class Logger(muutils.logger.simplelogger.SimpleLogger):

View Source on GitHub

logger with more features, including log levels and streams

Parameters:

    - `log_path : str | None`
    default log file path
    (defaults to `None`)
    - `log_file : AnyIO | None`
    default log io, should have a `.write()` method (pass only this or `log_path`, not both)
    (defaults to `None`)
    - `timestamp : bool`
    whether to add timestamps to every log message (under the `_timestamp` key)
    (defaults to `True`)
    - `default_level : int`
    default log level for streams/messages that don't specify a level
    (defaults to `0`)
    - `console_print_threshold : int`
    log level at which to print to the console, anything greater will not be printed unless overridden by `console_print`
    (defaults to `50`)
    - `level_header : HeaderFunction`
    function for formatting log messages when printing to console
    (defaults to `HEADER_FUNCTIONS["md"]`)

Raises:

    - `ValueError` : if both `log_path` and `log_file` are given (pass only one)

Logger

(
    log_path: str | None = None,
    log_file: Union[TextIO, muutils.logger.simplelogger.NullIO, NoneType] = None,
    default_level: int = 0,
    console_print_threshold: int = 50,
    level_header: muutils.logger.headerfuncs.HeaderFunction = <function md_header_function>,
    streams: Union[dict[str | None, muutils.logger.loggingstream.LoggingStream], Sequence[muutils.logger.loggingstream.LoggingStream]] = (),
    keep_last_msg_time: bool = True,
    timestamp: bool = True,
    **kwargs
)

View Source on GitHub
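The interaction of `lvl`, `console_print_threshold`, and `console_print` described above can be sketched as follows (illustrative logic only, not the actual implementation; note that lower levels count as more important):

```python
import io
import json

def log_sketch(msg, lvl, log_file, console_print_threshold=50, console_print=False):
    # every message goes to the jsonl log file
    log_file.write(json.dumps({"msg": msg, "_lvl": lvl}) + "\n")
    # anything greater than the threshold is not printed unless forced
    if console_print or lvl <= console_print_threshold:
        print(f"[{lvl}] {msg}")

buf = io.StringIO()
log_sketch("starting run", lvl=10, log_file=buf)   # printed and logged
log_sketch("debug detail", lvl=100, log_file=buf)  # logged only
assert len(buf.getvalue().splitlines()) == 2
```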

def log

(
    self,
    msg: Union[bool, int, float, str, NoneType, List[Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]], Dict[str, Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]]] = None,
    lvl: int | None = None,
    stream: str | None = None,
    console_print: bool = False,
    extra_indent: str = '',
    **kwargs
)

View Source on GitHub

logging function

Parameters:

def log_elapsed_last

(
    self,
    lvl: int | None = None,
    stream: str | None = None,
    console_print: bool = True,
    **kwargs
) -> float

View Source on GitHub

logs the time elapsed since the last message was printed to the console (in any stream)

def flush_all

(self)

View Source on GitHub

flush all streams

class LoggingStream:

View Source on GitHub

properties of a logging stream

LoggingStream

(
    name: str | None,
    aliases: set[str | None] = <factory>,
    file: Union[str, bool, TextIO, muutils.logger.simplelogger.NullIO, NoneType] = None,
    default_level: int | None = None,
    default_contents: dict[str, typing.Callable[[], typing.Any]] = <factory>,
    handler: Union[TextIO, muutils.logger.simplelogger.NullIO, NoneType] = None
)

def make_handler

(self) -> Union[TextIO, muutils.logger.simplelogger.NullIO, NoneType]

View Source on GitHub

class SimpleLogger:

View Source on GitHub

logs training data to a jsonl file

SimpleLogger

(
    log_path: str | None = None,
    log_file: Union[TextIO, muutils.logger.simplelogger.NullIO, NoneType] = None,
    timestamp: bool = True
)

View Source on GitHub

def log

(
    self,
    msg: Union[bool, int, float, str, NoneType, List[Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]], Dict[str, Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]]],
    console_print: bool = False,
    **kwargs
)

View Source on GitHub

log a message to the log file, and optionally to the console

class TimerContext:

View Source on GitHub

context manager for timing code

API Documentation

View Source on GitHub

muutils.logger.exception_context

View Source on GitHub

class ExceptionContext:

View Source on GitHub

context manager which catches all exceptions happening while the context is open, .write() the exception trace to the given stream, and then raises the exception

for example:

errorfile = open('error.log', 'w')

with ExceptionContext(errorfile):
    # do something that might throw an exception;
    # if it does, the exception trace will be written to errorfile
    # and then the exception will be re-raised
    ...

ExceptionContext

(stream)

View Source on GitHub
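The behavior above can be sketched with a plain context manager (hypothetical minimal version, not the library's exact code; returning `False` from `__exit__` is what lets the exception propagate after being recorded):

```python
import io
import traceback

class ExceptionContextSketch:
    # write the traceback to `stream`, then let the exception propagate
    def __init__(self, stream):
        self.stream = stream

    def __enter__(self):
        return self

    def __exit__(self, exc_type, exc_value, exc_tb) -> bool:
        if exc_type is not None:
            self.stream.write(
                "".join(traceback.format_exception(exc_type, exc_value, exc_tb))
            )
        return False  # False -> the exception is re-raised

err = io.StringIO()
try:
    with ExceptionContextSketch(err):
        raise ValueError("boom")
except ValueError:
    pass
assert "ValueError: boom" in err.getvalue()
```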

API Documentation

View Source on GitHub

muutils.logger.headerfuncs

View Source on GitHub

class HeaderFunction(typing.Protocol):

View Source on GitHub

Base class for protocol classes.

Protocol classes are defined as::

class Proto(Protocol):
    def meth(self) -> int:
        ...

Such classes are primarily used with static type checkers that recognize structural subtyping (static duck-typing).

For example::

class C:
    def meth(self) -> int:
        return 0

def func(x: Proto) -> int:
    return x.meth()

func(C())  # Passes static type check

See PEP 544 for details. Protocol classes decorated with @typing.runtime_checkable act as simple-minded runtime protocols that check only the presence of given attributes, ignoring their type signatures. Protocol classes can be generic, they are defined as::

class GenProto[T](Protocol):
    def meth(self) -> T:
        ...

HeaderFunction

(*args, **kwargs)

View Source on GitHub

def md_header_function

(
    msg: Any,
    lvl: int,
    stream: str | None = None,
    indent_lvl: str = '  ',
    extra_indent: str = '',
    **kwargs
) -> str

View Source on GitHub

standard header function. will output

API Documentation

View Source on GitHub

muutils.logger.log_util

View Source on GitHub

def get_any_from_stream

(stream: list[dict], key: str) -> None

View Source on GitHub

get the first value of a key from a stream. errors if not found

def gather_log

(file: str) -> dict[str, list[dict]]

View Source on GitHub

gathers and sorts all streams from a log

def gather_stream

(file: str, stream: str) -> list[dict]

View Source on GitHub

gets all entries from a specific stream in a log file

def gather_val

(
    file: str,
    stream: str,
    keys: tuple[str],
    allow_skip: bool = True
) -> list[list]

View Source on GitHub

gather specific keys from a specific stream in a log file

example: if “log.jsonl” has contents:

{"a": 1, "b": 2, "c": 3, "_stream": "s1"}
{"a": 4, "b": 5, "c": 6, "_stream": "s1"}
{"a": 7, "b": 8, "c": 9, "_stream": "s2"}

then gather_val("log.jsonl", "s1", ("a", "b")) will return

[
    [1, 2],
    [4, 5]
]
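The example above can be reproduced with a stdlib-only sketch of the same filtering logic (hypothetical name; the real function also handles `allow_skip`):

```python
import json

def gather_val_sketch(lines, stream, keys):
    # pick the given keys from every entry belonging to one stream
    out = []
    for line in lines:
        entry = json.loads(line)
        if entry.get("_stream") == stream:
            out.append([entry[k] for k in keys])
    return out

log_lines = [
    '{"a": 1, "b": 2, "c": 3, "_stream": "s1"}',
    '{"a": 4, "b": 5, "c": 6, "_stream": "s1"}',
    '{"a": 7, "b": 8, "c": 9, "_stream": "s2"}',
]
assert gather_val_sketch(log_lines, "s1", ("a", "b")) == [[1, 2], [4, 5]]
```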

Contents

logger with streams & levels, and a timer context manager

API Documentation

View Source on GitHub

muutils.logger.logger

logger with streams & levels, and a timer context manager

View Source on GitHub

def decode_level

(level: int) -> str

View Source on GitHub

class Logger(muutils.logger.simplelogger.SimpleLogger):

View Source on GitHub

logger with more features, including log levels and streams

Parameters:

    - `log_path : str | None`
    default log file path
    (defaults to `None`)
    - `log_file : AnyIO | None`
    default log io, should have a `.write()` method (pass only this or `log_path`, not both)
    (defaults to `None`)
    - `timestamp : bool`
    whether to add timestamps to every log message (under the `_timestamp` key)
    (defaults to `True`)
    - `default_level : int`
    default log level for streams/messages that don't specify a level
    (defaults to `0`)
    - `console_print_threshold : int`
    log level at which to print to the console, anything greater will not be printed unless overridden by `console_print`
    (defaults to `50`)
    - `level_header : HeaderFunction`
    function for formatting log messages when printing to console
    (defaults to `HEADER_FUNCTIONS["md"]`)

Raises:

    - `ValueError` : if both `log_path` and `log_file` are given (pass only one)

Logger

(
    log_path: str | None = None,
    log_file: Union[TextIO, muutils.logger.simplelogger.NullIO, NoneType] = None,
    default_level: int = 0,
    console_print_threshold: int = 50,
    level_header: muutils.logger.headerfuncs.HeaderFunction = <function md_header_function>,
    streams: Union[dict[str | None, muutils.logger.loggingstream.LoggingStream], Sequence[muutils.logger.loggingstream.LoggingStream]] = (),
    keep_last_msg_time: bool = True,
    timestamp: bool = True,
    **kwargs
)

View Source on GitHub

def log

(
    self,
    msg: Union[bool, int, float, str, NoneType, List[Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]], Dict[str, Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]]] = None,
    lvl: int | None = None,
    stream: str | None = None,
    console_print: bool = False,
    extra_indent: str = '',
    **kwargs
)

View Source on GitHub

logging function

Parameters:

def log_elapsed_last

(
    self,
    lvl: int | None = None,
    stream: str | None = None,
    console_print: bool = True,
    **kwargs
) -> float

View Source on GitHub

logs the time elapsed since the last message was printed to the console (in any stream)

def flush_all

(self)

View Source on GitHub

flush all streams

API Documentation

View Source on GitHub

muutils.logger.loggingstream

View Source on GitHub

class LoggingStream:

View Source on GitHub

properties of a logging stream

LoggingStream

(
    name: str | None,
    aliases: set[str | None] = <factory>,
    file: Union[str, bool, TextIO, muutils.logger.simplelogger.NullIO, NoneType] = None,
    default_level: int | None = None,
    default_contents: dict[str, typing.Callable[[], typing.Any]] = <factory>,
    handler: Union[TextIO, muutils.logger.simplelogger.NullIO, NoneType] = None
)

def make_handler

(self) -> Union[TextIO, muutils.logger.simplelogger.NullIO, NoneType]

View Source on GitHub

API Documentation

View Source on GitHub

muutils.logger.simplelogger

View Source on GitHub

class NullIO:

View Source on GitHub

null IO class

def write

(self, msg: str) -> int

View Source on GitHub

write to nothing! this throws away the message

def flush

(self) -> None

View Source on GitHub

flush nothing! this is a no-op

def close

(self) -> None

View Source on GitHub

close nothing! this is a no-op
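A stand-in with the same interface is tiny; the `int` return value here is an assumption for illustration (mimicking what a real file's `write` reports):

```python
class NullIOSketch:
    # file-like sink that discards everything written to it
    def write(self, msg: str) -> int:
        # assumption: report the message length, as a real write() would
        return len(msg)

    def flush(self) -> None:
        pass  # nothing buffered, nothing to flush

    def close(self) -> None:
        pass  # nothing open, nothing to close

sink = NullIOSketch()
assert sink.write("hello") == 5
```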

class SimpleLogger:

View Source on GitHub

logs training data to a jsonl file

SimpleLogger

(
    log_path: str | None = None,
    log_file: Union[TextIO, muutils.logger.simplelogger.NullIO, NoneType] = None,
    timestamp: bool = True
)

View Source on GitHub

def log

(
    self,
    msg: Union[bool, int, float, str, NoneType, List[Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]], Dict[str, Union[bool, int, float, str, NoneType, List[Any], Dict[str, Any]]]],
    console_print: bool = False,
    **kwargs
)

View Source on GitHub

log a message to the log file, and optionally to the console

API Documentation

View Source on GitHub

muutils.logger.timing

View Source on GitHub

class TimerContext:

View Source on GitHub

context manager for timing code
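A minimal sketch of such a timing context manager (hypothetical, not the library's exact code) looks like this:

```python
import time

class TimerContextSketch:
    # record elapsed wall-clock seconds for the `with` block
    def __enter__(self):
        self.start = time.perf_counter()
        return self

    def __exit__(self, *exc) -> bool:
        self.elapsed = time.perf_counter() - self.start
        return False  # never swallow exceptions

with TimerContextSketch() as timer:
    sum(range(10_000))
assert timer.elapsed >= 0.0
```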

def filter_time_str

(time: str) -> str

View Source on GitHub

assuming format h:mm:ss, clips off the hours if they are 0

class ProgressEstimator:

View Source on GitHub

estimates progress and can give a progress bar

ProgressEstimator

(
    n_total: int,
    pbar_fill: str = '█',
    pbar_empty: str = ' ',
    pbar_bounds: tuple[str, str] = ('|', '|')
)

View Source on GitHub

def get_timing_raw

(self, i: int) -> dict[str, float]

View Source on GitHub

returns dict(elapsed, per_iter, remaining, percent)

def get_pbar

(self, i: int, width: int = 30) -> str

View Source on GitHub

returns a progress bar
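The bar rendering can be sketched from the constructor parameters above (hypothetical standalone function; the real method works from the estimator's state):

```python
def pbar_sketch(i: int, n_total: int, width: int = 30,
                fill: str = "█", empty: str = " ",
                bounds: tuple = ("|", "|")) -> str:
    # render a fixed-width text progress bar at step i of n_total
    n_filled = int(width * i / n_total)
    return bounds[0] + fill * n_filled + empty * (width - n_filled) + bounds[1]

assert pbar_sketch(15, 30, width=10) == "|█████     |"
```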

def get_progress_default

(self, i: int) -> str

View Source on GitHub

returns a progress string

Contents

miscellaneous utilities

Submodules

API Documentation

View Source on GitHub

muutils.misc

miscellaneous utilities

View Source on GitHub

def stable_hash

(s: str | bytes) -> int

View Source on GitHub

Returns a stable hash of the given string. not cryptographically secure, but stable between runs
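The point is that builtin `hash()` is salted per-process while a sha256 digest is not. A sketch of the idea (the digest-to-int step here is an assumption for illustration; the real function also uses `hashlib.sha256` but may convert differently):

```python
import hashlib

def stable_hash_sketch(s) -> int:
    # sha256 digest -> int; stable across python runs, unlike builtin hash()
    if isinstance(s, str):
        s = s.encode("utf-8")
    return int.from_bytes(hashlib.sha256(s).digest()[:8], "big")

assert stable_hash_sketch("hello") == stable_hash_sketch("hello")
assert stable_hash_sketch("hello") == stable_hash_sketch(b"hello")
```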

def empty_sequence_if_attr_false

(itr: Iterable[Any], attr_owner: Any, attr_name: str) -> Iterable[Any]

View Source on GitHub

Returns itr if attr_owner has the attribute attr_name and it boolean casts to True. Returns an empty sequence otherwise.

Particularly useful for optionally inserting delimiters into a sequence depending on a TokenizerElement attribute.

Parameters:

Returns:

def flatten

(it: Iterable[Any], levels_to_flatten: int | None = None) -> Generator

View Source on GitHub

Flattens an arbitrarily nested iterable. Flattens all iterable data types except for str and bytes.

Returns

Generator over the flattened sequence.

Parameters
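The behavior above (recursive flattening with `str`/`bytes` treated as atoms, optionally limited by depth) can be sketched as:

```python
from typing import Any, Generator, Iterable, Optional

def flatten_sketch(it: Iterable[Any], levels: Optional[int] = None) -> Generator:
    # yield leaves; str and bytes count as atoms, not iterables
    for x in it:
        is_nested = hasattr(x, "__iter__") and not isinstance(x, (str, bytes))
        if is_nested and (levels is None or levels > 0):
            yield from flatten_sketch(x, None if levels is None else levels - 1)
        else:
            yield x

assert list(flatten_sketch([1, [2, [3, "ab"]]])) == [1, 2, 3, "ab"]
assert list(flatten_sketch([[1], [[2]]], levels=1)) == [1, [2]]
```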

def list_split

(lst: list, val: Any) -> list[list]

View Source on GitHub

split a list into sublists by val. similar to "a_b_c".split("_")

>>> list_split([1,2,3,0,4,5,0,6], 0)
[[1, 2, 3], [4, 5], [6]]
>>> list_split([0,1,2,3], 0)
[[], [1, 2, 3]]
>>> list_split([1,2,3], 0)
[[1, 2, 3]]
>>> list_split([], 0)
[[]]
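The doctests above can be reproduced with a short stdlib-only sketch of the same semantics (hypothetical name):

```python
from typing import Any

def list_split_sketch(lst: list, val: Any) -> list:
    # start a new sublist at each occurrence of val, like str.split
    out = [[]]
    for x in lst:
        if x == val:
            out.append([])
        else:
            out[-1].append(x)
    return out

assert list_split_sketch([1, 2, 3, 0, 4, 5, 0, 6], 0) == [[1, 2, 3], [4, 5], [6]]
assert list_split_sketch([], 0) == [[]]
```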

def list_join

(lst: list, factory: Callable) -> list

View Source on GitHub

add a new instance of factory() between each element of lst

>>> list_join([1,2,3], lambda : 0)
[1, 0, 2, 0, 3]
>>> list_join([1,2,3], lambda: [time.sleep(0.1), time.time()][1])
[1, 1600000000.0, 2, 1600000000.1, 3]

def apply_mapping

(
    mapping: Mapping[~_AM_K, ~_AM_V],
    iter: Iterable[~_AM_K],
    when_missing: Literal['except', 'skip', 'include'] = 'skip'
) -> list[typing.Union[~_AM_K, ~_AM_V]]

View Source on GitHub

Given an iterable and a mapping, apply the mapping to the iterable with certain options

Gotcha: an invalid when_missing value is accepted silently; an error is only raised once a missing key is actually encountered.

Note: you can use this with muutils.kappa.Kappa if you want to pass a function instead of a dict

Parameters:

Returns:

return type is one of:

- `list[_AM_V]` if `when_missing` is `"skip"` or `"except"`
- `list[Union[_AM_K, _AM_V]]` if `when_missing` is `"include"`

Raises:
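The three `when_missing` behaviors, and the gotcha above, can be sketched with plain dict lookups (hypothetical stdlib version of the semantics described, not the library's code):

```python
def apply_mapping_sketch(mapping, it, when_missing="skip"):
    out = []
    for item in it:
        if item in mapping:
            out.append(mapping[item])
        elif when_missing == "include":
            out.append(item)
        elif when_missing == "except":
            raise KeyError(item)
        elif when_missing != "skip":
            # the gotcha: an invalid value is only noticed here
            raise ValueError(f"invalid when_missing: {when_missing!r}")
    return out

assert apply_mapping_sketch({1: "a", 2: "b"}, [1, 2, 3]) == ["a", "b"]
assert apply_mapping_sketch({1: "a"}, [1, 3], when_missing="include") == ["a", 3]
assert apply_mapping_sketch({1: "a"}, [1], when_missing="bogus") == ["a"]  # gotcha: no error
```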

def apply_mapping_chain

(
    mapping: Mapping[~_AM_K, Iterable[~_AM_V]],
    iter: Iterable[~_AM_K],
    when_missing: Literal['except', 'skip', 'include'] = 'skip'
) -> list[typing.Union[~_AM_K, ~_AM_V]]

View Source on GitHub

Given an iterable and a mapping, chain the mappings together

Gotcha: an invalid when_missing value is accepted silently; an error is only raised once a missing key is actually encountered.

Note: you can use this with muutils.kappa.Kappa if you want to pass a function instead of a dict

Parameters:

Returns:

return type is one of:

- `list[_AM_V]` if `when_missing` is `"skip"` or `"except"`
- `list[Union[_AM_K, _AM_V]]` if `when_missing` is `"include"`

Raises:

def sanitize_name

(
    name: str | None,
    additional_allowed_chars: str = '',
    replace_invalid: str = '',
    when_none: str | None = '_None_',
    leading_digit_prefix: str = ''
) -> str

View Source on GitHub

sanitize a string, leaving only alphanumerics and additional_allowed_chars

Parameters:

Returns:
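The core of this is a character filter, sketchable with one regex (hypothetical minimal version; it omits the `when_none` and `leading_digit_prefix` handling listed above):

```python
import re

def sanitize_name_sketch(name: str, additional_allowed_chars: str = "",
                         replace_invalid: str = "") -> str:
    # keep alphanumerics plus any extra allowed characters; replace the rest
    allowed = re.escape(additional_allowed_chars)
    return re.sub(rf"[^A-Za-z0-9{allowed}]", replace_invalid, name)

assert sanitize_name_sketch("my file:v2") == "myfilev2"
assert sanitize_name_sketch("my file:v2", additional_allowed_chars="_-",
                            replace_invalid="_") == "my_file_v2"
```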

def sanitize_fname

(fname: str | None, **kwargs) -> str

View Source on GitHub

sanitize a filename to posix standards

def sanitize_identifier

(fname: str | None, **kwargs) -> str

View Source on GitHub

sanitize an identifier (variable or function name)

def dict_to_filename

(
    data: dict,
    format_str: str = '{key}_{val}',
    separator: str = '.',
    max_length: int = 255
)

View Source on GitHub

def dynamic_docstring

(**doc_params)

View Source on GitHub

def shorten_numerical_to_str

(
    num: int | float,
    small_as_decimal: bool = True,
    precision: int = 1
) -> str

View Source on GitHub

shorten a large numerical value to a string, e.g. 1234 -> "1K"

precision guaranteed to 1 part in 10, but can be higher. reverse of str_to_numeric
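The suffix logic can be sketched as follows (hypothetical illustration; rounding and suffix details differ from the real function):

```python
def shorten_num_sketch(num: float, precision: int = 1) -> str:
    # divide by the largest matching scale and attach its suffix
    for suffix, scale in (("B", 1e9), ("M", 1e6), ("K", 1e3)):
        if abs(num) >= scale:
            shortened = f"{num / scale:.{precision}f}".rstrip("0").rstrip(".")
            return shortened + suffix
    return str(num)

assert shorten_num_sketch(1234) == "1.2K"
assert shorten_num_sketch(1500000) == "1.5M"
```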

def str_to_numeric

(
    quantity: str,
    mapping: None | bool | dict[str, int | float] = True
) -> int | float

View Source on GitHub

Convert a string representing a quantity to a numeric value.

The string can represent an integer, python float, fraction, or shortened via shorten_numerical_to_str.

Examples:

>>> str_to_numeric("5")
5
>>> str_to_numeric("0.1")
0.1
>>> str_to_numeric("1/5")
0.2
>>> str_to_numeric("-1K")
-1000.0
>>> str_to_numeric("1.5M")
1500000.0
>>> str_to_numeric("1.2e2")
120.0

class FrozenDict(builtins.dict):

View Source on GitHub

Inherited Members

class FrozenList(builtins.list):

View Source on GitHub

Built-in mutable sequence.

If no argument is given, the constructor creates a new empty list. The argument must be an iterable if specified.

def append

(self, value)

View Source on GitHub

Append object to the end of the list.

def extend

(self, iterable)

View Source on GitHub

Extend list by appending elements from the iterable.

def insert

(self, index, value)

View Source on GitHub

Insert object before index.

def remove

(self, value)

View Source on GitHub

Remove first occurrence of value.

Raises ValueError if the value is not present.

def pop

(self, index=-1)

View Source on GitHub

Remove and return item at index (default last).

Raises IndexError if list is empty or index is out of range.

def clear

(self)

View Source on GitHub

Remove all items from list.

Inherited Members

def freeze

(instance: Any) -> Any

View Source on GitHub

recursively freeze an object in-place so that its attributes and elements cannot be changed

messy in the sense that sometimes the object is modified in place, but you can’t rely on that. always use the return value.

the gelidum package is a more complete implementation of this idea

def is_abstract

(cls: type) -> bool

View Source on GitHub

Returns whether a class is abstract.

def get_all_subclasses

(class_: type, include_self=False) -> set[type]

View Source on GitHub

Returns a set containing all child classes in the subclass graph of class_. I.e., includes subclasses of subclasses, etc.

Parameters

Development

Since most class hierarchies are small, the inefficiencies of the existing recursive implementation aren’t problematic. It might be valuable to refactor with memoization if the need arises to use this function on a very large class hierarchy.
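The recursive implementation described above amounts to walking `__subclasses__()` (hypothetical sketch, not the library's exact code):

```python
def get_all_subclasses_sketch(class_: type, include_self: bool = False) -> set:
    # recursive walk over __subclasses__(), covering subclasses of subclasses
    result = set()
    if include_self:
        result.add(class_)
    for sub in class_.__subclasses__():
        result |= get_all_subclasses_sketch(sub, include_self=True)
    return result

class A: pass
class B(A): pass
class C(B): pass

assert get_all_subclasses_sketch(A) == {B, C}
assert get_all_subclasses_sketch(A, include_self=True) == {A, B, C}
```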

def isinstance_by_type_name

(o: object, type_name: str)

View Source on GitHub

Behaves like stdlib isinstance except it accepts a string representation of the type rather than the type itself. This is a hacky function intended to circumvent the need to import a type into a module. It is susceptible to type name collisions.

Parameters

o: Object (not the type itself) whose type to interrogate
type_name: The string returned by type_.__name__. Generic types are not supported, only types that would appear in type_.__mro__.

class IsDataclass(typing.Protocol):

View Source on GitHub

Base class for protocol classes.

Protocol classes are defined as::

class Proto(Protocol):
    def meth(self) -> int:
        ...

Such classes are primarily used with static type checkers that recognize structural subtyping (static duck-typing).

For example::

class C:
    def meth(self) -> int:
        return 0

def func(x: Proto) -> int:
    return x.meth()

func(C())  # Passes static type check

See PEP 544 for details. Protocol classes decorated with @typing.runtime_checkable act as simple-minded runtime protocols that check only the presence of given attributes, ignoring their type signatures. Protocol classes can be generic, they are defined as::

class GenProto[T](Protocol):
    def meth(self) -> T:
        ...

IsDataclass

(*args, **kwargs)

View Source on GitHub

def get_hashable_eq_attrs

(dc: muutils.misc.classes.IsDataclass) -> tuple[typing.Any]

View Source on GitHub

Returns a tuple of all fields used for equality comparison, including the type of the dataclass itself. The type is included to preserve the unequal equality behavior of instances of different dataclasses whose fields are identical. Essentially used to generate a hashable dataclass representation for equality comparison even if it’s not frozen.

def dataclass_set_equals

(
    coll1: Iterable[muutils.misc.classes.IsDataclass],
    coll2: Iterable[muutils.misc.classes.IsDataclass]
) -> bool

View Source on GitHub

Compares 2 collections of dataclass instances as if they were sets. Duplicates are ignored in the same manner as a set. Unfrozen dataclasses can’t be placed in sets since they’re not hashable. Collections of them may be compared using this function.

API Documentation

View Source on GitHub

muutils.misc.classes

View Source on GitHub

def is_abstract

(cls: type) -> bool

View Source on GitHub

Returns whether a class is abstract.

def get_all_subclasses

(class_: type, include_self=False) -> set[type]

View Source on GitHub

Returns a set containing all child classes in the subclass graph of class_. I.e., includes subclasses of subclasses, etc.

Parameters

Development

Since most class hierarchies are small, the inefficiencies of the existing recursive implementation aren’t problematic. It might be valuable to refactor with memoization if the need arises to use this function on a very large class hierarchy.

def isinstance_by_type_name

(o: object, type_name: str)

View Source on GitHub

Behaves like stdlib isinstance except it accepts a string representation of the type rather than the type itself. This is a hacky function intended to circumvent the need to import a type into a module. It is susceptible to type name collisions.

Parameters

o: Object (not the type itself) whose type to interrogate
type_name: The string returned by type_.__name__. Generic types are not supported, only types that would appear in type_.__mro__.

class IsDataclass(typing.Protocol):

View Source on GitHub

Base class for protocol classes.

Protocol classes are defined as::

class Proto(Protocol):
    def meth(self) -> int:
        ...

Such classes are primarily used with static type checkers that recognize structural subtyping (static duck-typing).

For example::

class C:
    def meth(self) -> int:
        return 0

def func(x: Proto) -> int:
    return x.meth()

func(C())  # Passes static type check

See PEP 544 for details. Protocol classes decorated with @typing.runtime_checkable act as simple-minded runtime protocols that check only the presence of given attributes, ignoring their type signatures. Protocol classes can be generic, they are defined as::

class GenProto[T](Protocol):
    def meth(self) -> T:
        ...

IsDataclass

(*args, **kwargs)

View Source on GitHub

def get_hashable_eq_attrs

(dc: muutils.misc.classes.IsDataclass) -> tuple[typing.Any]

View Source on GitHub

Returns a tuple of all fields used for equality comparison, including the type of the dataclass itself. The type is included to preserve the unequal equality behavior of instances of different dataclasses whose fields are identical. Essentially used to generate a hashable dataclass representation for equality comparison even if it’s not frozen.

def dataclass_set_equals

(
    coll1: Iterable[muutils.misc.classes.IsDataclass],
    coll2: Iterable[muutils.misc.classes.IsDataclass]
) -> bool

View Source on GitHub

Compares 2 collections of dataclass instances as if they were sets. Duplicates are ignored in the same manner as a set. Unfrozen dataclasses can’t be placed in sets since they’re not hashable. Collections of them may be compared using this function.

API Documentation

View Source on GitHub

muutils.misc.freezing

View Source on GitHub

class FrozenDict(builtins.dict):

View Source on GitHub

Inherited Members

class FrozenList(builtins.list):

View Source on GitHub

Built-in mutable sequence.

If no argument is given, the constructor creates a new empty list. The argument must be an iterable if specified.

def append

(self, value)

View Source on GitHub

Append object to the end of the list.

def extend

(self, iterable)

View Source on GitHub

Extend list by appending elements from the iterable.

def insert

(self, index, value)

View Source on GitHub

Insert object before index.

def remove

(self, value)

View Source on GitHub

Remove first occurrence of value.

Raises ValueError if the value is not present.

def pop

(self, index=-1)

View Source on GitHub

Remove and return item at index (default last).

Raises IndexError if list is empty or index is out of range.

def clear

(self)

View Source on GitHub

Remove all items from list.

Inherited Members

def freeze

(instance: Any) -> Any

View Source on GitHub

recursively freeze an object in-place so that its attributes and elements cannot be changed

messy in the sense that sometimes the object is modified in place, but you can’t rely on that. always use the return value.

the gelidum package is a more complete implementation of this idea

API Documentation

View Source on GitHub

muutils.misc.func

View Source on GitHub

def process_kwarg

(
    kwarg_name: str,
    processor: Callable[[~T_process_in], ~T_process_out]
) -> Callable[[Callable[~FuncParamsPreWrap, ~ReturnType]], Callable[~FuncParams, ~ReturnType]]

View Source on GitHub

Decorator that applies a processor to a keyword argument.

The underlying function is expected to have a keyword argument (with name kwarg_name) of type T_out, but the caller provides a value of type T_in that is converted via processor.

Parameters:

Returns:
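The conversion step can be sketched with a plain decorator (hypothetical stdlib version of the described behavior, not the library's code):

```python
import functools

def process_kwarg_sketch(kwarg_name, processor):
    # run `processor` on one keyword argument before calling the function
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            if kwarg_name in kwargs:
                kwargs[kwarg_name] = processor(kwargs[kwarg_name])
            return func(*args, **kwargs)
        return wrapper
    return decorator

@process_kwarg_sketch("n", int)
def repeat(s: str, *, n: int) -> str:
    # caller may pass n as a str (T_in); the body sees an int (T_out)
    return s * n

assert repeat("ab", n="3") == "ababab"
```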

def validate_kwarg

(
    kwarg_name: str,
    validator: Callable[[~T_kwarg], bool],
    description: str | None = None,
    action: muutils.errormode.ErrorMode = ErrorMode.Except
) -> Callable[[Callable[~FuncParams, ~ReturnType]], Callable[~FuncParams, ~ReturnType]]

View Source on GitHub

Decorator that validates a specific keyword argument.

Parameters:

Returns:

Modifies:

Usage:

@validate_kwarg("x", lambda val: val > 0, "Invalid {kwarg_name}: {value}")
def my_func(x: int) -> int:
    return x

assert my_func(x=1) == 1

Raises:

def replace_kwarg

(
    kwarg_name: str,
    check: Callable[[~T_kwarg], bool],
    replacement_value: ~T_kwarg,
    replace_if_missing: bool = False
) -> Callable[[Callable[~FuncParams, ~ReturnType]], Callable[~FuncParams, ~ReturnType]]

View Source on GitHub

Decorator that replaces a specific keyword argument value by identity comparison.

Parameters:

Returns:

Modifies:

Usage:

@replace_kwarg("x", None, "default_string")
def my_func(*, x: str | None = None) -> str:
    return x

assert my_func(x=None) == "default_string"

def is_none

(value: Any) -> bool

View Source on GitHub

def always_true

(value: Any) -> bool

View Source on GitHub

def always_false

(value: Any) -> bool

View Source on GitHub

def format_docstring

(
    **fmt_kwargs: Any
) -> Callable[[Callable[~FuncParams, ~ReturnType]], Callable[~FuncParams, ~ReturnType]]

View Source on GitHub

Decorator that formats a function’s docstring with the provided keyword arguments.

def typed_lambda

(
    fn: Callable[[Unpack[LambdaArgs]], ~ReturnType],
    in_types: ~LambdaArgsTypes,
    out_type: type[~ReturnType]
) -> Callable[[Unpack[LambdaArgs]], ~ReturnType]

View Source on GitHub

Wraps a lambda function with type hints.

Parameters:

Returns:

Usage:

add = typed_lambda(lambda x, y: x + y, (int, int), int)
assert add(1, 2) == 3

Raises:

API Documentation

View Source on GitHub

muutils.misc.hashing

View Source on GitHub

def stable_hash

(s: str | bytes) -> int

View Source on GitHub

Returns a stable hash of the given string. not cryptographically secure, but stable between runs

def stable_json_dumps

(d) -> str

View Source on GitHub

def base64_hash

(s: str | bytes) -> str

View Source on GitHub

Returns a base64 representation of the hash of the given string. not cryptographically secure

API Documentation

View Source on GitHub

muutils.misc.numerical

View Source on GitHub

def shorten_numerical_to_str

(
    num: int | float,
    small_as_decimal: bool = True,
    precision: int = 1
) -> str

View Source on GitHub

shorten a large numerical value to a string, e.g. 1234 -> "1K"

precision guaranteed to 1 part in 10, but can be higher. reverse of str_to_numeric

def str_to_numeric

(
    quantity: str,
    mapping: None | bool | dict[str, int | float] = True
) -> int | float

View Source on GitHub

Convert a string representing a quantity to a numeric value.

The string can represent an integer, python float, fraction, or shortened via shorten_numerical_to_str.

Examples:

>>> str_to_numeric("5")
5
>>> str_to_numeric("0.1")
0.1
>>> str_to_numeric("1/5")
0.2
>>> str_to_numeric("-1K")
-1000.0
>>> str_to_numeric("1.5M")
1500000.0
>>> str_to_numeric("1.2e2")
120.0

API Documentation

View Source on GitHub

muutils.misc.sequence

View Source on GitHub

def empty_sequence_if_attr_false

(itr: Iterable[Any], attr_owner: Any, attr_name: str) -> Iterable[Any]

View Source on GitHub

Returns itr if attr_owner has the attribute attr_name and it boolean casts to True. Returns an empty sequence otherwise.

Particularly useful for optionally inserting delimiters into a sequence depending on a TokenizerElement attribute.

Parameters:

Returns:

def flatten

(it: Iterable[Any], levels_to_flatten: int | None = None) -> Generator

View Source on GitHub

Flattens an arbitrarily nested iterable. Flattens all iterable data types except for str and bytes.

Returns

Generator over the flattened sequence.

Parameters

def list_split

(lst: list, val: Any) -> list[list]

View Source on GitHub

split a list into sublists by val. similar to "a_b_c".split("_")

>>> list_split([1,2,3,0,4,5,0,6], 0)
[[1, 2, 3], [4, 5], [6]]
>>> list_split([0,1,2,3], 0)
[[], [1, 2, 3]]
>>> list_split([1,2,3], 0)
[[1, 2, 3]]
>>> list_split([], 0)
[[]]

def list_join

(lst: list, factory: Callable) -> list

View Source on GitHub

add a new instance of factory() between each element of lst

>>> list_join([1,2,3], lambda : 0)
[1, 0, 2, 0, 3]
>>> list_join([1,2,3], lambda: [time.sleep(0.1), time.time()][1])
[1, 1600000000.0, 2, 1600000000.1, 3]

def apply_mapping

(
    mapping: Mapping[~_AM_K, ~_AM_V],
    iter: Iterable[~_AM_K],
    when_missing: Literal['except', 'skip', 'include'] = 'skip'
) -> list[typing.Union[~_AM_K, ~_AM_V]]

View Source on GitHub

Given an iterable and a mapping, apply the mapping to the iterable with certain options

Gotcha: an invalid when_missing value is accepted silently; an error is only raised once a missing key is actually encountered.

Note: you can use this with muutils.kappa.Kappa if you want to pass a function instead of a dict

Parameters:

Returns:

return type is one of:

- `list[_AM_V]` if `when_missing` is `"skip"` or `"except"`
- `list[Union[_AM_K, _AM_V]]` if `when_missing` is `"include"`

Raises:

def apply_mapping_chain

(
    mapping: Mapping[~_AM_K, Iterable[~_AM_V]],
    iter: Iterable[~_AM_K],
    when_missing: Literal['except', 'skip', 'include'] = 'skip'
) -> list[typing.Union[~_AM_K, ~_AM_V]]

View Source on GitHub

Given an iterable and a mapping, chain the mappings together

Gotcha: an invalid when_missing value is accepted silently; an error is only raised once a missing key is actually encountered.

Note: you can use this with muutils.kappa.Kappa if you want to pass a function instead of a dict

Parameters:

Returns:

return type is one of:

- `list[_AM_V]` if `when_missing` is `"skip"` or `"except"`
- `list[Union[_AM_K, _AM_V]]` if `when_missing` is `"include"`

Raises:


API Documentation

View Source on GitHub

muutils.misc.string

View Source on GitHub

def sanitize_name

(
    name: str | None,
    additional_allowed_chars: str = '',
    replace_invalid: str = '',
    when_none: str | None = '_None_',
    leading_digit_prefix: str = ''
) -> str

View Source on GitHub

sanitize a string, leaving only alphanumerics and additional_allowed_chars

Parameters:

Returns:
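A rough sketch of the documented behavior (the real function's allowed-character rules may differ; the ascii-alnum check here is an assumption):

```python
import string

def sanitize_name(
    name,
    additional_allowed_chars: str = "",
    replace_invalid: str = "",
    when_none="_None_",
    leading_digit_prefix: str = "",
) -> str:
    # None gets the when_none placeholder (or an error if that is also None)
    if name is None:
        if when_none is None:
            raise ValueError("name is None and when_none is None")
        return when_none
    # keep alphanumerics plus explicitly allowed chars, replace everything else
    allowed = set(string.ascii_letters + string.digits) | set(additional_allowed_chars)
    out = "".join(c if c in allowed else replace_invalid for c in name)
    if out and out[0].isdigit():
        out = leading_digit_prefix + out
    return out

print(sanitize_name("my file (1).txt", additional_allowed_chars="._-"))  # myfile1.txt
```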

def sanitize_fname

(fname: str | None, **kwargs) -> str

View Source on GitHub

sanitize a filename to posix standards

def sanitize_identifier

(fname: str | None, **kwargs) -> str

View Source on GitHub

sanitize an identifier (variable or function name)

def dict_to_filename

(
    data: dict,
    format_str: str = '{key}_{val}',
    separator: str = '.',
    max_length: int = 255
)

View Source on GitHub
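The intent can be sketched as follows; this is a guess at the behavior from the signature alone (in particular, how the real function enforces max_length, e.g. by hashing, is an assumption left out here):

```python
def dict_to_filename(
    data: dict,
    format_str: str = "{key}_{val}",
    separator: str = ".",
    max_length: int = 255,
) -> str:
    # join one formatted "{key}_{val}" chunk per dict entry;
    # naively truncate to max_length (the real fallback may differ)
    name = separator.join(format_str.format(key=k, val=v) for k, v in data.items())
    return name[:max_length]

print(dict_to_filename({"lr": 0.1, "layers": 3}))  # lr_0.1.layers_3
```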

def dynamic_docstring

(**doc_params)

View Source on GitHub


Contents

miscellaneous utilities for ML pipelines

API Documentation

View Source on GitHub

muutils.mlutils

miscellaneous utilities for ML pipelines

View Source on GitHub

def get_device

(device: Union[str, torch.device, NoneType] = None) -> torch.device

View Source on GitHub

Get the torch.device instance on which torch.Tensors should be allocated.

def set_reproducibility

(seed: int = 42)

View Source on GitHub

Improve model reproducibility. See https://github.com/NVIDIA/framework-determinism for more information.

Deterministic operations tend to have worse performance than nondeterministic operations, so this method trades off performance for reproducibility. Set use_deterministic_algorithms to False to trade reproducibility back for performance.

def chunks

(it, chunk_size)

View Source on GitHub

Yield successive chunks from an iterator.
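A self-contained sketch of this kind of chunking helper, using itertools.islice so it works on any iterator, not just sequences:

```python
from itertools import islice

def chunks(it, chunk_size):
    # yield successive lists of up to chunk_size items from any iterator
    it = iter(it)
    while True:
        chunk = list(islice(it, chunk_size))
        if not chunk:
            return
        yield chunk

print(list(chunks(range(7), 3)))  # [[0, 1, 2], [3, 4, 5], [6]]
```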

def get_checkpoint_paths_for_run

(
    run_path: pathlib.Path,
    extension: Literal['pt', 'zanj'],
    checkpoints_format: str = 'checkpoints/model.iter_*.{extension}'
) -> list[tuple[int, pathlib.Path]]

View Source on GitHub

get checkpoints of the format from the run_path

note that checkpoints_format should contain a glob pattern with:

- an unresolved "{extension}" format term for the extension
- a wildcard for the iteration number

def register_method

(
    method_dict: dict[str, typing.Callable[..., typing.Any]],
    custom_name: Optional[str] = None
) -> Callable[[~F], ~F]

View Source on GitHub

Decorator to add a method to the method_dict
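The registration pattern is simple enough to sketch in a few lines (illustrative, not the library source):

```python
from typing import Any, Callable, Dict, Optional

def register_method(
    method_dict: Dict[str, Callable[..., Any]],
    custom_name: Optional[str] = None,
):
    # decorator factory: store the function in method_dict under its own
    # name (or custom_name), then return the function unchanged
    def decorator(fn):
        method_dict[custom_name if custom_name is not None else fn.__name__] = fn
        return fn
    return decorator

METHODS: Dict[str, Callable[..., Any]] = {}

@register_method(METHODS)
def greet() -> str:
    return "hi"

print(sorted(METHODS))  # ['greet']
```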

def pprint_summary

(summary: dict)

View Source on GitHub


Contents

utilities for working with notebooks

Submodules

API Documentation

View Source on GitHub

muutils.nbutils

utilities for working with notebooks

View Source on GitHub

def mm

(graph)

View Source on GitHub

for plotting mermaid.js diagrams

docs for muutils v0.8.7

Contents

shared utilities for setting up a notebook

API Documentation

View Source on GitHub

muutils.nbutils.configure_notebook

shared utilities for setting up a notebook

View Source on GitHub

class PlotlyNotInstalledWarning(builtins.UserWarning):

View Source on GitHub

Base class for warnings generated by user code.

Inherited Members

class UnknownFigureFormatWarning(builtins.UserWarning):

View Source on GitHub

Base class for warnings generated by user code.

Inherited Members

def universal_savefig

(fname: str, fmt: str | None = None) -> None

View Source on GitHub

def setup_plots

(
    plot_mode: Literal['ignore', 'inline', 'widget', 'save'] = 'inline',
    fig_output_fmt: str | None = 'pdf',
    fig_numbered_fname: str = 'figure-{num}',
    fig_config: dict | None = None,
    fig_basepath: str | None = None,
    close_after_plotshow: bool = False
) -> None

View Source on GitHub

Set up plot saving/rendering options

def configure_notebook

(
    *args,
    seed: int = 42,
    device: Any = None,
    dark_mode: bool = True,
    plot_mode: Literal['ignore', 'inline', 'widget', 'save'] = 'inline',
    fig_output_fmt: str | None = 'pdf',
    fig_numbered_fname: str = 'figure-{num}',
    fig_config: dict | None = None,
    fig_basepath: str | None = None,
    close_after_plotshow: bool = False
) -> torch.device | None

View Source on GitHub

Shared Jupyter notebook setup steps

Parameters:

Returns:

def plotshow

(
    fname: str | None = None,
    plot_mode: Optional[Literal['ignore', 'inline', 'widget', 'save']] = None,
    fmt: str | None = None
)

View Source on GitHub

Show the active plot, depending on global configs


Contents

fast conversion of Jupyter Notebooks to scripts, with some basic and hacky filtering and formatting.

API Documentation

View Source on GitHub

muutils.nbutils.convert_ipynb_to_script

fast conversion of Jupyter Notebooks to scripts, with some basic and hacky filtering and formatting.

View Source on GitHub

def disable_plots_in_script

(script_lines: list[str]) -> list[str]

View Source on GitHub

Disable plots in a script by adding cursed things after the import statements

def convert_ipynb

(
    notebook: dict,
    strip_md_cells: bool = False,
    header_comment: str = '#%%',
    disable_plots: bool = False,
    filter_out_lines: Union[str, Sequence[str]] = ('%', '!')
) -> str

View Source on GitHub

Convert Jupyter Notebook to a script, doing some basic filtering and formatting.

Arguments

- `notebook: dict`: Jupyter Notebook loaded as json.
- `strip_md_cells: bool = False`: Remove markdown cells from the output script.
- `header_comment: str = r'#%%'`: Comment string to separate cells in the output script.
- `disable_plots: bool = False`: Disable plots in the output script.
- `filter_out_lines: str|typing.Sequence[str] = ('%', '!')`: comment out lines starting with these strings (in code blocks).
    if a string is passed, it is split into characters and each character is treated as a separate filter.

Returns

- `str`: Converted script.
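A stdlib-only sketch of the documented behavior (not the library implementation): markdown cells become comments or are dropped, and code lines starting with a filter prefix are commented out.

```python
def convert_ipynb(
    notebook: dict,
    strip_md_cells: bool = False,
    header_comment: str = "#%%",
    filter_out_lines=("%", "!"),
) -> str:
    cells_out = []
    for cell in notebook["cells"]:
        lines = "".join(cell["source"]).splitlines()
        if cell["cell_type"] == "markdown":
            if strip_md_cells:
                continue
            lines = ["# " + ln for ln in lines]
        else:
            # comment out magics / shell commands, keep everything else
            lines = [
                ("# " + ln) if ln.lstrip().startswith(tuple(filter_out_lines)) else ln
                for ln in lines
            ]
        cells_out.append("\n".join([header_comment] + lines))
    return "\n\n".join(cells_out) + "\n"

nb = {"cells": [
    {"cell_type": "markdown", "source": ["# Title"]},
    {"cell_type": "code", "source": ["%load_ext autoreload\n", "x = 1\n"]},
]}
print(convert_ipynb(nb))
```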

def process_file

(
    in_file: str,
    out_file: str | None = None,
    strip_md_cells: bool = False,
    header_comment: str = '#%%',
    disable_plots: bool = False,
    filter_out_lines: Union[str, Sequence[str]] = ('%', '!')
)

View Source on GitHub

def process_dir

(
    input_dir: Union[str, pathlib.Path],
    output_dir: Union[str, pathlib.Path],
    strip_md_cells: bool = False,
    header_comment: str = '#%%',
    disable_plots: bool = False,
    filter_out_lines: Union[str, Sequence[str]] = ('%', '!')
)

View Source on GitHub

Convert all Jupyter Notebooks in a directory to scripts.

Arguments

- `input_dir: str`: Input directory.
- `output_dir: str`: Output directory.
- `strip_md_cells: bool = False`: Remove markdown cells from the output script.
- `header_comment: str = r'#%%'`: Comment string to separate cells in the output script.
- `disable_plots: bool = False`: Disable plots in the output script.
- `filter_out_lines: str|typing.Sequence[str] = ('%', '!')`: comment out lines starting with these strings (in code blocks).
    if a string is passed, it is split into characters and each character is treated as a separate filter.


Contents

display mermaid.js diagrams in jupyter notebooks via the mermaid.ink/img service

API Documentation

View Source on GitHub

muutils.nbutils.mermaid

display mermaid.js diagrams in jupyter notebooks via the mermaid.ink/img service

View Source on GitHub

def mm

(graph)

View Source on GitHub

for plotting mermaid.js diagrams
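The core trick can be sketched like this: mermaid.ink serves a rendered image of a base64-encoded diagram, so building the URL is a one-liner (in a notebook, the real function would wrap this URL in an IPython display object; this sketch just returns the URL string):

```python
import base64

def mermaid_ink_url(graph: str) -> str:
    # mermaid.ink/img/<base64-of-diagram> returns a rendered PNG
    encoded = base64.b64encode(graph.encode("utf-8")).decode("ascii")
    return "https://mermaid.ink/img/" + encoded

print(mermaid_ink_url("graph LR; A-->B"))
```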


Contents

quickly print a sympy expression in latex

API Documentation

View Source on GitHub

muutils.nbutils.print_tex

quickly print a sympy expression in latex

View Source on GitHub

def print_tex

(
    expr: sympy.core.expr.Expr,
    name: str | None = None,
    plain: bool = False,
    rendered: bool = True
)

View Source on GitHub

function for easily rendering a sympy expression in latex


Contents

turn a folder of notebooks into scripts, run them, and make sure they work.

made to be called as

python -m muutils.nbutils.run_notebook_tests --notebooks-dir <notebooks_dir> --converted-notebooks-temp-dir <converted_notebooks_temp_dir>

API Documentation

View Source on GitHub

muutils.nbutils.run_notebook_tests

turn a folder of notebooks into scripts, run them, and make sure they work.

made to be called as

python -m muutils.nbutils.run_notebook_tests --notebooks-dir <notebooks_dir> --converted-notebooks-temp-dir <converted_notebooks_temp_dir>

View Source on GitHub

class NotebookTestError(builtins.Exception):

View Source on GitHub

Common base class for all non-exit exceptions.

Inherited Members

def run_notebook_tests

(
    notebooks_dir: pathlib.Path,
    converted_notebooks_temp_dir: pathlib.Path,
    CI_output_suffix: str = '.CI-output.txt',
    run_python_cmd: Optional[str] = None,
    run_python_cmd_fmt: str = '{python_tool} run python',
    python_tool: str = 'poetry',
    exit_on_first_fail: bool = False
)

View Source on GitHub

Run converted Jupyter notebooks as Python scripts and verify they execute successfully.

Takes a directory of notebooks and their corresponding converted Python scripts, executes each script, and captures the output. Failures are collected and reported, with optional early exit on first failure.

Parameters:

Returns:

Modifies:

Raises:

Usage:

>>> run_notebook_tests(
...     notebooks_dir=Path("notebooks"),
...     converted_notebooks_temp_dir=Path("temp/converted"),
...     python_tool="poetry"
... )
### testing notebooks in 'notebooks'
### reading converted notebooks from 'temp/converted'
Running 1/2: temp/converted/notebook1.py
    Output in temp/converted/notebook1.CI-output.txt
    {SUCCESS_STR} Run completed with return code 0


API Documentation

View Source on GitHub

muutils.parallel

View Source on GitHub

class ProgressBarFunction(typing.Protocol):

View Source on GitHub

a protocol for a progress bar function

ProgressBarFunction

(*args, **kwargs)

View Source on GitHub

def spinner_fn_wrap

(x: Iterable, **kwargs) -> List

View Source on GitHub

spinner wrapper

def map_kwargs_for_tqdm

(kwargs: dict) -> dict

View Source on GitHub

map kwargs for tqdm; we can't wrap it directly, because the progress bar disappears

def no_progress_fn_wrap

(x: Iterable, **kwargs) -> Iterable

View Source on GitHub

fallback to no progress bar

def set_up_progress_bar_fn

(
    pbar: Union[muutils.parallel.ProgressBarFunction, Literal['tqdm', 'spinner', 'none', None]],
    pbar_kwargs: Optional[Dict[str, Any]] = None,
    **extra_kwargs
) -> Tuple[muutils.parallel.ProgressBarFunction, dict]

View Source on GitHub

set up the progress bar function and its kwargs

Parameters:

Returns:

Raises:

def run_maybe_parallel

(
    func: Callable[[~InputType], ~OutputType],
    iterable: Iterable[~InputType],
    parallel: Union[bool, int],
    pbar_kwargs: Optional[Dict[str, Any]] = None,
    chunksize: Optional[int] = None,
    keep_ordered: bool = True,
    use_multiprocess: bool = False,
    pbar: Union[muutils.parallel.ProgressBarFunction, Literal['tqdm', 'spinner', 'none', None]] = 'tqdm'
) -> List[~OutputType]

View Source on GitHub

a function to make it easier to sometimes parallelize an operation

the maximum number of processes is min(len(iterable), multiprocessing.cpu_count())

Parameters:

Returns:

Raises:
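The control flow can be sketched as follows. Note the deliberate simplifications: the real function uses process pools (multiprocessing or multiprocess) and progress bars; this self-contained sketch uses threads and omits the pbar handling.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterable, List, TypeVar, Union

InputType = TypeVar("InputType")
OutputType = TypeVar("OutputType")

def run_maybe_parallel(
    func: Callable[[InputType], OutputType],
    iterable: Iterable[InputType],
    parallel: Union[bool, int],
) -> List[OutputType]:
    # parallel=False -> plain loop; parallel=True -> one worker per item;
    # parallel=<int> -> that many workers. map() preserves input order.
    items = list(iterable)
    if parallel is False or not items:
        return [func(x) for x in items]
    n_workers = len(items) if parallel is True else int(parallel)
    with ThreadPoolExecutor(max_workers=max(1, n_workers)) as pool:
        return list(pool.map(func, items))

print(run_maybe_parallel(lambda x: x * x, range(5), parallel=2))  # [0, 1, 4, 9, 16]
```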


Contents

decorator spinner_decorator and context manager SpinnerContext to display a spinner using the base Spinner class while some code is running.

API Documentation

View Source on GitHub

muutils.spinner

decorator spinner_decorator and context manager SpinnerContext to display a spinner using the base Spinner class while some code is running.

View Source on GitHub

Define a generic type for the decorated function

class SpinnerConfig:

View Source on GitHub

SpinnerConfig

(working: List[str] = <factory>, success: str = '✔️', fail: str = '❌')

def is_ascii

(self) -> bool

View Source on GitHub

whether all characters are ascii

def eq_lens

(self) -> bool

View Source on GitHub

whether all working characters are the same length

def is_valid

(self) -> bool

View Source on GitHub

whether the spinner config is valid

def from_any

(
    cls,
    arg: Union[str, List[str], muutils.spinner.SpinnerConfig, dict]
) -> muutils.spinner.SpinnerConfig

View Source on GitHub

class Spinner:

View Source on GitHub

displays a spinner, and optionally elapsed time and a mutable value while a function is running.

Parameters:

Deprecated Parameters:

Methods:

Usage:

As a context manager:

with SpinnerContext() as sp:
    for i in range(1):
        time.sleep(0.1)
        sp.update_value(f"Step {i+1}")

As a decorator:

@spinner_decorator(mutable_kwarg_key="update_status")
def long_running_function(update_status):
    for i in range(1):
        time.sleep(0.1)
        update_status(f"Step {i+1}")
    return "Function completed"

Spinner

(
    *args,
    config: Union[str, List[str], muutils.spinner.SpinnerConfig, dict] = 'default',
    update_interval: float = 0.1,
    initial_value: str = '',
    message: str = '',
    format_string: str = '\r{spinner} ({elapsed_time:.2f}s) {message}{value}',
    output_stream: <class 'TextIO'> = <_io.StringIO object>,
    format_string_when_updated: Union[str, bool] = False,
    spinner_chars: Union[str, Sequence[str], NoneType] = None,
    spinner_complete: Optional[str] = None,
    **kwargs: Any
)

View Source on GitHub

format string to use when the value is updated

for measuring elapsed time

to stop the spinner

the thread running the spinner

whether the value has been updated since the last display

width of the terminal, for padding with spaces

def spin

(self) -> None

View Source on GitHub

Function to run in a separate thread, displaying the spinner and optional information

def update_value

(self, value: Any) -> None

View Source on GitHub

Update the current value displayed by the spinner

def start

(self) -> None

View Source on GitHub

Start the spinner

def stop

(self, failed: bool = False) -> None

View Source on GitHub

Stop the spinner

class NoOpContextManager(typing.ContextManager):

View Source on GitHub

A context manager that does nothing.

NoOpContextManager

(*args, **kwargs)

View Source on GitHub

class SpinnerContext(Spinner, typing.ContextManager):

View Source on GitHub

displays a spinner, and optionally elapsed time and a mutable value while a function is running.

Parameters:

Deprecated Parameters:

Methods:

Usage:

As a context manager:

with SpinnerContext() as sp:
    for i in range(1):
        time.sleep(0.1)
        sp.update_value(f"Step {i+1}")

As a decorator:

@spinner_decorator(mutable_kwarg_key="update_status")
def long_running_function(update_status):
    for i in range(1):
        time.sleep(0.1)
        update_status(f"Step {i+1}")
    return "Function completed"

Inherited Members

def spinner_decorator

(
    *args,
    config: Union[str, List[str], muutils.spinner.SpinnerConfig, dict] = 'default',
    update_interval: float = 0.1,
    initial_value: str = '',
    message: str = '',
    format_string: str = '{spinner} ({elapsed_time:.2f}s) {message}{value}',
    output_stream: <class 'TextIO'> = <_io.StringIO object>,
    mutable_kwarg_key: Optional[str] = None,
    spinner_chars: Union[str, Sequence[str], NoneType] = None,
    spinner_complete: Optional[str] = None,
    **kwargs
) -> Callable[[~DecoratedFunction], ~DecoratedFunction]

View Source on GitHub

displays a spinner, and optionally elapsed time and a mutable value while a function is running.

Parameters:

Deprecated Parameters:

Methods:

Usage:

As a context manager:

with SpinnerContext() as sp:
    for i in range(1):
        time.sleep(0.1)
        sp.update_value(f"Step {i+1}")

As a decorator:

@spinner_decorator(mutable_kwarg_key="update_status")
def long_running_function(update_status):
    for i in range(1):
        time.sleep(0.1)
        update_status(f"Step {i+1}")
    return "Function completed"


Contents

StatCounter class for counting and calculating statistics on numbers

cleaner and more efficient than just using a Counter or array

API Documentation

View Source on GitHub

muutils.statcounter

StatCounter class for counting and calculating statistics on numbers

cleaner and more efficient than just using a Counter or array

View Source on GitHub

def universal_flatten

(
    arr: Union[Sequence[Union[float, int, Sequence[Union[float, int, ForwardRef('NumericSequence')]]]], float, int],
    require_rectangular: bool = True
) -> Sequence[Union[float, int, ForwardRef('NumericSequence')]]

View Source on GitHub

flattens any iterable

class StatCounter(collections.Counter):

View Source on GitHub

Counter, but with some stat calculation methods which assume the keys are numerical

works best when the keys are ints
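The core idea can be sketched with a plain Counter: statistics fall out of the (value, count) pairs directly, with no need to materialize Counter.elements(). The nearest-rank percentile rule below is a simplification (the real class may interpolate between values):

```python
from collections import Counter

counts = Counter([1, 1, 2, 3, 3, 3])
total = sum(counts.values())

# mean from (value, count) pairs
mean = sum(value * count for value, count in counts.items()) / total

def percentile(counts: Counter, p: float) -> float:
    # walk the sorted keys, accumulating counts, until we pass rank p*(n-1)
    target = p * (sum(counts.values()) - 1)
    cumulative = 0
    for value in sorted(counts):
        cumulative += counts[value]
        if cumulative > target:
            return value
    raise ValueError("empty counter")

print(round(mean, 3))           # 2.167
print(percentile(counts, 0.5))  # 2
```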

def validate

(self) -> bool

View Source on GitHub

validate the counter as being all floats or ints

def min

(self)

View Source on GitHub

minimum value

def max

(self)

View Source on GitHub

maximum value

def total

(self)

View Source on GitHub

Sum of the counts

View Source on GitHub

return the keys

def percentile

(self, p: float)

View Source on GitHub

return the value at the given percentile

this could be log time if we did binary search, but that would be a lot of added complexity

def median

(self) -> float

View Source on GitHub

def mean

(self) -> float

View Source on GitHub

return the mean of the values

def mode

(self) -> float

View Source on GitHub

def std

(self) -> float

View Source on GitHub

return the standard deviation of the values

def summary

(
    self,
    typecast: Callable = <function StatCounter.<lambda>>,
    *,
    extra_percentiles: Optional[list[float]] = None
) -> dict[str, typing.Union[float, int]]

View Source on GitHub

return a summary of the stats, without the raw data. human readable and small

def serialize

(
    self,
    typecast: Callable = <function StatCounter.<lambda>>,
    *,
    extra_percentiles: Optional[list[float]] = None
) -> dict

View Source on GitHub

return a json-serializable version of the counter

includes both the output of summary and the raw data:

{
    "StatCounter": { <keys, values from raw data> },
    "summary": self.summary(typecast, extra_percentiles=extra_percentiles),
}


def load

(cls, data: dict) -> muutils.statcounter.StatCounter

View Source on GitHub

load from the output of StatCounter.serialize

def from_list_arrays

(
    cls,
    arr,
    map_func: Callable = <class 'float'>
) -> muutils.statcounter.StatCounter

View Source on GitHub

calls map_func on each element of universal_flatten(arr)

Inherited Members


Contents

utilities for getting information about the system, see SysInfo class

API Documentation

View Source on GitHub

muutils.sysinfo

utilities for getting information about the system, see SysInfo class

View Source on GitHub

class SysInfo:

View Source on GitHub

getters for various information about the system

def python

() -> dict

View Source on GitHub

details about python version

def pip

() -> dict

View Source on GitHub

installed packages info

def pytorch

() -> dict

View Source on GitHub

pytorch and cuda information

def platform

() -> dict

View Source on GitHub

def git_info

(with_log: bool = False) -> dict

View Source on GitHub

def get_all

(
    cls,
    include: Optional[tuple[str, ...]] = None,
    exclude: tuple[str, ...] = ()
) -> dict

View Source on GitHub


API Documentation

View Source on GitHub

muutils.tensor_info

View Source on GitHub

Symbols for different formats

characters for sparklines in different formats

def array_info

(A: Any, hist_bins: int = 5) -> Dict[str, Any]

View Source on GitHub

Extract statistical information from an array-like object.

Parameters:

Returns:

def generate_sparkline

(
    histogram: numpy.ndarray,
    format: Literal['unicode', 'latex', 'ascii'] = 'unicode',
    log_y: bool = False
) -> str

View Source on GitHub

Generate a sparkline visualization of the histogram.

Parameters:

Returns:

def array_summary

(
    array,
    fmt: Literal['unicode', 'latex', 'ascii'] = <muutils.tensor_info._UseDefaultType object>,
    precision: int = <muutils.tensor_info._UseDefaultType object>,
    stats: bool = <muutils.tensor_info._UseDefaultType object>,
    shape: bool = <muutils.tensor_info._UseDefaultType object>,
    dtype: bool = <muutils.tensor_info._UseDefaultType object>,
    device: bool = <muutils.tensor_info._UseDefaultType object>,
    requires_grad: bool = <muutils.tensor_info._UseDefaultType object>,
    sparkline: bool = <muutils.tensor_info._UseDefaultType object>,
    sparkline_bins: int = <muutils.tensor_info._UseDefaultType object>,
    sparkline_logy: bool = <muutils.tensor_info._UseDefaultType object>,
    colored: bool = <muutils.tensor_info._UseDefaultType object>,
    eq_char: str = <muutils.tensor_info._UseDefaultType object>,
    as_list: bool = <muutils.tensor_info._UseDefaultType object>
) -> Union[str, List[str]]

View Source on GitHub

Format array information into a readable summary.

Parameters:

Returns:


Contents

utilities for working with tensors and arrays.

notably:

API Documentation

View Source on GitHub

muutils.tensor_utils

utilities for working with tensors and arrays.

notably:

View Source on GitHub

dict mapping python, numpy, and torch types to jaxtyping types

def jaxtype_factory

(
    name: str,
    array_type: type,
    default_jax_dtype=<class 'jaxtyping.Float'>,
    legacy_mode: Union[muutils.errormode.ErrorMode, str] = ErrorMode.Warn
) -> type

View Source on GitHub

usage:

ATensor = jaxtype_factory("ATensor", torch.Tensor, jaxtyping.Float)
x: ATensor["dim1 dim2", np.float32]

def numpy_to_torch_dtype

(dtype: Union[numpy.dtype, torch.dtype]) -> torch.dtype

View Source on GitHub

convert numpy dtype to torch dtype

list of all the python, numpy, and torch numerical types I could think of

mapping from string representations of types to their type

mapping from string representations of types to specifically torch types

def pad_tensor

(
    tensor: jaxtyping.Shaped[Tensor, 'dim1'],
    padded_length: int,
    pad_value: float = 0.0,
    rpad: bool = False
) -> jaxtyping.Shaped[Tensor, 'padded_length']

View Source on GitHub

pad a 1-d tensor on the left with pad_value to length padded_length

set rpad = True to pad on the right instead
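The padding semantics are easy to see on plain lists (a sketch only; the real functions operate on torch tensors / numpy arrays):

```python
def pad_list(values: list, padded_length: int, pad_value: float = 0.0, rpad: bool = False) -> list:
    # left-pad by default; right-pad when rpad=True; never truncate
    padding = [pad_value] * max(0, padded_length - len(values))
    return values + padding if rpad else padding + values

print(pad_list([1, 2, 3], 5))             # [0.0, 0.0, 1, 2, 3]
print(pad_list([1, 2, 3], 5, rpad=True))  # [1, 2, 3, 0.0, 0.0]
```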

def lpad_tensor

(
    tensor: torch.Tensor,
    padded_length: int,
    pad_value: float = 0.0
) -> torch.Tensor

View Source on GitHub

pad a 1-d tensor on the left with pad_value to length padded_length

def rpad_tensor

(
    tensor: torch.Tensor,
    pad_length: int,
    pad_value: float = 0.0
) -> torch.Tensor

View Source on GitHub

pad a 1-d tensor on the right with pad_value to length pad_length

def pad_array

(
    array: jaxtyping.Shaped[ndarray, 'dim1'],
    padded_length: int,
    pad_value: float = 0.0,
    rpad: bool = False
) -> jaxtyping.Shaped[ndarray, 'padded_length']

View Source on GitHub

pad a 1-d array on the left with pad_value to length padded_length

set rpad = True to pad on the right instead

def lpad_array

(
    array: numpy.ndarray,
    padded_length: int,
    pad_value: float = 0.0
) -> numpy.ndarray

View Source on GitHub

pad a 1-d array on the left with pad_value to length padded_length

def rpad_array

(
    array: numpy.ndarray,
    pad_length: int,
    pad_value: float = 0.0
) -> numpy.ndarray

View Source on GitHub

pad a 1-d array on the right with pad_value to length pad_length

def get_dict_shapes

(d: dict[str, torch.Tensor]) -> dict[str, tuple[int, ...]]

View Source on GitHub

given a state dict or cache dict, compute the shapes and put them in a nested dict

def string_dict_shapes

(d: dict[str, torch.Tensor]) -> str

View Source on GitHub

printable version of get_dict_shapes

class StateDictCompareError(builtins.AssertionError):

View Source on GitHub

raised when state dicts don’t match

Inherited Members

class StateDictKeysError(StateDictCompareError):

View Source on GitHub

raised when state dict keys don’t match

Inherited Members

class StateDictShapeError(StateDictCompareError):

View Source on GitHub

raised when state dict shapes don’t match

Inherited Members

class StateDictValueError(StateDictCompareError):

View Source on GitHub

raised when state dict values don’t match

Inherited Members

def compare_state_dicts

(
    d1: dict,
    d2: dict,
    rtol: float = 1e-05,
    atol: float = 1e-08,
    verbose: bool = True
) -> None

View Source on GitHub

compare two dicts of tensors

Parameters:

Raises:


Contents

timeit_fancy is just a fancier version of timeit with more options

API Documentation

View Source on GitHub

muutils.timeit_fancy

timeit_fancy is just a fancier version of timeit with more options

View Source on GitHub

class FancyTimeitResult(typing.NamedTuple):

View Source on GitHub

return type of timeit_fancy

FancyTimeitResult

(
    timings: ForwardRef('StatCounter'),
    return_value: ForwardRef('T'),
    profile: ForwardRef('Union[pstats.Stats, None]')
)

Create new instance of FancyTimeitResult(timings, return_value, profile)

Alias for field number 0

Alias for field number 1

Alias for field number 2

Inherited Members

def timeit_fancy

(
    cmd: Union[Callable[[], ~T], str],
    setup: Union[str, Callable[[], Any]] = <function <lambda>>,
    repeats: int = 5,
    namespace: Optional[dict[str, Any]] = None,
    get_return: bool = True,
    do_profiling: bool = False
) -> muutils.timeit_fancy.FancyTimeitResult

View Source on GitHub

Wrapper for timeit to get the fastest run of a callable with more customization options.

Approximates the functionality of the %timeit magic or command line interface in a Python callable.

Parameters

Returns

FancyTimeitResult, which is a NamedTuple with the following fields:


Contents

experimental utility for validating types in python, see validate_type

API Documentation

View Source on GitHub

muutils.validate_type

experimental utility for validating types in python, see validate_type

View Source on GitHub

class IncorrectTypeException(builtins.TypeError):

View Source on GitHub

Inappropriate argument type.

Inherited Members

class TypeHintNotImplementedError(builtins.NotImplementedError):

View Source on GitHub

Method or function hasn’t been implemented yet.

Inherited Members

class InvalidGenericAliasError(builtins.TypeError):

View Source on GitHub

Inappropriate argument type.

Inherited Members

def validate_type

(value: Any, expected_type: Any, do_except: bool = False) -> bool

View Source on GitHub

Validate that a value is of the expected_type

Parameters

Returns

Raises

use typeguard for a more robust solution: https://github.com/agronholm/typeguard
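The approach can be sketched with typing.get_origin / typing.get_args; this handles only plain types, Union/Optional, and list[T], whereas the real function covers many more generic aliases:

```python
import typing

def validate_type_sketch(value, expected_type) -> bool:
    # Any matches everything
    if expected_type is typing.Any:
        return True
    origin = typing.get_origin(expected_type)
    args = typing.get_args(expected_type)
    # plain (non-generic) type: fall back to isinstance
    if origin is None:
        return isinstance(value, expected_type)
    # Union / Optional: any branch may match
    if origin is typing.Union:
        return any(validate_type_sketch(value, a) for a in args)
    # list[T]: check the container, then every element
    if origin is list:
        return isinstance(value, list) and all(validate_type_sketch(v, args[0]) for v in value)
    raise NotImplementedError(f"unsupported type hint: {expected_type}")

print(validate_type_sketch([1, 2], list[int]))           # True
print(validate_type_sketch([1, "x"], list[int]))         # False
print(validate_type_sketch(None, typing.Optional[int]))  # True
```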

def get_fn_allowed_kwargs

(fn: Callable) -> Set[str]

View Source on GitHub

Get the allowed kwargs for a function, raising an exception if the signature cannot be determined.
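A sketch of this using inspect.signature, which raises ValueError when no signature can be determined (e.g. for some builtins):

```python
import inspect
from typing import Callable, Set

def get_fn_allowed_kwargs(fn: Callable) -> Set[str]:
    # collect every parameter name that can be passed by keyword;
    # *args and **kwargs themselves are excluded
    sig = inspect.signature(fn)
    return {
        name
        for name, param in sig.parameters.items()
        if param.kind in (param.POSITIONAL_OR_KEYWORD, param.KEYWORD_ONLY)
    }

def example(a, b=1, *args, c=2, **kwargs):
    return a

print(sorted(get_fn_allowed_kwargs(example)))  # ['a', 'b', 'c']
```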