dragon.ai.inference.config.HardwareConfig

class HardwareConfig

Bases: object

Hardware allocation and resource configuration.

__init__(num_nodes: int = -1, num_gpus: int = -1, num_inf_workers_per_cpu: int = 4, node_offset: int = 0) → None

Methods

__init__([num_nodes, num_gpus, ...])

validate(all_nodes)

Validate hardware configuration against available resources.

Attributes

node_offset

num_gpus

num_inf_workers_per_cpu

num_nodes

num_nodes: int = -1
num_gpus: int = -1
num_inf_workers_per_cpu: int = 4
node_offset: int = 0
validate(all_nodes: dict) → None

Validate hardware configuration against available resources.

Parameters:

all_nodes (dict) – Dictionary of all available nodes in the cluster. Keys are hostnames; values are dragon.native.machine.Node objects.

Raises:

ValueError – If any configuration parameter is invalid or exceeds available resources.
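The following is a minimal, hedged sketch of how a configuration with these fields might be checked against a node dictionary. It does not import Dragon and is not the library's implementation: `HardwareConfigSketch` and `FakeNode` are hypothetical stand-ins, and the specific rules (that `-1` means "use everything available", that `num_gpus` is a per-node count, and that a node exposes a `num_gpus` attribute) are assumptions made for illustration only.

```python
from dataclasses import dataclass


@dataclass
class FakeNode:
    """Hypothetical stand-in for a node object; only a GPU count is modeled."""
    num_gpus: int


@dataclass
class HardwareConfigSketch:
    """Stand-in mirroring the documented fields and defaults (not the Dragon class)."""
    num_nodes: int = -1
    num_gpus: int = -1
    num_inf_workers_per_cpu: int = 4
    node_offset: int = 0

    def validate(self, all_nodes: dict) -> None:
        """Raise ValueError if the request cannot be met by all_nodes."""
        total = len(all_nodes)

        if self.num_inf_workers_per_cpu < 1:
            raise ValueError("num_inf_workers_per_cpu must be at least 1")
        if self.node_offset < 0 or self.node_offset >= total:
            raise ValueError("node_offset is outside the available nodes")
        if self.num_nodes != -1 and self.num_nodes < 1:
            raise ValueError("num_nodes must be -1 or a positive integer")

        # Assumption: -1 means "all nodes from node_offset onward".
        requested = total - self.node_offset if self.num_nodes == -1 else self.num_nodes
        if self.node_offset + requested > total:
            raise ValueError("num_nodes plus node_offset exceeds available nodes")

        # Assumption: num_gpus is a per-node count and -1 means "all GPUs".
        if self.num_gpus != -1:
            if self.num_gpus < 0:
                raise ValueError("num_gpus must be -1 or non-negative")
            start = self.node_offset
            selected = list(all_nodes.values())[start:start + requested]
            if self.num_gpus > min(node.num_gpus for node in selected):
                raise ValueError("num_gpus exceeds GPUs available on a selected node")


# Usage: keys are hostnames, values are node objects, as described above.
nodes = {"node-0": FakeNode(num_gpus=4), "node-1": FakeNode(num_gpus=4)}
HardwareConfigSketch(num_nodes=1, num_gpus=2, node_offset=1).validate(nodes)
```

With the real class, the equivalent call would pass the actual dictionary of dragon.native.machine.Node objects; the checks it performs may differ from the ones assumed here.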
