dragon.ai.inference.config.InferenceConfig
- class InferenceConfig[source]
Bases:
objectMaster configuration for the entire inference pipeline.
Only
modelis required. All other fields have sensible defaults suitable for agentic pipelines.- __init__(model: ModelConfig, hardware: HardwareConfig = <factory>, batching: BatchingConfig = <factory>, guardrails: GuardrailsConfig = <factory>, dynamic_worker: DynamicWorkerConfig = <factory>, flask_secret_key: str = '', run_type: str = 'chat', token: str = '') None
Methods
__init__(model, hardware, batching, ...)from_dict(config_dict)Create InferenceConfig from dictionary (loaded from YAML).
validate_all(all_nodes)Validate all configuration sections.
Attributes
- model: ModelConfig
- hardware: HardwareConfig
- batching: BatchingConfig
- guardrails: GuardrailsConfig
- dynamic_worker: DynamicWorkerConfig
- classmethod from_dict(config_dict: dict ) InferenceConfig[source]
Create InferenceConfig from dictionary (loaded from YAML).
- Parameters:
config_dict (dict ) – Configuration dictionary loaded from YAML.
- Returns:
InferenceConfig instance.
- Return type:
- validate_all(all_nodes: dict ) None [source]
Validate all configuration sections.
- Parameters:
all_nodes (dict ) – Dictionary of all available nodes in the cluster. Keys are hostnames, values are
dragon.native.machine.Nodeobjects.- Raises:
ValueError – If any configuration parameter is invalid.
- __init__(model: ModelConfig, hardware: HardwareConfig = <factory>, batching: BatchingConfig = <factory>, guardrails: GuardrailsConfig = <factory>, dynamic_worker: DynamicWorkerConfig = <factory>, flask_secret_key: str = '', run_type: str = 'chat', token: str = '') None