Optim algorithms like BFGS, multi-LSTM hybrids, yaml with bounds and PBM, regularization - L2, per parameter optimisers by BernhardAhrens · Pull Request #267 · EarthyScience/EasyHybrid.jl

BernhardAhrens · 2026-05-05T12:05:39Z

No description provided.

gemini-code-assist

Code Review

This pull request refactors the train function by extracting its core logic into a private _train function and updating the public train method to support both explicit configuration objects and flat keyword arguments. Feedback points out a logic error where explicit configuration objects are overwritten if keyword arguments are provided, potentially leading to lost settings. It is also recommended to align the implementation with the documentation regarding the deprecation of flat keyword arguments by adding formal warnings.

gemini-code-assist · 2026-05-05T12:07:50Z

+function train(
+    model, data;
+    train_cfg::TrainConfig = TrainConfig(),
+    data_cfg::DataConfig  = DataConfig(),
+    kwargs...,
+)
+    if !isempty(kwargs)
+        train_cfg, data_cfg = kwargs_to_configs((), kwargs)
+    end
+
+    return _train(model, data, train_cfg, data_cfg)
+end


The current implementation has a logic error when mixing explicit configuration objects (train_cfg, data_cfg) with flat keyword arguments. If kwargs is not empty, the provided train_cfg and data_cfg are completely overwritten by new instances created from kwargs and defaults, leading to silent loss of user configuration.

If mixing is not intended to be supported, it is safer to use nothing as default values to detect explicit usage and prevent accidental overwriting. If mixing is intended, kwargs_to_configs should be updated to merge with the existing configuration objects.

function train( model, data; train_cfg::Union{TrainConfig, Nothing} = nothing, data_cfg::Union{DataConfig, Nothing} = nothing, kwargs..., ) if !isempty(kwargs) if !isnothing(train_cfg) || !isnothing(data_cfg) throw(ArgumentError("Cannot mix explicit `train_cfg`/`data_cfg` with flat keyword arguments.")) end train_cfg, data_cfg = kwargs_to_configs((), kwargs) else train_cfg = isnothing(train_cfg) ? TrainConfig() : train_cfg data_cfg = isnothing(data_cfg) ? DataConfig() : data_cfg end return _train(model, data, train_cfg, data_cfg) end

gemini-code-assist · 2026-05-05T12:07:50Z

+# Keyword Arguments
+- `train_cfg`: Training configuration. See [`TrainConfig`](@ref) for all options.
+- `data_cfg`: Data preparation configuration. See [`DataConfig`](@ref) for all options.
+- Any other kwargs (deprecated) are forwarded as fields to `TrainConfig` / `DataConfig`.


The docstring labels flat keyword arguments as "deprecated" in the context of the new train method. However, they are being explicitly supported in the method signature. If the intention is to phase them out, consider adding a formal deprecation warning when they are used via the new API, or clarify if they are intended as a permanent convenience feature.

lazarusA · 2026-05-06T05:09:21Z

+
+    unknown = [k for k in keys(kwargs) if k ∉ train_keys && k ∉ data_keys]
+    if !isempty(unknown)
+        @warn "Unknown kwargs will be ignored: $(join(unknown, ", "))"


maybe is better to throw error instead?

…del state to not get epoch already exists error

lazarusA · 2026-05-08T12:41:36Z

        target_names = model.targets
        save_epoch = stopper.best_epoch == 0 ? 0 : stopper.best_epoch
-        save_ps_st!(paths.best_model, model, cfg.cdev(ps), cfg.cdev(st), cfg.tracked_params, save_epoch)
+        save_ps_st(paths.best_model, model, cfg.cdev(ps), cfg.cdev(st), cfg.tracked_params, save_epoch)


it should be !. At this stage, the initial file was created already. I think this will write fully the file, or?

lazarusA · 2026-05-08T12:42:55Z

 export get_all_groups
 export load_group
-function save_ps_st(file_name, hm, ps, st, save_ps)
+function save_ps_st(file_name, hm, ps, st, save_ps, epoch = 0)


oh, I see. Same function in both places, no need for the ! variant?

lazarusA · 2026-05-08T12:45:16Z

+when non-empty it is forwarded as `tracked_params` on the resulting `TrainConfig`.
 """
-function kwargs_to_configs(kwargs)
+function kwargs_to_configs(save_ps, kwargs)


so, this makes the interface compatible with the old syntax ? good.

BernhardAhrens · 2026-06-12T11:26:42Z

@lazarusA I'd say my PR from hell is ready. I think I'll do not split into several PRs and just have a length squash commit messsage

lazarusA · 2026-06-12T12:10:26Z

/gemini summary

gemini-code-assist · 2026-06-12T12:15:36Z

Summary of Changes

This pull request introduces significant architectural enhancements to the hybrid modeling framework, primarily by integrating Optimization.jl to support a wider range of optimization algorithms. It enables flexible per-branch optimizer configurations, adds robust L2 regularization for model weights, and improves the persistence of model configurations in YAML. Additionally, the training pipeline has been refactored to support both traditional Optimisers.jl loops and new Optimization.jl-based solvers, providing greater versatility for training complex hybrid models.

Highlights

Optimization Engine Integration: Integrated Optimization.jl to support advanced solvers like L-BFGS, enabling both full-batch and minibatch training strategies.
Per-Branch Optimizers: Enabled per-parameter/per-branch optimizer configurations using Optimisers.jl, allowing for more granular control over training dynamics.
Regularization and Utilities: Added L2 regularization support for model weights and introduced utility functions for weight extraction and norm calculation.
Enhanced Configuration Serialization: Improved YAML configuration saving to include parameter bounds and the full source code of mechanistic models for better reproducibility.
Data Handling Improvements: Refined data processing for MultiNNHybridModel, including better sequence filtering and support for multi-branch neural network architectures.

New Features

🧠 You can now enable Memory (public preview) to help Gemini Code Assist learn from your team's feedback. This makes future code reviews more consistent and personalized to your project's style. Click here to enable Memory in your admin console.

Activity

Author BernhardAhrens indicated the PR is ready and will be submitted as a single large commit.
Reviewer lazarusA requested a summary of the changes.
Previous automated reviews addressed logic for mixing configuration objects with keyword arguments and clarified deprecation warnings.

lazarusA · 2026-06-12T12:19:26Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces an Optimization.jl-based training driver path alongside the existing Optimisers.jl loop, adds utilities for weight extraction and L2 regularization (extract_weights and weight_l2), updates the extra_loss signature to accept model parameters, and improves YAML serialization for configurations. Feedback on these changes highlights several key issues: Meta.parse error expressions are not properly caught during YAML serialization, eachslice indexing may cause runtime crashes on Julia versions prior to 1.9, and the new weight extraction utilities silently ignore parameters grouped in Tuple containers.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

gemini-code-assist · 2026-06-12T12:23:41Z

+    expr_and_next = try
+        Meta.parse(text, idx; greedy = true, raise = false)
+    catch
+        return nothing
+    end
+    expr_and_next === nothing && return nothing
+    _, next_idx = expr_and_next


When Meta.parse is called with raise = false, parsing errors do not throw an exception. Instead, it returns an expression containing the error (e.g., Expr(:error, ...) or Expr(:incomplete, ...)). Checking if expr_and_next === nothing is insufficient to catch these parsing failures, which can lead to invalid source code being written to the YAML. We should explicitly check if the parsed expression is an error or incomplete.

expr_and_next = try Meta.parse(text, idx; greedy = true, raise = false) catch return nothing end expr_and_next === nothing && return nothing expr, next_idx = expr_and_next if Meta.isexpr(expr, :error) || Meta.isexpr(expr, :incomplete) return nothing end

gemini-code-assist · 2026-06-12T12:23:41Z

+        # `eachslice(...; dims = 1)` (instead of `eachrow`) so this works for both
+        # the feed-forward case (2D `(param, batch)` output) and the recurrent/LSTM
+        # case (3D `(param, time, batch)` sequence output).
+        nn_cols = eachslice(nn_outputs[nn_name]; dims = 1)

        # Create parameter for this NN
        nn_param = NamedTuple{(param_name,), Tuple{typeof(nn_cols[1])}}((nn_cols[1],))


In Julia versions prior to 1.9, eachslice returns a Generator which does not support indexing (e.g., nn_cols[1]), which will cause a runtime crash. Since out_dim is always 1, we only ever need the first slice along the first dimension. Using selectdim is fully compatible with all Julia versions, avoids creating an iterator, and directly returns the first slice as a view.

# Use selectdim to get the first slice along the first dimension. # This works for both 2D and 3D outputs and is fully compatible with Julia < 1.9. nn_col = selectdim(nn_outputs[nn_name], 1, 1) # Create parameter for this NN nn_param = NamedTuple{(param_name,), Tuple{typeof(nn_col)}}((nn_col,))

gemini-code-assist · 2026-06-12T12:23:42Z

+_collect!(_, _, ::Symbol) = nothing
+
+function _collect!(out, node::Union{NamedTuple, ComponentArray}, key::Symbol)


If the parameters are grouped in a Tuple (which is a common container in Julia and Lux), _collect! will silently ignore them because it only has methods for NamedTuple and ComponentArray. Adding a method for Tuple allows recursing into tuple elements.

_collect!(_, _, ::Symbol) = nothing function _collect!(out, node::Tuple, key::Symbol) for child in node _collect!(out, child, key) end return nothing end function _collect!(out, node::Union{NamedTuple, ComponentArray}, key::Symbol)

gemini-code-assist · 2026-06-12T12:23:42Z

+_weight_l2_stats(::Any, ::Symbol) = (0.0f0, 0)
+
+function _weight_l2_stats(node::Union{NamedTuple, ComponentArray}, key::Symbol)


Similarly to _collect!, _weight_l2_stats will silently return (0.0f0, 0) for any parameters grouped in a Tuple. Adding a method for Tuple ensures that L2 regularization correctly covers all parameters.

_weight_l2_stats(::Any, ::Symbol) = (0.0f0, 0) function _weight_l2_stats(node::Tuple, key::Symbol) s = 0.0f0 n = 0 for child in node cs, cn = _weight_l2_stats(child, key) s = s + cs n = n + cn end return s, n end function _weight_l2_stats(node::Union{NamedTuple, ComponentArray}, key::Symbol)

support kwargs and configs

e3439f4

gemini-code-assist Bot reviewed May 5, 2026

View reviewed changes

kwargs over_ride configs

4af107b

lazarusA reviewed May 6, 2026

View reviewed changes

BernhardAhrens added 2 commits May 8, 2026 11:40

Refactor save_ps_st function to include epoch parameter for saving mo…

714d6ea

…del state to not get epoch already exists error

kwargs to config two pos args

0495dd1

lazarusA reviewed May 8, 2026

View reviewed changes

BernhardAhrens added 3 commits May 8, 2026 14:52

parameter bounds in yaml, mechanistic model in yaml

a777da7

towards regularization

15ed575

L2 loss steps

4feb1cd

This was linked to issues May 20, 2026

nn weights regularisation #269

Open

Custom L2 and L1 regularization - only on NN parameters #207

Open

BernhardAhrens and others added 9 commits May 20, 2026 15:23

LSTM fix time is not sample anymore

3594793

per parameter optimiser

bdb8e54

hard sigmoid inverse

19142cf

multi lstm

b2daf95

l2

40690f9

Optimization.jl implementation with BFGS and LBFGS etc

a407bd5

warning of train test for evaluation

50fcc92

runic

aafa490

fix tests

0c1b1c6

BernhardAhrens changed the title ~~support train kwargs and configs~~ Optim algorithms like BFGS, multi-LSTM hybrids, yaml with bounds and PBM, regularization - L2, per parameter optimisers Jun 12, 2026

gemini-code-assist Bot reviewed Jun 12, 2026

View reviewed changes

		_collect!(_, _, ::Symbol) = nothing

		function _collect!(out, node::Union{NamedTuple, ComponentArray}, key::Symbol)

		_weight_l2_stats(::Any, ::Symbol) = (0.0f0, 0)

		function _weight_l2_stats(node::Union{NamedTuple, ComponentArray}, key::Symbol)

Conversation

BernhardAhrens commented May 5, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 5, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 5, 2026

Choose a reason for hiding this comment

Uh oh!

lazarusA May 6, 2026

Choose a reason for hiding this comment

Uh oh!

lazarusA May 8, 2026

Choose a reason for hiding this comment

Uh oh!

lazarusA May 8, 2026

Choose a reason for hiding this comment

Uh oh!

lazarusA May 8, 2026

Choose a reason for hiding this comment

Uh oh!

BernhardAhrens commented Jun 12, 2026

Uh oh!

lazarusA commented Jun 12, 2026

Uh oh!

gemini-code-assist Bot commented Jun 12, 2026

Summary of Changes

Highlights

Uh oh!

lazarusA commented Jun 12, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Jun 12, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 12, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 12, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 12, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants