To open hyperparameters settings, click Edit underneath the Hyperparameters dropdown menu to open a dialog where you can set custom hyperparameters.

1. Network Type

This tab define defines the deep learning network we would like to use.

...

Options	Description
RCAN	For denoising and super-resolution. This is also the model used by our Nature Methods papaerpaper. https://www.nature.com/articles/s41592-021-01155-x
UNet	For virtual staining and segmentation. [1505.04597] U-Net: Convolutional Networks for Biomedical Image Segmentation (arxiv.org)

Network Shape

Options: 2D or 3D

Description: By default, you should choose a 2D model for 2D image data and a 3D model for 3D data. But you can train a 2D model using 3D data. It will process the image slice-by-slice.

...

How to use it: Increase it for model complexity, reduce it for a smaller model..

Number of Residual Groups

...

How to use it: Increase it for model complexity, reduce it for a smaller model..

Channel Reduction Factor

...

Description: Channel reduction factor for the squeeze-and-excitation module. See

How to use it: Increase channel reduction factor for better performance.

...

How to use it: Increase it to build a more complex model, reduce it for reduce it for a smaller model.

Number of Initial Filter

...

How to use it: Increase it to build a more complex model, reduce it for reduce it for a smaller model.

Filter Growth Factor

...

How to use it: Increase it to build a more complex model, reduce it for reduce it for a smaller model.

Normalization Type

...

2. Training Parameters

This tab define defines some general parameters and how is Aivia going to update the model weights during training

...

Options	Description	When to use
None	Use the raw input to train deep learning models	Choose this option if you want to use the original data to train or your input images have been normalized (Note: If the image is 8-bit or 16-bit, the scripts will error out and ask users to choose one of the normalization methods.)
Percentile	Normalize input images using percentile method. Normalizes the image intensity so that the 2nd and 99th percentiles are converted to 0 and 1 respectively.	Generally good for fluorescence images
Divide by Max	Using the max intensity value to normalize images.	Useful for normalizing segmentation mask

...

Data Augmentation

Options	Description	When to use
None	No augmentation	If you believe you have enough image pair samples
Rotate_and_flip	Randomly rotate and flip data to increasing increase input data variety. Note that when this option is selected, you need to make sure the Block Size width and height is are the same.	If you have little a limited amount of data, allow allowing data augmentation generally gives you a better results and prevent prevents overfitting.

Block Size

Default: 256, 256, 16 (width, height, depth)

How to adjust: If your GPU is less capable, reduce the each default by several pixels until you can run the training on your computer without out-of-memory issueissues. Do not make block size too small, otherwise, the model may not have enough pixels/voxels to pass down the convolution neural networks.

...

Options	Description	When to use
Intensity threshold	If `intensity_threshold > 0`, pixels whose intensities are greater than this threshold will be considered as foreground.	Set the threshold when your images has have fewer foregroundforegrounds. Try to start with a small number such as 0.0525.
Area ratio threshold	If `intensity_threshold > 0`, the generator calculates the ratio of foreground pixels in a target patch , and rejects the patch if the ratio is smaller than this threshold.	Set the threshold when your images has have fewer foreground signals. Try to start with 0.2505.

Optimizer

...

Options

...

Description

...

The optimizer is used for updating model weights. Generally, Adam is good for all kinds of tasks.

Default: Adam

Options: sgd, rmsprop, adagrad, adadelta, adamax, nadam

Initial Learning Rate

Default: 0.0001

...

Options	Description	When to use
Staircase exponential decay	drop the learning rate by half every `100` epochs.	Default
Exponential Decay	Exponentially reduce the learning rate on every epoch using the function: learning_rate = learning_rate*0.5^(epoch/100)	If staircase exponential decay does not works work for your model
Reduce on Plateau	Reduce learning rate to 0.1*learning_rate when validation loss has stopped improving for more than 10 epochs.	For models that are harder to train.

...

How to use: Checked this if you want to stop training when validation loss has stopped improving for more than 10 epochs.

Batch Size

DefualtDefault: 1

How to adjust: Increase it if you have a larger GPU RAM to speed up the training.

...

How to adust: Reduce it to reduce training time. Increase it if the model has room to improve and not overfit.

...

Description: steps*batch_size examples will be given to the model to updates update the weights.

How to adjust: Increase it if you want your models to see more example examples per epoch

Loss Function

The goal function that the optimizer try tries to minimize when updating model weights. Usually the lower the loss the better the results.

Options	Description	When to use
Mean absolute error	Measures the mean absolute error (MAE) between each element in the input x and target y.Default Prediction(Pred) and Ground Truth(GT).	The default for Denoising, Super-Resolution, and Virtual Staining
balanced binary crossentropy (to be implemented)cross-entropy	Weighted verision version binary crossentropy cross-entropy loss for imbalanced data.	Default for Segmentation
Mean squared error	Measures the mean squared error (MAE) between each element in the input x and target yPrediction(Pred) and Ground Truth(GT).	More sensitive to outlier comparing compared to mean absolute error.
binary cross-entropy	BCE compares each of the predicted probabilities to binary Ground Truth	Good for segmentation, only when the data is balanced
dice loss	2*(Pred ∩ GT) / (Pred + GT)	Also good for imbalanced data

...

Options	Description	When to use
PSNR	Computes the peak signal-to-noise ratio between two images. Note that the maximum signal value is assumed to be 1.	Denoising, Super-Resolution, and Virtual Staining
SSIM	Computes the structural similarity index between two images. Note that the maximum signal value is assumed to be 1.	Denoising, Super-Resolution, and Virtual Staining
Accuracy	Correct outputs/Total outputs	Segmentation

3. Apply Parameters

This tab define defines how are we going to update the model weights during training

...

Intensity Normalization Method

Should It should be the same as the intensity normalization method in Training parameters.

...

Unless your GPU can process a larger block at a time, you should use the same block size in Training Parameters. Do not choose a s block size that is smaller than the training block size, the neural network will not have enough information to pass down.

...

Version	Old Version 5	New Version Current
Changes made by	Hung-yu Chang	Hung-yu Chang
Saved on	Oct 20, 2021	Feb 14, 2022

Versions Compared

Key

1. Network Type

Network Shape

Number of Residual Groups

Channel Reduction Factor

Number of Initial Filter

Filter Growth Factor

Normalization Type

2. Training Parameters

Data Augmentation

Block Size

Optimizer

Initial Learning Rate

Batch Size

Loss Function

3. Apply Parameters

Intensity Normalization Method

Content Comparison

Versions Compared

Key

1. Network Type

Network Shape

Number of Residual Groups

Channel Reduction Factor

Number of Initial Filter

Filter Growth Factor

Normalization Type

2. Training Parameters

Data Augmentation

Block Size

Optimizer

Initial Learning Rate

Batch Size

Loss Function

3. Apply Parameters

Intensity Normalization Method