Training the SALT3 Model

usage:

trainsalt -c <configfile> <options>

Although there are a number of training configuration files in the examples/ directory, the simplest way to train the SALT3.K21 model with all data and spectra and with the latest calibrations is to use the configuration files and data in the examples/SALT3TRAIN_K21_PUBLIC directory.

To train the SALT3.K21 model, run:

trainsalt -c Train_SALT3_public.conf

This directory contains all the lightcurves, spectra, and filter definition files needed to train the model, with outputs in the output directory.

The training is slow given the large data volume and takes approximately 1 to 1.5 days, but can be sped up with a couple reasonable choices. The first is changing the steps_between_errorfit argument to estimate model uncertainties less frequently, as uncertainty estimation (~4.5 hours) is the slowest component of the code:

trainsalt -c Train_SALT3_public.conf --steps_between_errorfit 15

Another option is to bin the spectra, which will reduce the amount of spectroscopic data points by an order of magnitude:

trainsalt -c Train_SALT3_public.conf --binspec True

This should not result in any noticeable difference to the model surfaces but hasn’t yet been tested fully. Additional speed and memory usage improvements are currently in progress.

SALT3 Training Configuration Options

See the examples/SALT3TRAIN_K21_PUBLIC/Train_SALT3_public.conf file and the examples/SALT3TRAIN_K21_PUBLIC/training.conf files for the full list of training options. Two configuration files are used with the goal that users should rarely have to modify the default training.conf options. Descriptions of each option are given below.

Name Default Description
main config file    
[iodata]    
snlists   ASCII file or comma-separated list of files. Each file contains a list of SN files (see Photometry and Spectroscopy Format for input file format)
tmaxlist   Time of maximum light for each SN in training. See examples/SALT3TRAIN_K21_PUBLIC/SALT3_PKMJD_INIT.LIST.
snparlist   initial list x0,x1,c and FITPROB (prob. that the data matches model, from SALT2). See examples/SALT3TRAIN_K21_PUBLIC/SALT3_PARS_INIT.LIST
specrecallist   Option to provide an initial set of spectral recalibration parameters. No longer recommended.
dospec True If set, use spectra in training
maxsn None Debug option to limit the training to a given number of SNe
outputdir   Directory for trained model outputs
keeponlyspec False Debug option - keep only those SNe with spectroscopic data
initm0modelfile Hsiao07.dat Initial SN SED model. Initial parameter guesses are derived from this file. Default is the Hsiao model.
initm1modelfile   Initial SN SED model. Will guess M1 from a time-dilated Hsiao model if no file is given.
initsalt2model True If True, use SALT2 as the initial guess. Otherwise use initm0modelfile.
initsalt2var False If set, initialize model uncertainties using SALT2 values. No longer recommended as SALT3 error prescription is different.
initbfilt Bessell90_B.dat Nominal B-filter for putting priors on the normalization
resume_from_outputdir   Resume the training from an existing output directory
resume_from_gnhistory   If resume_from_outputdir is set, set to same directory name to resume training from a gnhistory.pickle file. This is useful if training crashes.
loggingconfig logging.yaml Gives configuration options for the training logs
trainingconfig training.conf Additional configuration file. Will look in the package directory if it’s not found in the current directory
calibrationshiftfile   A file that can adjust the calibration of the input files, e.g. for estimating systematics
filter_mass_tolerance 0.01 Amount of filter “mass” allowed to be outside the SALT wavelength range
fix_salt2modelpars False Debug option - if True, does not fit for M0 and M1.
validate_modelonly False If True, only produces model validation plots but not plots spectra or lightcurves (slow, and occasionally crashes).
[survey_<SURVEY>]   The parameters file requires a category for every SURVEY key in SN data files
kcorfile   Kcorfile (includes filter ZPT offsets and filter definitions) for each SURVEY key in SN data files
subsurveylist   Comma-separated list of sub-surveys for every survey, e.g. CFA4 is the subsurvey for survey name PS1_LOWZ_COMBINED(CFA4)
[trainparams]    
gaussnewton_maxiter 30 Maximum number of Gauss-Newton iterations allowed if convergence (delta chi^2 < 1) is not reached
regularize True Include regularization if True
fitsalt2 False Try to fit SN parameters with SALT2 model in the validation stage if True
n_repeat 1 deprecated, leave alone
fit_model_err True If True, fits model errors every steps_between_errorfit iterations
fit_cdisp_only False If True and fit_model_err is True, fits for the color scatter but no other model errors
steps_between_errorfit 5 Estimate model errors every x iterations
model_err_max_chisq 4 Begin estimating model errors when the reduced chi^2 of the training is below this
condition_number 1e-80 Conditioning matrices for the Gauss-Newton process. Leave this alone.
fit_tpkoff False if true, fit for time of maximum light along with other parameters (not well tested yet)
fitting_sequence all optionally, can fit for different model components in sequence. Can make it hard for training to converge
training.conf file   In most cases, leave these alone
[trainingparams]    
specrecal 1 if 1 (or True), do the spectral recalibration
n_processes 1 deprecated
estimate_tpk False not recommended estimate time of maximum light for each SN before beginning the training. Not robust.
fix_t0 False deprecated
n_min_specrecal 3 minimum number of parameters for the spectral recalibration polynomial
n_max_specrecal 10 maximum number of parameters for the spectral recalibration polynomial
regulargradientphase 1e4 amplitude of gradient regularization chi^2 penalty for phase (semi-arbitrary)
regulargradientwave 1e5 amplitude of gradient regularization chi^2 penalty for wavelength (semi-arbitrary)
regulardyad 1e4 amplitude of dyadic regularization chi^2 penalty (semi-arbitrary)
m1regularization 100 multiply regularization amplitude for the M1 component by this amount (semi-arbitrary)
specrange_wavescale_specrecal 2500 normalizes the spectra for recalibration
n_specrecal_per_lightcurve 0.5 add one spectral recal parameter for every two photometric bands in a given SN
regularizationScaleMethod fixed options for adjusting regularization scale in training/saltresids.py
wavesmoothingneff 1 Gaussian smoothing scale for the amount of training data at each wavelength for smoothly varying Neff
phasesmoothingneff 3 Gaussian smoothing scale for the amount of training data at each phase for smoothly varying Neff
nefffloor 1e-4 below nefffloor, regularization does not continue to increase in strength
neffmax 0.01 above neffmax, regularization is turned off
binspec False use spectral binning if True
binspecres 29 resolution of the spectral binning
spec_chi2_scaling 0.5 tuned so that spectra and photometry contribute ~equally to total chi^2 in training
[modelparams]    
waverange 2000,11000 wavelength range over which the model is defined
colorwaverange 2800,8000 wavelength range over which the color law polynomial is fit
interpfunc bspline function for interpolating the model between control points (b-spline is default)
errinterporder 0 order of the spline interpolation for the errors
interporder 3 order of the spline interpolation for the model
wavesplineres 69.3 number of Angstroms between wavelength control points
waveinterpres 10 wavelength resolution of the model used during training (Angstroms)
waveoutres 10 wavelength resolution of the trained model written to output directory (Angstroms)
phaserange -20,50 phase range over which the model is defined (rest-frame days)
phasesplineres 3.0 phase resolution of the trained output model (days)
phaseinterpres 0.2 phase resolution of the model used during training (days)
phaseoutres 1 phase resolution of the trained model written to output directory (days)
n_colorpars 5 number of parameters used to define the color law polynomial
n_colorscatpars 5 number of parameters used to define the color scatter
n_components 2 number of model components (M0, M1) - additional components not yet allowed
error_snake_phase_binsize 6 spacing in days for the SALT error model B-spline interpolation
error_snake_wave_binsize 1200 spacing in Angstroms for the SALT error model B-spline interpolation
use_snpca_knots False if true, use the knot locations from the SALT2 training
[priors]   key is the name of a decorator in training/priors.py; value determines the (semi-arbitrary) width of each prior
x1mean 0.1 mean x1 = 0
x1std 0.1 standard deviation of x1 values = 1
m0endalllam 1e-5 at -20 days, M0 must go to zero flux
m1endalllam 1e-4 at -20 days, M1 must go to zero flux
colorstretchcorr 1e-4 color and stretch should not be correlated
colormean 1e-3 mean sample color is zero
m0positiveprior 1e-2 M0 is not allowed to be negative
recalprior 50 don’t allow spectral recalibration to go crazy
[bounds]    
x1 -5,5,0.01 min,max,prior width on x1