Population Annealing Monte Carlo pamc
¶
pamc
explores the parameter space by using the Population Annealing Monte Carlo (PAMC) method.
Preparation¶
mpi4py should be installed for the MPI parallelization.
$ python3 -m pip install mpi4py
Input parameters¶
This has two subsections algorithm.param
and algorithm.pamc
.
[algorithm.param
]¶
This defines a space to be explored.
When mesh_path
key is defined the discrete space is used.
Otherwise, continuous space is used.
Continuous space
initial_list
Format: List of float. Length should be equal to
dimension
.Description: Initial value of parameters. If not defined, these will be initialize randomly.
min_list
Format: List of float. Length should be equal to
dimension
.Description: Minimum value of each parameter. When a parameter falls below this value during the Monte Carlo search, the solver is not evaluated and the value is considered infinite.
max_list
Format: List of float. Length should be equal to
dimension
.Description: Maximum value of each parameter. When a parameter exceeds this value during the Monte Carlo search, the solver is not evaluated and the value is considered infinite.
step_list
Format: List of float. Length should be equal to
dimension
.Description: The step length in one Monte Carlo update (deviation of Gaussian distribution).
Discrete space
mesh_path
Format: string
Description: Path to the mesh definition file.
neighborlist_path
Format: string
Description: Path to the neighborhood-list file.
[algorithm.pamc
]¶
numsteps
Format: Integer
Description: The number of Monte Carlo steps.
numsteps_annealing
Format: Integer
Description: The number of interval Monte Carlo steps between lowering “temperature”.
Tnum
Format: Integer
Description: The number of “temperature” points.
Tmin
Format: Float
Description: The minimum value of the “temperature” (\(T\)).
Tmax
Format: Float
Description: The maximum value of the “temperature” (\(T\)).
bmin
Format: Float
Description: The minimum value of the “inverse temperature” (\(\beta = 1/T\)). One of the “temperature” and “inverse temperature” should be defined.
bmax
Format: Float
Description: The maximum value of the “inverse temperature” (\(\beta = 1/T\)). One of the “temperature” and “inverse temperature” should be defined.
Tlogspace
Format: Boolean (default: true)
Description: Whether to assign “temperature” to replicas equally spaced in the logarithmic space or not.
nreplica_per_proc
Format: Integer (default: 1)
Description: The number of replicas in a MPI process.
resampling_interval
Format: Integer (default: 1)
Description: The number of annealing processes between resampling of the replicas.
fix_num_replicas
Format: Boolean (default: true)
Description: Whether to fix the number of replicas or not on resampling.
About the number of steps¶
Specify just two of numstep
, numsteps_annealing
, and numT
.
The value of the remaining one will be determined automatically.
Reference file¶
Mesh definition file¶
Define the grid space to be explored in this file. The first column is the index of the mesh, and the second and subsequent columns are the values of variables. Note that the index of the mesh will be ignored for this “algorithm”.
Below, a sample file is shown.
1 6.000000 6.000000
2 6.000000 5.750000
3 6.000000 5.500000
4 6.000000 5.250000
5 6.000000 5.000000
6 6.000000 4.750000
7 6.000000 4.500000
8 6.000000 4.250000
9 6.000000 4.000000
...
Neighborhood-list file¶
Before searching in the discrete space by Markov Chain Monte Carlo method, we should define “neighborhoods” for each point \(i\), which are points that a walker can move from \(i\) A neighborhood-list file defines the list of neighborhoods. In this file, the index of an initial point \(i\) is specified by the first column, and the indices of final points \(j\) are specified by the second and successive columns.
An utility tool, odatse_neighborlist
is available for generating a neighborhood-list file from a mesh file. For details, please see Related Tools.
0 1 2 3
1 0 2 3 4
2 0 1 3 4 5
3 0 1 2 4 5 6 7
4 1 2 3 5 6 7 8
5 2 3 4 7 8 9
...
Output files¶
RANK/trial_T#.txt
¶
This file stores the suggested parameters and the corresponding value returned from the solver for each temperature point (specified by #
).
The first column (step
) is the index of the MC step.
The second column (walker
) is the index of the walker in the process.
The third column (beta
) is the inverse temperature of the replica.
The fourth column (fx
) is the value of the solver.
The fifth - (4+dimension)-th columns are the coordinates.
The last two columns (weight
and ancestor
) are the Neal-Jarzynsky weight and the grand-ancestor of the replica.
Example:
# step walker beta fx x1 weight ancestor
0 0 0.0 73.82799488298886 8.592321856342956 1.0 0
0 1 0.0 13.487174782058675 -3.672488908364282 1.0 1
0 2 0.0 39.96292704464803 -6.321623766458111 1.0 2
0 3 0.0 34.913851603463 -5.908794428939206 1.0 3
0 4 0.0 1.834671825646121 1.354500581633733 1.0 4
0 5 0.0 3.65151610695736 1.910894059585031 1.0 5
...
RANK/trial.txt
¶
This is a combination of all the trial_T#.txt
in one.
RANK/result_T#.txt
¶
This file stores the sampled parameters and the corresponding value returned from the solver for each replica and each temperature.
This has the same format as trial.txt
.
# step walker beta fx x1 weight ancestor
0 0 0.0 73.82799488298886 8.592321856342956 1.0 0
0 1 0.0 13.487174782058675 -3.672488908364282 1.0 1
0 2 0.0 39.96292704464803 -6.321623766458111 1.0 2
0 3 0.0 34.913851603463 -5.908794428939206 1.0 3
0 4 0.0 1.834671825646121 1.354500581633733 1.0 4
0 5 0.0 3.65151610695736 1.910894059585031 1.0 5
...
RANK/result.txt
¶
This is a combination of all the result_T#.txt
in one.
best_result.txt
¶
The optimal value of the solver and the corresponding parameter among the all samples.
nprocs = 4
rank = 2
step = 65
fx = 0.008233957976993406
z1 = 4.221129370933539
z2 = 5.139591716517661
fx.txt
¶
This file stores statistical metrics over the all replicas for each temperature. The first column is inverse temperature. The second and third column are the expectation value and the standard error of the solver’s output (\(f(x)\)), respectively. The fourth column is the number of replicas. The fifth column is the logarithmic of the ratio between the normalization factors (partition functions)
where \(\beta_0\) is the minimum value of \(\beta\) used in the calculation. The sixth column is the acceptance ratio of MC updates.
# $1: 1/T
# $2: mean of f(x)
# $3: standard error of f(x)
# $4: number of replicas
# $5: log(Z/Z0)
# $6: acceptance ratio
0.0 33.36426034198166 3.0193077565358273 100 0.0 0.9804
0.1 4.518006242920819 0.9535301415484388 100 -1.2134775491597027 0.9058
0.2 1.5919146358616842 0.2770369776964151 100 -1.538611313376179 0.9004
...
Restart¶
The execution mode is specified by the run_mode
parameter to the constructor.
The operation of each mode is described as follows.
The parameter values correspond to --init
, --resume
, and --cont
options of odatse
command, respectively.
"initial"
(default)The program is started from the initialized state. If the checkpointing is enabled, the intermediate states will be stored at the folloing occasions:
at the end of calculation at each temperature point, the specified number of steps has been done, or the specified period of time has passed.
at the end of the execution.
"resume"
The program execution is resumed from the latest checkpoint. The conditions such as the number of MPI processes should be kept the same.
"continue"
The program execution is continued from the previous run. The sequence of the temperature points should be specified so that it is continuous from that of the previous run.
Assume that the temperature has been lowered from
Tmax=
\(T^{(1)}\) toTmin=
\(T^{(2)}\) in the previous run, the next values should be taken asTmax=
\(T^{(2)}\) andTmin=
\(T^{(3)}\). In the new calculation, the temperature points are taken from \(T^{(2)}\) to \(T^{(3)}\) divided byTnum
, namely, \(T_0 = T^{(2)}\), \(T_1\),…, \(T_{\text{Tnum}-1}=T^{(3)}\). (Tnum
can be different from the previous run.)
Algorithm¶
Goal¶
When the weight of the configuration \(x\) under some parameter \(\beta_i\) is given as \(f_i(x)\) (e.g., the Bolzmann factor \(f_i(x) = \exp[-\beta_i E(x)]\) ), the expectation value of \(A\) is defined as
where \(Z = \int \mathrm{d} x f_i(x)\) is the normalization factor (partition function) and \(\tilde{f}(x) = f(x)/Z\) is the probability of \(x\).
Our goal is to numerically calculate the expectation value for each \(\beta_i\) and the (ratio of) the normalization factor.
Annealed Importance Sampling (AIS) [1]¶
First, we introduce a series of configurations \(\{x_i\}\) obeying the following joint probability
with
where \(T_i(x, x')\) is a transition probability from \(x\) to \(x'\) under \(\beta_i\) holding the balance condition,
It turns out that \(\tilde{f}_n(x_n)\) is the marginal distribution of \(\tilde{f}(x_0, x_1, \dots, x_n)\), that is,
from
Consequently, \(\langle A \rangle_n\) is represented by using the extended configuration \(\{x_i\}\) as
Unfortunately, it is difficult to generate directly a series of configurations \(\{x_i\}\) following the distribution \(\tilde{f}(x_0, x_1, \dots, x_n)\). Then, instead of \(\tilde{f}(x_0, x_1, \dots, x_n)\), we consider \(\{x_i\}\) obeying the joint distribution
by using the following the following scheme:
Generete \(x_0\) from the initial distribution \(\tilde{f}_0(x)\)
Generate \(x_{i+1}\) from \(x_i\) through \(T_{i+1}(x_i, x_{i+1})\)
By using the reweighting method (or importance sampling method), \(\langle A \rangle_n\) is rewritten as
Because the ratio between \(\tilde{f}\) and \(\tilde{g}\) is
the form of the expectation value will be
Finally, the ratio between the normalization factors \(Z_n/Z_0\) can be evaluated as
and therefore the expectation value of \(A\) can be evaluated as a weighted arithmetic mean:
This weight \(w_n\) is called as the Neal-Jarzynski weight.
population annealing (PA) [2]¶
Although the AIS method can estimate the expectation values of \(A\) for each parameter \(\beta\) as the form of weighted arithmetic mean, the variance of weights \(w\) is generally large and then the accuracy of the result gets worse. In order to overcome this problem, the population annealing Monte Carlo (PAMC) method resamples all the replicas according to the probability \(p^{(k)} = w^{(k)} / \sum_k w^{(k)}\) at some periods and resets all the weights to unity.
The following pseudo code describes the scheme of PAMC:
for k in range(K):
w[0, k] = 1.0
x[0, k] = draw_from(β[0])
for i in range(1, N):
for k in range(K):
w[i, k] = w[i-1, k] * ( f(x[i-1,k], β[i]) / f(x[i-1,k], β[i-1]) )
if i % interval == 0:
x[i, :] = resample(x[i, :], w[i, :])
w[i, :] = 1.0
for k in range(K):
x[i, k] = transfer(x[i-1, k], β[i])
a[i] = sum(A(x[i,:]) * w[i,:]) / sum(w[i,:])
There are two resampling methods: one with a fixed number of replicas[2] and one without[3].
References¶
[1] R. M. Neal, Statistics and Computing 11, 125-139 (2001).
[2] K. Hukushima and Y. Iba, AIP Conf. Proc. 690, 200 (2003).
[3] J. Machta, Phys. Rev. E 82, 026704 (2010).