论文解读:自适应参数控制方案介绍

[算法 1] A novel approach in parameter adaptive and diversity maintenance for genetic algorithms (APGAIN)

1. The techniques of PRAM

A probabilistic rule-driven adaptive model (PRAM) is used to adapt the crossover rate and the mutation rate automatically, where three different parameter values are used on the population for the adaptation. These values will be adjusted automatically during the search by a set of rules according to the fitness improvement gained.

PRAM can be applied to adapting both pc and pm by running two PRAMs concurrently, each PRAM is responsible for adapting a parameter.

Step 1: PRAM uses three parameter values ( v 0 v_0 v0; v 1 v_1 v1; v 2 v_2 v2 with v 0 < v 1 < v 2 v_0<v_1<v_2 v0<v1<v2) to adapt a control parameter (pm or pc). The run of a GA is divided into epochs.
Step 2: Each epoch is further divided into two periods. During the first period, the corresponding genetic operations (crossover or mutation) are applied on the chromosomes using any one of the three values randomly. In the second period, the probabilities of applying any one of the three values are proportional to their fitness improvement gained in the first period.
Step 3: At the end of each epoch, a set of ‘‘greedy rules’’ are applied to alter the values of v 0 v_0 v0; v 1 v_1 v1; v 2 v_2 v2 according to three fitness improvement rules: E x p a n d   r u l e Expand~rule Expand rule, S t a y   r u l e Stay~rule Stay rule, L e f t   m o v e   r u l e Left~move~rule Left move rule, and R i g h t   m o v e   r u l e Right~move~rule Right move rule;

在这里插入图片描述

2. The repelling algorithm

The repelling algorithm samples a population to a ‘‘representative’’. Population diversity is maintained by driving the population away from the representative. The fitness of a chromosome is modified to include the ‘‘diversity fitness’’, which is inversely proportional to the similarity between the chromosome and the representative. In other wards, the population diversity is maintained by giving a higher survival probability to chromosomes located in a sparsely populated region.

Step 1: The algorithm modifies the fitness evaluation function to increase the survival opportunity of chromosomes with almost lost alleles. The modified fitness of a chromosome ( f E f_E fE) is given as:
f E ( ω , r o b j , r d i v ) = ( 1 − ω ) × f r a n k O j b ( r o b j ) + ω × f r a n k D i v ( r d i v ) , f_E(\omega, r_{obj}, r_{div}) = (1-\omega)\times f_{rankOjb}(r_{obj}) + \omega \times f_{rankDiv}(r_{div}), fE(ω,robj,rdiv)=(1ω)×frankOjb(robj)+ω×frankDiv(rdiv), where ω \omega ω is the weight of diversity, r o j b r_{ojb} rojb ( r d i v r_{div} rdiv) is the ranking of the objective fitness (diversity fitness) of the chromosome, f r a n k O j b f_{rankOjb} frankOjb ( f r a n k D i v f_{rankDiv} frankDiv) returns a score directly proportional to the ranking of the objective fitness (diversity fitness).

Step 2: The diversity fitness is evaluated by comparing the chromosome with the population’s representative. The representative is represented by an array of real numbers in the range of [0, 1]. Assuming binary chromosomes, the probability of finding a ‘0’ at each allele of the chromosome is assigned to the corresponding array element of the representative. The algorithm to create the representative of a population in the repelling model is as follows.
在这里插入图片描述
Step 3: Get the diversity fitness of a chromosome by rewarding the chromosome that are dissimilar to the representative.
在这里插入图片描述
Step 4: The replacement scheme after a genetic operation (crossover or mutation) is done by tournament between the offspring and the parents.

3. The lazy repelling algorithm

A lazy repelling algorithm is proposed to further reduce the computational overheads. The diversity fitness is evaluated only if the following condition holds:
Σ i = 1 β ∣ R ′ [ i ] 2 − R o l d [ i ] 2 ∣ β ≥ φ \frac{\Sigma^\beta_{i=1}|R'[i]^2-R_{old}[i]^2|}{\beta}\geq \varphi βΣi=1βR[i]2Rold[i]2φ
where φ \varphi φ is lazy threshold, R ′ [ i ] R'[i] R[i] is the i i i-th allele of the latest representative, R o l d [ i ] R_{old}[i] Rold[i] is the i i i-th allele of the o l d old old representative computed last time when the diversity fitness is evaluated, β \beta β is the bit-length of a chromosome.

[算法 2] A Compass to guide genetic algorithms

1. Steps of the compass

Step 1: Let d i t d_{it} dit and q i t q_{it} qit be the population’s mean diversity variation and quality (fitness) variation, respectively, Define a vector o i t = ( d i t , q i t ) o_{it} = (d_{it}, q_{it}) oit=(dit,qit) to characterize the effects of the operator over the population in terms of variation of quality and diversity (axis Δ D \Delta D ΔD and Δ Q \Delta Q ΔQ);
Step 2: then normalize d i t n = d i t m a x i { ∣ d i t ∣ } d^n_{it} = \frac{d_{it}}{max_i\{|d_{it}|\}} ditn=maxi{ dit}dit and q i t n = q i t m a x i { ∣ q i t ∣ } q^n_{it} = \frac{q_{it}}{max_i\{|q_{it}|\}} qitn=maxi{ qit}qit, we have o i t n = ( d i t n , q i t n ) o^n_{it} = (d^n_{it}, q^n_{it}) oitn=(ditn,qitn)
在这里插入图片描述
Algorithms that just consider the fitness improvement to adjust the operator probabilities would only use the projection of o i t o_{it} oit over the y-axis (dotted lines in (Fig. b)). On the other hand, if diversity is solely taken in account, measures would be considered as the projection over the x-axis (Fig. c).
在这里插入图片描述

Step 3: To control the fitness improvement and the diversity together, a vector c c c (defined by its angle θ ∈ [ 0 , π 2 ] \theta \in [0,\frac{\pi}{2}] θ[0,2π]) that characterizes also its orthogonal plane P (see Fig. d).
在这里插入图片描述
Rewards are then based on the projection of vectors o i t n o^n_{it} oitn over c c c, i.e., ∣ o i t n ∣ c o s ( α i t ) |o^n_{it}|cos(\alpha_{it}) oitncos(αit), α i t \alpha_{it} αit being the angle between o i t n o^n_{it} oitn and c c c. A value of θ \theta θ close to 0 0 0 encourage exploration, while a value close to π / 2 \pi/2 π/2 will favor exploitation. In this way, the management of application rates is abstracted by the angle θ \theta θ, that guides the direction of the search as the needle of a compass shows the north.

Step 4: Projections are turned into positive values by subtracting the smallest one and dividing them by execution time, in order to award faster operators (Fig. e).
δ i t = ∣ o i t n ∣ c o s ( α i t ) − m i n i { ∣ o i t n ∣ c o s ( α i t ) } T i t \delta_{it} = \frac{|o^n_{it}|cos(\alpha_{it}) - min_i\{|o^n_{it}|cos(\alpha_{it})\}}{T_{it}} δit=Titoitncos(αit)mini{ o

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值