排班问题——Nurse Rostering Problem（NRP）实战

最新推荐文章于 2025-03-06 09:42:51 发布

原创最新推荐文章于 2025-03-06 09:42:51 发布 · 7.5k 阅读

28 ·

CC 4.0 BY-SA版权

文章标签：

#python #机器学习

机器学习专栏收录该内容

2 篇文章

订阅专栏

本文介绍了如何利用Google的OR-Tools解决复杂的Nurse Rostering Problem（NRP），通过调研、代码实现和参数调整，展示了将机器学习应用于实际问题的实践过程，并以Python为例展示了代码示例。

文章目录

背景
- 问题调研
- 工具查找——找巨人的肩膀
Action
总结

背景

上次周末碰到女朋友在排班表，花了快一个小时，就想着看用代码帮她节省点时间。

问题调研

上网查了一下，看了几篇论文了解了背景。

http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.1030.5363&rep=rep1&type=pdf
https://arxiv.org/pdf/1804.05002.pdf

排班之类的问题都统称为 Nurse Rostering Problem（NRP）问题，就复杂性而言，这是一个NP-hard问题

P问题是在多项式时间内可以被解决的问题，而NP问题是在多项式时间内可以被验证其正确性的问题

多项式时间指的是什么？我理解就是平时计算的算法复杂度

O(1) – constant-time
O(log_2(n)) – logarithmic-time
O(n) – linear-time
O(n^2) – quadratic-time
O(n^k) – polynomial-time
O(k^n) – exponential-time
O(n!) – factorial-time

关于NP问题的描述可以看看我找到的这篇文章

提及复杂性理论，是因为更精准地分析问题后，才能找到合适的工具。

这类问题我们就会想到用计算机科学里面的方法去处理，例如machine learning。

工具查找——找巨人的肩膀

机器学习的技术栈，我学过的就是Python中的scikit-learn、TensorFlow和Keras上去做选择

据我了解，这是一个研究了多年的课题了。既然这样，应该是会有现成训练好的模型去做这类事情。

最后的最后，我就找到了Google开发的OR-Tools（Official Site）。它已经有训练过的模型去处理这类排班问题了。

Action

Code example

官方例子Source Code

主要就是对源码做调整了，以满足现实需求。

or-tools的基本用法

摘抄的伪代码，用于了解工作方式

// 选择和定义一个model，然后设置所需的前置数据和限制条件
model = cp_model.CpModel()
model.NewBoolVar(...)
model.AddExactlyOne(...)
model.Add(...)

// 选择和定义一个solver，然后针对model进行optimizate
solver = cp_model.CpSolver()
status = solver.Solve(model, solution_printer)

// 创建一个矩阵进行拟合
work = {}
for e in range(num_employees):
    for s in range(num_shifts):
        for d in range(num_days):
            work[e, s, d] = model.NewBoolVar('work%i_%i_%i' % (e, s, d))
// 下面是打印结果的算法
if status == cp_model.OPTIMAL or status == cp_model.FEASIBLE:
     print()
     header = '          '
     for w in range(num_weeks):
         header += 'M T W T F S S '
     print(header)
     for e in range(num_employees):
         schedule = ''
         for d in range(num_days):
             for s in range(num_shifts):
                 if solver.BooleanValue(work[e, s, d]): // NOTE: 获取优化算法出来的结果
                     schedule += shifts[s] + ' '
         print('worker %i: %s' % (e, schedule))
     print()

参数调整

例子源码中参数注释都很清楚地标明了用法了，适当对现实问题的需要作出调整就行了。下面伪代码只是针对参数作了一些说明。

num_employees = 4 // 员工数，改为目标的员工数
num_weeks = 5 // 工作天数，目标是排一个月的班表，改为5周
shifts = ['O', 'M', 'A', 'N'] // 班期，和目标一样

# Fixed assignment: (employee, shift, day).
# 固定前两天的，排班前得动态调整下
fixed_assignments = [
    (0, 0, 0),
    (1, 1, 0),
    (2, 2, 0),
    (3, 3, 0),
    (0, 0, 1),
    (1, 1, 1),
    (2, 2, 1),
    (3, 3, 1),
]

# Request: (employee, shift, day, weight)
# 员工想要固定休息的时间（位置），权重值我理解为代表拟合时可调整的优先级
requests = [
    # Employee 3 does not want to work on the first Saturday (negative weight for the Off shift).
    (3, 0, 5, -2),
    # Employee 2 does not want a night shift on the first Friday (positive weight).
    (2, 3, 4, 4)
]

# Shift constraints on continuous sequence :
#     (shift, hard_min, soft_min, min_penalty,
#             soft_max, hard_max, max_penalty)
# hard_min: 硬性限制，在周期内最少连续要上的班期天数
# soft_min: 软性限制，在周期内最少连续要上的班期天数
# soft_max和hard_max同上去理解
shift_constraints = [
    # One or two consecutive days of rest, this is a hard constraint.
    (0, 1, 1, 0, 2, 2, 0),
    # betweem 2 and 3 consecutive days of night shifts, 1 and 4 are
    # possible but penalized.
    (3, 1, 2, 20, 3, 4, 5),
]

# Weekly sum constraints on shifts days:
#     (shift, hard_min, soft_min, min_penalty,
#             soft_max, hard_max, max_penalty)
weekly_sum_constraints = [
    # 每周最少要休息的天数
    (0, 1, 2, 7, 2, 3, 4),
]

# Penalized transitions:
#     (previous_shift, next_shift, penalty (0 means forbidden))
penalized_transitions = [
    # 尽量避免午班换晚班
    (2, 3, 4),
    # 晚班不能接着早班
    (3, 1, 0),
]

# daily demands for work shifts (morning, afternon, night) for each day
# of the week starting on Monday.
# 每个班期最少要有多少人，我这里的现实问题是有一个人就行了
weekly_cover_demands = [
        (1, 1, 1),  # Monday
        (1, 1, 1),  # Tuesday
        (1, 1, 1),  # Wednesday
        (1, 1, 1),  # Thursday
        (1, 1, 1),  # Friday
        (1, 1, 1),  # Saturday
        (1, 1, 1),  # Sunday
]

# Penalty for exceeding the cover constraint per shift type.
excess_cover_penalties = (2, 2, 5)

num_days = num_weeks * 7
num_shifts = len(shifts)

画甘特图

技术栈中，选择了matplot去画甘特图，也是现学现卖。

# Generate plot image. [reference: https://www.geeksforgeeks.org/python-basic-gantt-chart-using-matplotlib]
	shifts_colors = ['white', 'tab:green', 'tab:orange', 'tab:blue']
    if status == cp_model.OPTIMAL or status == cp_model.FEASIBLE:
        offset = 2
        # Declaring a figure "gnt"
        fig, gnt = plt.subplots()
        # Setting Y-axis limits
        gnt.set_ylim(0, num_employees * 15)
        # Setting X-axis limits
        gnt.set_xlim(1, num_days + offset)
        # Setting labels for x-axis and y-axis
        gnt.set_xlabel('Dates')
        gnt.set_ylabel('Employees')
        # Setting ticks on y-axis
        # gnt.set_yticks([15, 25, 35, 45])
        gnt.set_xticks(list(range(1, num_days + offset, 1)))
        gnt.set_yticks(list(range(15, (num_employees + 1) * 10 + 5, 10)))
        # Labelling tickes of y-axis
        gnt.set_yticklabels(employees_names)
        # Setting graph attribute
        gnt.grid(True)

        for e in range(num_employees):
            for d in range(num_days):
                for s in range(num_shifts):
                    if solver.BooleanValue(work[e, s, d]):
                        gnt.broken_barh([(d+1, d+2)], (10*(e+1), 9),
                                        facecolors=(shifts_colors[s]))

        plt.savefig("gantt1.png")