什么是多线程“伪唤醒“以及为什么要用while循环进行wait判断-优快云博客

本文链接：https://blog.youkuaiyun.com/Wyf_Fj/article/details/126505728

生产者消费者模型中有这样一段代码：

while (isFull()) // 队列为满，此时等待消费者消费数据
{
    pthread_cond_wait(&can_produce, &lock); // 阻塞，等待被唤醒
}
// TODO 生产数据

为什么这里需要用while循环而不是直接用if呢？

if (isFull())
{
    pthread_cond_wait(&can_produce, &lock);
}
// TODO 生产数据

因为牵扯到了伪唤醒的问题。

所谓伪唤醒有两种情况：

wait函数调用失败：那么这种情况下线程仿佛被"唤醒"了一样，可以继续向下执行。但是由于没有进行二次条件判断，所以在条件不满足的情况下进行生产或消费必然会导致问题！
由于系统的原因，线程被误唤醒了：事实上这一点也可以归到wait函数调用失败的情况。
使用pthread_cond_broadcast()唤醒了一批线程，但是事实上生产者生产的数据量小于唤醒的线程量，那么就会导致过量消费！

比如，我们设置一个生产者，两个消费者：

生产者每3s生产一个数据，并使用pthread_cond_broadcast()唤醒两个消费者：

其中一个消费者优先获得锁，消费了仅有的一个数据。当它释放锁后，另一个线程获取到锁，但是由于使用的是if控制，所以没有进行判空检测，直接去消费，从而导致段错误！

注：这里使用3s生产一个数据只是为了模拟数据量小于生产者数量的情况！

补充
事实上，在man手册中我们可以看到官方的一个提示：

When using condition variables there is always a Boolean predicate involving shared variables associated with each condition wait that is true if the thread should proceed. Spurious wakeups from the pthread_cond_timedwait() or pthread_cond_wait() functions may occur. Since the return from pthread_cond_timedwait() or pthread_cond_wait() does not imply anything about the value of this predicate, the predicate should be re-evaluated upon such return.

当我们使用条件变量时，我们总会利用一个bool类型的变量判断当前线程是否应该运行(相当于生产着消费者模型中的isFull和isEmpty)。
但是伪唤醒(Spurious wakeups)是可能发生的，而且当前线程被唤醒并不意味着这个bool类型的变量成立。（可以理解，因为刚刚也分析过了，伪唤醒的三种原因，它们都和bool类型的变量无关）
因此，在pthread_cond_wait()返回时，应该再次进行条件判断。（即使用while循环！）