ProcDump-for-Linux 中实现新触发器的完整指南-优快云博客

本文链接：https://blog.youkuaiyun.com/gitblog_00379/article/details/148755562

ProcDump-for-Linux 中实现新触发器的完整指南

ProcDump-for-Linux A Linux version of the ProcDump Sysinternals tool 项目地址: https://gitcode.com/gh_mirrors/pr/ProcDump-for-Linux

概述

ProcDump-for-Linux 是一个强大的进程诊断工具，它允许开发人员基于特定条件（触发器）自动生成进程的核心转储文件。本文将详细介绍如何在该工具中添加一个新的触发器类型，以扩展其监控能力。

触发器架构解析

ProcDump 采用模块化架构设计，每个触发器类型都有独立的监控线程。核心组件包括：

监控线程：每个触发器类型对应一个专用线程，定期检查目标进程状态
配置系统：存储所有用户指定的参数和阈值
事件处理：统一的退出机制和日志系统

触发器线程通常遵循以下工作模式：

等待启动信号
定期轮询目标进程状态
当条件满足时生成核心转储
处理退出请求

实现新触发器的四步流程

第一步：命令行参数处理

选择开关名称：建议使用2字符的短名称，如"-so"表示socket触发器
参数解析：在GetOptions函数中添加处理逻辑
输入验证：确保阈值参数合法（非负整数）

else if( 0 == strcasecmp( argv[i], "/so" ) ||
         0 == strcasecmp( argv[i], "-so" ))
{
    if( i+1 >= argc || self->SocketThreshold != -1 ) return PrintUsage();
    if(!ConvertToInt(argv[i+1], &self->SocketThreshold)) return PrintUsage();
    if(self->SocketThreshold < 0)
    {
        Log(error, "Invalid socket threshold count specified.");
        return PrintUsage();
    }
    i++;
}

更新帮助文档：同步修改使用说明、man页面和README

第二步：配置系统更新

头文件修改：在ProcDumpConfiguration.h中添加新字段

int SocketThreshold;  // 套接字数量阈值

初始化设置：在InitProcDumpConfiguration中设置默认值(-1表示禁用)
配置复制：确保新字段在配置复制时正确处理

第三步：创建监控线程

在CreateMonitorThreads函数中添加线程创建逻辑：

if (self->SocketThreshold != -1 && !tooManyTriggers)
{
    if (self->nThreads < MAX_TRIGGERS)
    {
        if ((rc = pthread_create(&self->Threads[self->nThreads].thread, 
                               NULL, 
                               SocketCountMonitoringThread, 
                               (void *)self)) != 0)
        {
            Trace("CreateMonitorThreads: failed to create SocketThread.");
            return rc;
        }
        self->Threads[self->nThreads].trigger = SocketCount;
        self->nThreads++;
    }
    else
    {
        tooManyTriggers = true;
    }
}

同时需要在ProfilerCommon.h的TriggerType枚举中添加新类型。

第四步：实现监控逻辑

声明函数：在Monitor.h中添加线程函数声明
实现基础框架：

void* SocketCountMonitoringThread(void *thread_args)
{
    Trace("SocketCountMonitoringThread: Enter [id=%d]", gettid());
    struct ProcDumpConfiguration *config = (struct ProcDumpConfiguration *)thread_args;
    
    // 初始化代码...
    
    if ((rc = WaitForQuitOrEvent(config, &config->evtStartMonitoring, INFINITE_WAIT)) == WAIT_OBJECT_0 + 1)
    {
        while ((rc = WaitForQuit(config, config->PollingInterval)) == WAIT_TIMEOUT)
        {
            // 实际监控逻辑
        }
    }
    
    Trace("SocketCountMonitoringThread: Exit [id=%d]", gettid());
    return NULL;
}

实现核心监控逻辑：

int socketCount = GetCurrentSocketCount(config->ProcessId);
if (socketCount >= config->SocketThreshold)
{
    Log(info, "Trigger: Sockets:%d on process ID: %d", socketCount, config->ProcessId);
    rc = WriteCoreDump(writer);
    if(rc != 0)
    {
        SetQuit(config, 1);
    }
    
    if ((rc = WaitForQuit(config, config->ThresholdSeconds * 1000)) != WAIT_TIMEOUT)
    {
        break;
    }
}