- Activation functions are applied in the hidden layers to achieve the maximum predictive power of the neural network.
- An activation function allows the model to capture non-linearities
- Applied to node inputs to produce node outputs
Linear functions vs. non-linear functions
If the relationships in the data aren't straight-line relationships, we need an activation function to capture the non-linearities, as the sketch below illustrates.
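As a minimal sketch with made-up weights (the arrays below are illustrative, not from the course), stacking two layers with a purely linear (identity) activation collapses into a single linear transformation, which is why a non-linear activation is needed:
import numpy as np
# Hypothetical input and weights, for illustration only
x = np.array([2.0, 3.0])
w_hidden = np.array([[1.0, -1.0],
                     [0.5, 2.0]])      # 2 inputs -> 2 hidden nodes
w_output = np.array([1.0, 1.0])        # 2 hidden nodes -> 1 output
# With the identity ("linear") activation, the two layers reduce to one linear map
hidden_linear = w_hidden @ x
print((w_output * hidden_linear).sum())    # -1.0 + 7.0 = 6.0
print(w_output @ w_hidden @ x)             # same result: 6.0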
ReLU (rectified linear activation)
The rectified linear activation function (ReLU) has been shown to lead to very high-performance networks. It takes a single number as input and returns 0 if the input is negative, or the input itself if the input is positive.
Here are some examples:
relu(3) = 3
relu(-3) = 0
tanh() is another commonly used activation function; it squashes its input into the range (-1, 1).
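A minimal sketch of tanh applied to a node input (the value 0.5 is just an example):
import numpy as np
node_input = 0.5                   # hypothetical weighted sum flowing into a node
node_output = np.tanh(node_input)  # maps any real input into (-1, 1)
print(node_output)                 # ~0.46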
def relu(input):
    '''Rectified linear activation: return the input if it is positive, otherwise 0.'''
    # Calculate the value for the output of the relu function: output
    output = max(0, input)
    # Return the value just calculated
    return output
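The node calculations below assume input_data and weights are already in memory (as in the course environment). A hypothetical setup, purely so the snippet can be run stand-alone:
import numpy as np
# Hypothetical values, for illustration only
input_data = np.array([3, 5])
weights = {'node_0': np.array([2, 4]),
           'node_1': np.array([4, -5]),
           'output': np.array([2, 7])}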
# Calculate node 0 value: node_0_output
node_0_input = (input_data * weights['node_0']).sum()
node_0_output = relu(node_0_input)
# Calculate node 1 value: node_1_output
node_1_input = (input_data * weights['node_1']).sum()
node_1_output = relu(node_1_input)
# Put node values into array: hidden_layer_outputs
hidden_layer_outputs = np.array([node_0_output, node_1_output])
# Calculate model output (do not apply relu)
model_output = (hidden_layer_outputs * weights['output']).sum()
# Print model output
print(model_output)
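With the hypothetical input_data and weights above, this prints 52.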
# Define predict_with_network()
def predict_with_network(input_data_row, weights):
    # Calculate node 0 value
    node_0_input = (input_data_row * weights['node_0']).sum()
    node_0_output = relu(node_0_input)
    # Calculate node 1 value
    node_1_input = (input_data_row * weights['node_1']).sum()
    node_1_output = relu(node_1_input)
    # Put node values into array: hidden_layer_outputs
    hidden_layer_outputs = np.array([node_0_output, node_1_output])
    # Calculate model output (relu is applied to the final layer here)
    input_to_final_layer = (hidden_layer_outputs * weights['output']).sum()
    model_output = relu(input_to_final_layer)
    # Return model output
    return model_output
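The loop below expects input_data to hold one array per observation; a hypothetical batch, again for illustration only:
# Hypothetical batch of observations, one array per data point
input_data = [np.array([3, 5]), np.array([1, -1]),
              np.array([0, 0]), np.array([8, 4])]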
# Create empty list to store prediction results
results = []
for input_data_row in input_data:
    # Append prediction to results
    results.append(predict_with_network(input_data_row, weights))
# Print results
print(results)
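With the hypothetical batch and weights defined above, the predictions are 52, 63, 0, and 148.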