Skip to contents

Abstract

The application of genetic algorithms (GAs) offers a powerful approach to optimize mathematical functions, particularly non-linear functions such as quadratic equations. In this part we present a detailed examination of optimizing the quadratic function f(x)=x24x+4f(x) = x^2 - 4x + 4 through a genetic algorithm framework. By defining the initial population, evaluating fitness, selecting individuals, and iterating through crossover and mutation processes, we demonstrate how GAs can effectively converge to the optimal solution of a given function.

Introduction

Genetic algorithms are inspired by the principles of natural selection and genetics. They are widely utilized in optimization problems where traditional methods may falter due to the complexity of the search space. We will illustrate the step-by-step implementation of a genetic algorithm to minimize the quadratic function f(x)=x24x+4f(x) = x^2 - 4x + 4. The objective is to identify the value of xx that yields the minimum value of the function, thereby providing insights into the capabilities and workings of GAs.

Demonstration

Initialization of population

The optimization process begins with the initialization of a population. In this scenario, we define a population consisting of three individuals with randomly assigned integer values within the range of 0 to 3. The values of the individuals in the population are represented as X1(x=1)X_1(x=1), X2(x=3)X_2(x=3), and X3(x=0)X_3(x=0).

(Note: the values are random and the population should be highly diversified) The space of x value is kept integer type and on range from 0 to 3,for simplification.

population <- c(1, 3, 0)

Fitness Evaluation

The next step involves evaluating the fitness of each individual within the population. The fitness function, defined as f(x)=x24x+4f(x) = x^2 - 4x + 4, measures the performance of each individual. The fitness values for the initial population can be calculated as follows:

Calculate fitness(f(x)) for each individual:

  • For X1X_1: f(1)=1241+4=1f(1) = 1^2 - 4 \cdot 1 + 4 = 1
  • For X2X_2: f(3)=3243+4=1f(3) = 3^2 - 4 \cdot 3 + 4 = 1
  • For X3X_3: f(0)=0240+4=4f(0) = 0^2 - 4 \cdot 0 + 4 = 4

Coding the function f(x) in R A quadratic function is a function of the form: ax2+bx+c where a≠0

So for f(x)=x24x+4f(x) = x^2 - 4x + 4

The fitness function is implemented in R as follows:

a <- 1
b <- -4
c <- 4
f <- function(x) {
    a * x^2 + b * x + c
}

A graphical representation of the quadratic function f(x)f(x) is generated over the defined domain of 0x40 \leq x \leq 4, showcasing the function’s behavior and the positions of the population individuals within the graph.

Let’s try 0 ≤ x ≤ 3:

# Define the domain over which we want to plot f(x)
x <- seq(from = 0,
         to = 4,
         length.out = 100)

# Define the domain over which we want to plot f(x) and create df data frame
df <- x |>
  data.frame(x = _) |>
  dplyr::mutate(y = f(x))

# Define the space over which we want to population to be
possible_xvalues <- seq(from = 0,
                        to = 3,
                        length.out = 4)

# Create space data frame
space <- possible_xvalues |>
  data.frame(x = _) |>
  dplyr::mutate(y = f(x))

# Calculate fitness inline
fitness <- population ^ 2 - 4 * population + 4

# Selected the surviving parents
num_parents <- 2
selected_parents <- population |>
  order(fitness, decreasing = FALSE) |>
  head(num_parents)

# Plot f(x) using ggplot
ggplot2::ggplot(df, aes(x = x, y = y)) +
  geom_line(color = "black") + # Plot the function as a line
  geom_point(
    data = subset(space, x %in% c(1, 3)),
    color = "coral1",
    size = 3,
    shape = 8
  ) + # Plot points at x=1 and x=3
  geom_point(
    data = subset(space, (x == 0)),
    color = "blue",
    size = 3,
    shape = 8
  ) + # Plot a point at x=0
  geom_hline(yintercept = 0,
             linetype = "dashed",
             color = "blue") + # Add horizontal line at y=0
  geom_vline(xintercept = 0,
             linetype = "dashed",
             color = "blue") + # Add vertical line at x=0
  theme_minimal()

Selection of Parents

In this step, we select the individuals that will serve as parents for the crossover process. Selection is based on fitness values, favoring individuals with lower fitness scores. For this population, the individuals Y1(x=1)Y_1(x=1) and Y2(x=3)Y_2(x=3) are chosen as parents for crossover due to their respective fitness values.

Select parents for crossover:

    -   Y1(x=1), Y2(x=3)
# Plot f(x) using ggplot
ggplot(df, aes(x = x, y = y)) +
    geom_line(color = "black") + # Plot the function as a line
    geom_point(data = subset(space, x %in% c(1, 3)), color = "coral1", size = 6, shape = 8) + # Plot points at x=1 and x=3
    geom_hline(yintercept = 0, linetype = "dashed", color = "blue") + # Add horizontal line at y=0
    geom_vline(xintercept = 0, linetype = "dashed", color = "blue") + # Add vertical line at x=0
    theme_minimal() # Use a minimal theme

Crossover and Mutation

Crossover is performed to produce new offspring individuals. In this case, the offspring generated from the parents are Z1(x=1)Z_1(x=1) and Z2(x=3)Z_2(x=3), without introducing mutation for simplicity. The offspring maintain the population’s genetic material, allowing for exploration within the search space while leveraging successful traits from the parents.

-   Generate offspring through crossover and mutation:
    -   Z1(x=1), Z2(x=3) (no mutation in this example)

Replacement

Following the generation of offspring, a replacement strategy is employed to maintain the population size. The least fit individual, X3X_3, is replaced by one of the newly created offspring, ensuring that the overall population continues to evolve toward optimal solutions.

-   Replace individuals in the population:
    -   Replace X3 with Z1, maintaining the population size.

Iteration Until Termination

The algorithm iterates through the aforementioned steps—evaluation, selection, crossover, mutation, and replacement—until a termination condition is met. The optimal individuals are defined as those that yield the lowest value of the fitness function, effectively reaching the function’s minimum.

-  Repeat Steps 2-5 for multiple generations until a termination condition is met.

Finding the Optimal Solution

To ascertain the optimal solution for the quadratic function, we utilize the vertex formula of a quadratic equation, which determines the minimum point. The vertex can be calculated as:

F(b2a,f(b2a)) F\left(\frac{-b}{2a}, f\left(\frac{-b}{2a}\right)\right)

Implementing this in R allows us to identify the coordinates of the vertex, which represent the minimum point of the function.

find.fitting <- function(a, b, c) {
    x_fitting <- -b / (2 * a)
    y_fitting <- f(x_fitting)
    c(x_fitting, y_fitting)
}
F <- find.fitting(a, b, c)

Adding the Fitting to the plot:

# Plot f(x) using ggplot
ggplot(df, aes(x = x, y = y)) +
    geom_line(color = "black") + # Plot the function as a line
    geom_hline(yintercept = 0, linetype = "dashed") + # Add horizontal line at y=0
    geom_vline(xintercept = 0, linetype = "dashed") + # Add vertical line at x=0
    geom_point(x = F[1], y = F[2], shape = 18, size = 6, color = "red") + # Plot the vertex
    geom_text(x = F[1], y = F[2], label = "Fitting", vjust = -1, color = "red", size = 5) + # Add label next to the vertex
    theme_minimal() # Use a minimal theme

Analysis of X-Intercepts

In addition to finding the vertex, we explore the x-intercepts of the function, which are determined by solving the equation f(x)=0f(x) = 0 using the quadratic formula:

x=b±b24ac2a x = \frac{-b \pm \sqrt{b^2 - 4ac}}{2a}

The discriminant plays a critical role in identifying the nature of the roots:

  • A positive discriminant indicates two real solutions (x-intercepts).
  • A zero discriminant indicates one real solution (one x-intercept).
  • A negative discriminant indicates no real solutions.

This analysis can also be implemented in R to derive the x-intercepts:

find.roots <- function(a, b, c) {
    discriminant <- b^2 - 4 * a * c
    if (discriminant > 0) {
        c((-b - sqrt(discriminant)) / (2 * a), (-b + sqrt(discriminant)) / (2 * a))
    } else if (discriminant == 0) {
        -b / (2 * a)
    } else {
        NaN
    }
}
solutions <- find.roots(a, b, c)

Adding the x-intercepts to the plot:

# Plot f(x) using ggplot
ggplot(df, aes(x = x, y = y)) +
    geom_line(color = "black") + # Plot the function as a line
    geom_hline(yintercept = 0, linetype = "dashed") + # Add horizontal line at y=0
    geom_vline(xintercept = 0, linetype = "dashed") + # Add vertical line at x=0
    geom_point(data = data.frame(x = solutions, y = rep(0, length(solutions))), shape = 18, size = 6, color = "red") + # Plot x-intercepts
    geom_text(data = data.frame(x = solutions, y = rep(0, length(solutions)), label = "Fitting(x-intercept)"), aes(label = label), vjust = -1, color = "red", size = 5) + # Add labels next to x-intercepts
    theme_minimal() # Use a minimal theme

Conclusion

This exploration into the optimization of the quadratic function f(x)=x24x+4f(x) = x^2 - 4x + 4 using genetic algorithms highlights the efficacy of evolutionary computation techniques in navigating complex search spaces. Through iterative evaluation, selection, crossover, and replacement processes, the genetic algorithm not only identifies the optimal solution but also provides a framework that can be adapted for a myriad of optimization challenges. As demonstrated, the approach successfully finds both the vertex and the x-intercepts, underscoring the method’s versatility in function optimization.