
Finally moving on from the Held Karp relaxation, we arrive at the second step of the Asadpour asymmetric traveling salesman problem algorithm. Referencing the Algorithm 1 from the Asadpour paper, we are now finally on step two.
Algorithm 1 An
-approximation algorithm for the ATSP
Input: A set
consisting of points and a cost function satisfying the triangle inequality. Output:
-approximation of the asymmetric traveling salesman problem instance described by and .
- Solve the Held-Karp LP relaxation of the ATSP instance to get an optimum extreme point solution
. Define as in (5), making it a symmetrized and scaled down version of . Vector can be viewed as a point in the spanning tree polytope of the undirected graph on the support of that one obtains after disregarding the directions of arcs (See Section 3.) - Let
be the support graph of when the direction of the arcs are disregarded. Find weights such that the exponential distribution on the spanning trees, (approximately) preserves the marginals imposed by , i.e. for any edge , for a small enough value of . (In this paper we show that suffices for our purpose. See Section 7 and 8 for a description of how to compute such a distribution.) - Sample
spanning trees from . For each of these trees, orient all its edges so as to minimize its cost with respect to our (asymmetric) cost function . Let be the tree whose resulting cost is minimal among all of the sampled trees. - Find a minimum cost integral circulation that contains the oriented tree
. Shortcut this circulation to a tour and output it. (See Section 4.)
Sections 7 and 8 provide two different methods to find the desired probability distribution, with section 7 using a combinatorial approach and section 8 the ellipsoid method. Considering that there is no ellipsoid solver in the scientific python ecosystem, and my mentors and I have already decided not to implement one within this project, I will be using the method in section 7.
The algorithm given in section 7 is as follows:
- Set
. - While there exists an edge
with :
- Compute
such that if we define as , and for all , then . - Set
. - Output
.
This structure is fairly straightforward, but we need to know what
Finding
Notice that the formula for
where
The first thing that I noticed is that in the denominator the summation is over all spanning trees for in the graph, which for the complete graphs we will be working with is exponential so a `brute force’ approach here is useless. Fortunately, Asadpour and team realized we can use Kirchhoff’s matrix tree theorem to our advantage.
As an aside about Kirchhoff’s matrix tree theorem, I was not familiar with this theorem before this project so I had to do a bit of reading about it. Basically, if you have a laplacian matrix (the adjacency matrix minus the degree matrix), the absolute value of any cofactor is the number of spanning trees in the graph. This was something completely unexpected to me, and I think that it is very cool that this type of connection exists.
The details of using Kirchhoff’s theorem are given in section 5.3.
We will be using a weighted laplacian
where
Now, we know that applying Krichhoff’s theorem to
but which part of
If we apply
So moving from the first row to the second row is a confusing step, but essentially we are exploiting the properties of exponents.
Recall that
Each exponential factor has the same base, so we can collapse that into
which is also
but we know that
Once we put that back into the summation we arrive at the denominator in
Next, we need to find the numerator for
There is a way to use Krichhoff’s matrix tree theorem here as well.
If we had a graph in which every spanning tree could be mapped in a one-to-one fashion onto every spanning tree in the original graph which contains the desired edge
From here, the logic to show that a cofactor from
At this point, we have all of the needed information to create some pseudo code for the next function in the Asadpour method, spanning_tree_distribution()
.
Here I will use an inner function q()
to find
def spanning_tree_distribution
input: z, the symmetrized and scaled output of the Held Karp relaxation.
output: gamma, the maximum entropy exponential distribution for sampling spanning trees
from the graph.
def q
input: e, the edge of interest
# Create the laplacian matrices
write lambda = exp(gamma) into the edges of G
G_laplace = laplacian(G, lambda)
G_e = nx.contracted_edge(G, e)
G_e_laplace = laplacian(G, lambda)
# Delete a row and column from each matrix to made a cofactor matrix
G_laplace.delete((0, 0))
G_e_laplace.delete((0, 0))
# Calculate the determinant of the cofactor matrices
det_G_laplace = G_laplace.det
det_G_e_laplace = G_e_laplace.det
# return q_e
return det_G_e_laplace / det_G_laplace
# initialize the gamma vector
gamma = 0 vector of length G.size
while true
# We will iterate over the edges in z until we complete the
# for loop without changing a value in gamma. This will mean
# that there is not an edge with q_e > 1.2 * z_e
valid_count = 0
# Search for an edge with q_e > 1.2 * z_e
for e in z
q_e = q(e)
z_e = z[e]
if q_e > 1.2 * z_e
delta = ln(q_e * (1 - 1.1 * z_e) / (1 - q_e) * 1.1 * z_e)
gamma[e] -= delta
else
valid_count += 1
if valid_count == number of edges in z
break
return gamma
Next Steps#
The clear next step is to implement the function spanning_tree_distribution
using the pseudo code above as an outline.
I will start by writing q
and testing it with the same graphs which I am using to test the Held Karp relaxation.
Once q
is complete, the rest of the function seems fairly straight forward.
One thing that I am concerned about is my ability to test spanning_tree_distribution
.
There are no examples given in the Asadpour research paper and no other easy resources which I could turn to in order to find an oracle.
The only method that I can think of right now would be to complete this function, then complete sample_spanning_tree
.
Once both functions are complete, I can sample a large number of spanning trees to find an experimental probability for each tree, then run a statistical test (such as an h-test) to see if the probability of each tree is near
where
Both methods seem very computationally intensive and because they are sampling from a probability distribution they may fail randomly due to an unlikely sample.
References#
A. Asadpour, M. X. Goemans, A. Mardry, S. O. Ghran, and A. Saberi, An o(log n / log log n)-approximation algorithm for the asymmetric traveling salesman problem, Operations Research, 65 (2017), pp. 1043-1061.