CS 6601: Artificial Intelligence - Assignment 1 - Search

Setup

Clone the repository and activate the Conda environment you created in Assignment 0:

git clone https://github.gatech.edu/omscs6601/assignment_1.git
conda activate ai_env

Overview

Search is an integral part of AI. It helps in problem solving across a wide variety of domains where a solution isn’t immediately clear. You will implement several graph search algorithms with the goal of solving bi-directional and tri-directional search.

Submission

All code you will edit is in the submission.py file, which will be submitted to Gradescope for grading. You are allowed two submissions every thirty minutes. In your Gradescope submission history, you can mark a certain submission as 'Active'.

The Files

While you'll only have to edit and submit submission.py, there are a number of notable files:

File	Description
submission.py	Where you will implement your PriorityQueue, Breadth First Search, Uniform Cost Search, A* Search, Bi-directional Search, and Tri-directional Search
search_submission_tests.py	Simple unit tests to validate your searches' validity and number of nodes explored
search_submission_tests_grid.py	Tests searches on a uniform grid and highlights the path and explored nodes
search_unit_tests.py	More detailed tests that run searches from all possible pairs of nodes in the graph
romania_graph.pickle	Serialized graph file for Romania
atlanta_osm.pickle	Serialized graph file for Atlanta (optional, for robust testing for the Race!)
explorable_graph.py	A wrapper around `networkx` that tracks explored nodes. FOR DEBUGGING ONLY
visualize_graph.py	Module to visualize search results. See below on how to use it.
osm2networkx.py	Module used by visualize_graph.py to read OSM networks

Resources

Links from Canvas, below the videos:

Resources for bi-directional searches

Please refrain from referring to code or pseudocode from any resources other than these.

The Assignment

Your task is to implement several informed search algorithms that will calculate a driving route between two points in Romania with a minimal time and space cost. There is a search_submission_tests.py file to help you along the way. Your searches should be executed with minimal runtime and memory overhead.

We will be using an undirected network representing a map of Romania (and an optional Atlanta graph used for the Race!).

Grading

Points for each section are awarded based on finding the correct path and on the number of nodes explored. To track the number of times a node is explored during the search, the ExplorableGraph wrapper is used on the networkx Graph class. Every time you process a node by calling graph[node] or graph.neighbors(node), the count for that node increases by one. You will need one of these methods to add a node's neighbors to the search queue; just be careful not to call them unnecessarily elsewhere in your code. We have provided the graph.get_edge_weight(u, v) method for accessing the weight of the edge between two nodes, u and v. All other normal networkx Graph operations can be performed.
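To make the counting rule concrete, here is a minimal sketch. `CountingGraph` and its `explored` attribute are hypothetical stand-ins written for this illustration; they are not the course's ExplorableGraph, and only the `graph[node]` and `get_edge_weight(u, v)` call shapes are taken from the text above.

```python
from collections import Counter


class CountingGraph:
    """Hypothetical stand-in for ExplorableGraph (not the course class)."""

    def __init__(self, weighted_edges):
        # Build an undirected adjacency map: node -> {neighbor: weight}.
        self._adj = {}
        for u, v, w in weighted_edges:
            self._adj.setdefault(u, {})[v] = w
            self._adj.setdefault(v, {})[u] = w
        self.explored = Counter()

    def __getitem__(self, node):
        # Every neighbor lookup counts as one exploration of `node`.
        self.explored[node] += 1
        return self._adj[node]

    def get_edge_weight(self, u, v):
        # Reading a weight this way does not touch the counter.
        return self._adj[u][v]


graph = CountingGraph([("a", "b", 2), ("a", "c", 5), ("b", "c", 1)])

# One neighbor lookup, then weights read through get_edge_weight.
neighbors = sorted(graph["a"])
weights = {n: graph.get_edge_weight("a", n) for n in neighbors}

# Wasteful: indexing the graph again only to read a weight bumps the count.
same_weight = graph["b"]["c"]
```

After this runs, "a" has been counted once even though two weights were read for it, while "b" has been counted once purely because of the weight lookup through indexing.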

Visualizing the Atlanta graph:

The Atlanta graph is used in some later parts of this assignment. However, it is too big to display within a Python window like Romania. As a result, when you run the bidirectional tests in search_submission_tests.py, they generate a JSON file in the GeoJSON format. To see the graph, you can upload it to a private GitHub Gist or use this site. If you want to see how visualize_graph.py is used, take a look at the class TestBidirectionalSearch in search_submission_tests.py.

Frequently Asked Questions

If start and goal are the same, you should return [].

When nodes in the priority queue have the same priority value, break ties according to FIFO. Hint: A counter can be used to track when nodes enter the priority queue.

Your priority queue implementation should allow for duplicate nodes to enter the queue.

There is a little more to this when you get to tridirectional search, so read those Notes especially carefully as well.

Do not use graph.explored_nodes for anything that you submit to Gradescope. This can be used for debugging, but you should not be calling this in your code. Please make sure you read the "grading" section above.

Do not create a copy of the graph structure for any of the algorithms or computations.

If you are stuck, check out the resources! We recognize this is a hard assignment and tri-directional search is a more research-oriented topic than the other search algorithms. Many previous students have found it useful to go through the resources in this README if they are having difficulty understanding the algorithms. Hopefully they are of some use to you all as well! :)

We have included the "Haversine" heuristic in the search_submission_tests.py file. All of the local tests on the Atlanta map use this method. For the race, you can use whatever you choose, but know that the Atlanta map positions are (latitude, longitude). If you would like to learn more about this formula, here is a link: https://en.wikipedia.org/wiki/Haversine_formula

Make sure you clean up any changes/modifications/additions you make to the networkx graph structure before you exit the search function. Depending on your changes, the auto grader might face difficulties while testing. The best alternative is to create your own data structure(s).

If you're having problems (exploring too many nodes) with your Breadth first search implementation, one thing many students have found useful is to re-watch the Canvas videos for an optimization trick mentioned.

Most 'NoneType object ...' errors occur because the path you return is not completely connected (a pair of successive nodes in the path are not connected) or because the path variable itself is empty.

Adding unit tests to your code may cause your submission to fail. It is best to comment them out when you submit.

The submissions will be graded by an autograder, and it will time out in 10 minutes. If you observe an abnormality, let us know on Ed.

You may make two submissions in any 30-minute window.

Gradescope will only allow the imports we provide you, i.e., you will lose a submission if you submit something with any other imports.
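For the Haversine heuristic mentioned above, the following is an independent sketch of the formula, not a copy of the version shipped in search_submission_tests.py. The function name `haversine_km`, the default Earth radius, and the choice of kilometers are assumptions; check them against the units of the Atlanta edge weights before relying on it.

```python
import math


def haversine_km(pos_a, pos_b, radius_km=6371.0):
    """Great-circle distance between two (latitude, longitude) pairs in degrees."""
    lat1, lon1 = map(math.radians, pos_a)
    lat2, lon2 = map(math.radians, pos_b)
    dlat = lat2 - lat1
    dlon = lon2 - lon1
    # Haversine of the central angle between the two points.
    a = math.sin(dlat / 2) ** 2 + math.cos(lat1) * math.cos(lat2) * math.sin(dlon / 2) ** 2
    return 2 * radius_km * math.asin(math.sqrt(a))


# A quarter of the equator: from (0, 0) to (0, 90).
quarter_equator = haversine_km((0.0, 0.0), (0.0, 90.0))
```

The argument order follows the (latitude, longitude) convention stated for the Atlanta map positions.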

Unit Tests

We have provided two official unit test files, and one unofficially developed one that students have found useful. They are not complete, and these tests are not guaranteed to ensure full points on the autograder, but they should help in development. To run:

python search_submission_tests.py       # Basic tests, visualizes on Romania
python search_submission_tests_grid.py  # Visualize search on grid
python search_unit_tests.py             # Unofficial, checks for path correctness

Warmups

We'll start by implementing some simpler optimization and search algorithms before the real exercises.

Warmup 1: Priority queue

[5 points]

In all searches that involve calculating path cost or heuristic (e.g. uniform-cost), we have to order our search frontier. It turns out the way that we do this can impact our overall search runtime.

To show this, you'll implement a priority queue which will help you in understanding its performance benefits. For large graphs, sorting all input to a priority queue is impractical. As such, the data structure you implement should have an amortized O(1) insertion and O(lg n) removal time. It should do better than the naive implementation in our tests (InsertionSortQueue), which sorts the entire list after every insertion.

In this implementation of the priority queue, if two elements have the same priority, they should be served in the order in which they were enqueued (see the third note below).

Notes:

Please note that the algorithm runtime is not the focus of this assignment. The already-imported heapq library should achieve the desired runtime.

The local tests provided are used to test the correctness of your implementation of the Priority Queue. To verify that your implementation consistently beats the naive implementation, you might want to test it with a large number of elements.

If you use the heapq library, keep in mind that the queue will sort entries as a whole upon being enqueued, not just on the first element. This means you need to figure out a way to keep elements with the same priority in FIFO order.

You may enqueue nodes however you like, but when your Priority Queue is tested, we feed nodes in the form (priority, value).
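The FIFO tie-breaking described in the notes above can be sketched with heapq and a monotonically increasing counter. The class name `FifoPriorityQueue` and the method names `append` and `pop` are assumptions for illustration; match them to the skeleton in submission.py.

```python
import heapq
import itertools


class FifoPriorityQueue:
    """Min-heap that serves equal priorities in insertion order."""

    def __init__(self):
        self._heap = []
        self._counter = itertools.count()

    def append(self, node):
        # `node` is a (priority, value) pair. The counter sits between the two
        # so ties on priority are decided by arrival order, never by `value`.
        priority, value = node
        heapq.heappush(self._heap, (priority, next(self._counter), value))

    def pop(self):
        # Drop the counter again so callers see the (priority, value) form.
        priority, _, value = heapq.heappop(self._heap)
        return (priority, value)

    def __len__(self):
        return len(self._heap)


queue = FifoPriorityQueue()
for item in [(1, "b"), (1, "a"), (0, "z")]:
    queue.append(item)
served = [queue.pop() for _ in range(len(queue))]
```

Because each counter value is unique, heapq never has to compare two values directly, which also avoids errors when values are not comparable.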

Warmup 2: BFS

[5 points]

To get you started with handling graphs, implement and test breadth-first search over the test network.

You'll complete this by writing the breadth_first_search() method. This returns a path of nodes from a given start node to a given end node, as a list.

For this part, it is optional to use the PriorityQueue as your frontier. You will require it from the next question onwards. You can use it here too if you want to be consistent.

Notes:

You need to include start and goal in the path.

If your start and goal are the same then just return [].

You can access all the neighbors of a given node by calling graph[node], or graph.neighbors(node) ONLY.

You are not allowed to maintain a cache of the neighbors for any node. You need to use the above mentioned methods to get the neighbors.

To measure your search performance, the explorable_graph.py provided keeps track of which nodes you have accessed in this way (this is referred to as the set of 'Explored' nodes). To retrieve the set of nodes you've explored in this way, call graph.explored_nodes. If you wish to perform multiple searches on the same graph instance, call graph.reset_search() to clear out the current set of 'Explored' nodes. WARNING, these functions are intended for debugging purposes only. Calls to these functions will fail on Gradescope.

In BFS, make sure you process the neighbors in alphabetical order. Because networkx uses dictionaries, the order that it returns the neighbors is not fixed. This can cause differences in the number of explored nodes from run to run. If you sort the neighbors alphabetically before processing them, you should return the same number of explored nodes each time.

For BFS only, the autograder requires implementing an optimization trick which fully explores fewer nodes. You may find it useful to re-watch the Canvas videos for this.
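The conventions in these notes (return [] when start equals goal, include both endpoints, process neighbors alphabetically) can be sketched on a plain dictionary graph. `bfs_path` and the `adjacency` dictionary are hypothetical; the real function takes the course's graph object. This sketch tests for the goal as soon as a neighbor is generated, which is one common BFS variant; whether it matches the optimization described in the Canvas videos is an assumption.

```python
from collections import deque


def bfs_path(adjacency, start, goal):
    """Breadth-first path on a plain dict graph: node -> iterable of neighbors."""
    if start == goal:
        return []
    parent = {start: None}
    frontier = deque([start])
    while frontier:
        node = frontier.popleft()
        # Sorting fixes the expansion order regardless of dict/set ordering.
        for neighbor in sorted(adjacency[node]):
            if neighbor in parent:
                continue
            parent[neighbor] = node
            if neighbor == goal:
                # Walk parent links back to start, then reverse.
                path = [goal]
                while parent[path[-1]] is not None:
                    path.append(parent[path[-1]])
                return path[::-1]
            frontier.append(neighbor)
    return None  # goal is unreachable from start


toy = {"a": {"c", "b"}, "b": {"a", "d"}, "c": {"a", "d"}, "d": {"b", "c"}}
route = bfs_path(toy, "a", "d")
```

Two equally short routes exist in the toy graph; sorting the neighbors is what makes the result the same on every run.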

Warmup 3: Uniform-cost search

[10 points]

Implement uniform-cost search, using PriorityQueue as your frontier. From now on, PriorityQueue should be your default frontier.

uniform_cost_search() should return the same kind of result as breadth-first search: the path to the goal node (as a list of nodes).

Notes:

You need to include start and goal in the path.

If your start and goal are the same then just return [].

The above are just to keep your results consistent with our test cases.

You can access all the neighbors of a given node by calling graph[node], or graph.neighbors(node) ONLY.

You can access the weight of an edge using: graph.get_edge_weight(node_1, node_2). Not using this method will result in your explored nodes count being higher than it should be.

You are not allowed to maintain a cache of the neighbors for any node. You need to use the above mentioned methods to get the neighbors and corresponding weights.

We will provide some margin of error in grading the size of your 'Explored' set, but it should be close to the results provided by our reference implementation.
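As a rough illustration of uniform-cost search with a frontier that tolerates duplicate entries, here is a sketch on a plain weighted dictionary. `ucs_path` and `adjacency` are hypothetical names, the heap is used directly instead of the PriorityQueue class from Warmup 1, and storing the whole path in each frontier entry is a simplification chosen for brevity rather than efficiency.

```python
import heapq
import itertools


def ucs_path(adjacency, start, goal):
    """Uniform-cost search on a dict graph: node -> {neighbor: weight}."""
    if start == goal:
        return []
    counter = itertools.count()  # FIFO tie-break for equal costs
    frontier = [(0, next(counter), start, [start])]
    explored = set()
    while frontier:
        cost, _, node, path = heapq.heappop(frontier)
        if node == goal:
            # The goal is only accepted when popped, so its cost is minimal.
            return path
        if node in explored:
            continue  # stale duplicate of an already expanded node
        explored.add(node)
        for neighbor in sorted(adjacency[node]):
            if neighbor not in explored:
                new_cost = cost + adjacency[node][neighbor]
                heapq.heappush(
                    frontier, (new_cost, next(counter), neighbor, path + [neighbor])
                )
    return None  # goal is unreachable from start


weighted = {
    "a": {"b": 1, "c": 5},
    "b": {"a": 1, "c": 1},
    "c": {"a": 5, "b": 1},
}
cheapest = ucs_path(weighted, "a", "c")
```

The direct edge from "a" to "c" is found first but costs more, so the search keeps going until the cheaper entry for "c" reaches the front of the queue.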

Warmup 4: A* search

[10 points]

Implement A* search using Euclidean distance as your heuristic. You'll need to implement euclidean_dist_heuristic() then pass that function to a_star() as the heuristic parameter. We provide null_heuristic() as a baseline heuristic to test against when calling a_star tests.

Hint: You can look up a node's position, if the key is available, with graph.nodes[n]['pos'].

Notes:

You need to include start and goal in the path.

If your start and goal are the same then just return [].

The above are just to keep your results consistent with our test cases.

You can access all the neighbors of a given node by calling graph[node], or graph.neighbors(node) ONLY.

You can access the weight of an edge using: graph.get_edge_weight(node_1, node_2). Not using this method will result in your explored nodes count being higher than it should be.

You are not allowed to maintain a cache of the neighbors for any node. You need to use the above mentioned methods to get the neighbors and corresponding weights.

You can access the (x, y) position of a node using: graph.nodes[n]['pos']. You will need this for calculating the heuristic distance.

We will provide some margin of error in grading the size of your 'Explored' set, but it should be close to the results provided by our reference implementation.
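A Euclidean heuristic built on the position lookup above might look like the following sketch. The signature `(graph, v, goal)` and the name `euclidean_dist` are assumptions, and `toy_graph` is a hypothetical stand-in that exposes only the `nodes[n]['pos']` lookup; use the signature from submission.py in your own code.

```python
import math
from types import SimpleNamespace


def euclidean_dist(graph, v, goal):
    """Straight-line distance between the 'pos' attributes of v and goal."""
    x1, y1 = graph.nodes[v]["pos"]
    x2, y2 = graph.nodes[goal]["pos"]
    return math.hypot(x2 - x1, y2 - y1)


# Hypothetical stand-in exposing only the nodes[n]['pos'] lookup.
toy_graph = SimpleNamespace(nodes={"a": {"pos": (0, 0)}, "b": {"pos": (3, 4)}})
distance = euclidean_dist(toy_graph, "a", "b")
```

In A*, a value like this is added to the path cost so far to form the priority of a frontier entry.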

Exercises

The following exercises will require you to implement several kinds of bidirectional searches. The benefits of these algorithms over uninformed or unidirectional search are more clearly seen on larger graphs. As such, during grading, we will evaluate your performance on the map of Romania included in this assignment.

For these exercises, we recommend you take a look at the resources mentioned earlier.

Exercise 1: Bidirectional uniform-cost search

[20 points]

Implement bidirectional uniform-cost search. Remember that this requires starting your search at both the start and end states.

bidirectional_ucs() should return the path from the start node to the goal node (as a list of nodes).

Notes:

You need to include start and goal in the path. Make sure the path returned is from start to goal and not in the reverse order.

If your start and goal are the same then just return [].

The above are just to keep your results consistent with our test cases.

You can access all the neighbors of a given node by calling graph[node], or graph.neighbors(node) ONLY.

You can access the weight of an edge using: graph.get_edge_weight(node_1, node_2). Not using this method will result in your explored nodes count being higher than it should be.

You are not allowed to maintain a cache of the neighbors for any node. You need to use the above mentioned methods to get the neighbors and corresponding weights.

We will provide some margin of error in grading the size of your 'Explored' set, but it should be close to the results provided by our reference implementation.
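One textbook formulation of bidirectional uniform-cost search is sketched below on a plain weighted dictionary. It is not the course's reference implementation: the names are hypothetical, positive edge weights are assumed, and its explored counts may differ from what the autograder expects. The key points are the stopping rule (the two frontier minimums together can no longer beat the best complete path) and returning the path in start-to-goal order.

```python
import heapq
import itertools
import math


def bidirectional_dijkstra(adjacency, start, goal):
    """Bidirectional uniform-cost search on node -> {neighbor: weight}."""
    if start == goal:
        return []
    counter = itertools.count()
    # Index 0 searches forward from start, index 1 backward from goal.
    dist = [{start: 0}, {goal: 0}]
    parent = [{start: None}, {goal: None}]
    heaps = [[(0, next(counter), start)], [(0, next(counter), goal)]]
    done = [set(), set()]
    best_cost, meeting = math.inf, None

    while heaps[0] and heaps[1]:
        # Stop once the two frontier minimums cannot beat the best full path.
        if heaps[0][0][0] + heaps[1][0][0] >= best_cost:
            break
        side = 0 if heaps[0][0][0] <= heaps[1][0][0] else 1
        other = 1 - side
        cost, _, node = heapq.heappop(heaps[side])
        if node in done[side]:
            continue  # stale duplicate entry
        done[side].add(node)
        for neighbor, weight in adjacency[node].items():
            new_cost = cost + weight
            if new_cost < dist[side].get(neighbor, math.inf):
                dist[side][neighbor] = new_cost
                parent[side][neighbor] = node
                heapq.heappush(heaps[side], (new_cost, next(counter), neighbor))
            # A node reached by both searches yields a candidate full path.
            if neighbor in dist[other]:
                total = dist[side][neighbor] + dist[other][neighbor]
                if total < best_cost:
                    best_cost, meeting = total, neighbor

    if meeting is None:
        return None
    # Stitch start -> meeting (forward parents) to meeting -> goal (backward).
    path, node = [], meeting
    while node is not None:
        path.append(node)
        node = parent[0][node]
    path.reverse()
    node = parent[1][meeting]
    while node is not None:
        path.append(node)
        node = parent[1][node]
    return path


chain = {
    "a": {"b": 1, "d": 10},
    "b": {"a": 1, "c": 1},
    "c": {"b": 1, "d": 1},
    "d": {"c": 1, "a": 10},
}
both_ways = bidirectional_dijkstra(chain, "a", "d")
```

Note that the first time the two searches touch (here through the expensive direct edge) is not necessarily the cheapest connection, which is why the loop continues until the stopping rule holds.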

Exercise 2: Bidirectional A* search

[29 points]

Implement bidirectional A* search. Remember that you need to calculate a heuristic for both the start-to-goal search and the goal-to-start search.

To test this function, as well as using the provided tests, you can compare the path computed by bidirectional A* to bidirectional UCS search above. bidirectional_a_star() should return the path from the start node to the goal node, as a list of nodes.

Notes:

You need to include start and goal in the path.

If your start and goal are the same then just return [].

The above are just to keep your results consistent with our test cases.

You can access all the neighbors of a given node by calling graph[node], or graph.neighbors(node) ONLY.

You can access the weight of an edge using: graph.get_edge_weight(node_1, node_2). Not using this method will result in your explored nodes count being higher than it should be.

You are not allowed to maintain a cache of the neighbors for any node. You need to use the above mentioned methods to get the neighbors and corresponding weights.

You can access the (x, y) position of a node using: graph.nodes[n]['pos']. You will need this for calculating the heuristic distance.

We will provide some margin of error in grading the size of your 'Explored' set, but it should be close to the results provided by our reference implementation.

Exercise 3: Tridirectional UCS search

[12 points]

Implement tridirectional search in the naive way: starting from each goal node, perform a uniform-cost search and keep expanding until two of the three searches meet. This should be one continuous path that connects all three nodes.

For example, suppose we have goal nodes [a,b,c]. Then what we want you to do is to start at node a and expand like in a normal search. However, notice that you will be searching for both nodes b and c during this search and a similar search will start from nodes b and c. Finally, please note that this is a problem that can be accomplished without using 6 frontiers, which is why we stress that this is not the same as 3 bi-directional searches.

tridirectional_search() should return a path between all three nodes. You can return the path in any order, e.g., 1->2->3 is equivalent to 3->2->1. You may also want to look at the Tri-city search challenge question on Canvas.
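Once the three pairwise paths are known, one way to form a single continuous route is to drop the most expensive leg and join the other two at the goal they share. The helper below is a hypothetical sketch of that final step only: it assumes three distinct goals, takes the pairwise paths and their costs as given, and does not remove nodes that the two legs might have in common.

```python
def join_two_cheapest(legs):
    """Join the two cheapest of three pairwise legs into one route.

    `legs` is a list of (cost, node_list) pairs, one per pair of goals.
    """
    # Keep the two cheapest legs; the most expensive one is not needed.
    (_, first), (_, second) = sorted(legs, key=lambda leg: leg[0])[:2]
    # Orient both legs so the goal they share sits in the middle.
    shared = ({first[0], first[-1]} & {second[0], second[-1]}).pop()
    if first[-1] != shared:
        first = first[::-1]
    if second[0] != shared:
        second = second[::-1]
    # Skip the duplicated shared goal when concatenating.
    return first + second[1:]


legs = [
    (3, ["a", "x", "b"]),  # a to b
    (2, ["b", "c"]),       # b to c
    (9, ["c", "y", "a"]),  # c to a
]
route_through_all = join_two_cheapest(legs)
```

Either orientation of the result is acceptable under the rule that 1->2->3 and 3->2->1 are equivalent.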

Notes:

You need to include start and goal in the path.

If all three nodes are the same then just return [].

If there are 2 identical goals (i.e. a,b,b) then return the path [a...b] (i.e. just the path from a to b).

The above are just to keep your results consistent with our test cases.

You can access all the neighbors of a given node by calling graph[node], or graph.neighbors(node) ONLY.

You can access the weight of an edge using: graph.get_edge_weight(node_1, node_2). Not using this method will result in your explored nodes count being higher than it should be.

You are not allowed to maintain a cache of the neighbors for any node. You need to use the above mentioned methods to get the neighbors and corresponding weights.

We will provide some margin of error in grading the size of your 'Explored' set, but it should be close to the results provided by our reference implementation.

Exercise 4: Upgraded Tridirectional search

[8 points]

This is the heart of the assignment. Implement tridirectional search in such a way as to consistently improve on the performance of your previous implementation. This means consistently exploring fewer nodes during your search in order to reduce runtime. Keep in mind, we are not performing 3 bidirectional A* searches. We are searching from each of the goals towards the other two goals, in the direction that seems most promising.

The specifics are up to you, but we have a few suggestions:

Tridirectional A*
choosing landmarks and pre-computing reach values
ATL (A*, landmarks, and triangle-inequality)
shortcuts (skipping nodes with low reach values)

tridirectional_upgraded() should return a path between all three nodes.

Notes:

You need to include start and goal in the path.

If all three nodes are the same then just return [].

If there are 2 identical goals (i.e. a,b,b) then return the path [a...b] (i.e. just the path from a to b).

The above are just to keep your results consistent with our test cases.

You can access all the neighbors of a given node by calling graph[node], or graph.neighbors(node) ONLY.

You can access the weight of an edge using: graph.get_edge_weight(node_1, node_2). Not using this method will result in your explored nodes count being higher than it should be.

You are not allowed to maintain a cache of the neighbors for any node. You need to use the above mentioned methods to get the neighbors and corresponding weights.

You can access the (x, y) position of a node using: graph.nodes[n]['pos']. You will need this for calculating the heuristic distance.

We will provide some margin of error in grading the size of your 'Explored' set, but it should be close to the results provided by our reference implementation.

Final Task: Return your name

[1 point]

A simple task to wind down the assignment. Return your name from the function aptly called return_your_name().

The Race!

Here's your chance to show us your best stuff. This part is mandatory if you want to compete in the race for extra credit. Implement custom_search() using whatever strategy you like. More details will be posted soon on Piazza.

Bonus points are added to the grade for this assignment, not to your overall grade.

The Race! will be based on Atlanta Pickle data.


