|
1 | | -Collections of around 1 million granules or more will need to be triggered from EC2, as they will surpass the API gateway timeout limit if ran from gapConfig API. |
| 1 | +Collections of around 1 million granules or more will need to be triggered from EC2, as they will surpass the API gateway timeout limit if ran from gapConfig API. |
2 | 2 |
|
3 | | -There are two functions here, each with different uses: |
| 3 | +There are two folders here, each with different uses: |
4 | 4 |
|
5 | | -invoke_gap_config.sh is used for single collections that are greater than ~1 million granules |
| 5 | +In the invoke_single_collection directory, the invoke_gap_config.sh script will run for a single collection. In the bulk_invoke directory, the lambda_bulk_invoker.py script will run for a list of collections. |
6 | 6 |
|
7 | | -To run: |
8 | 7 |
|
9 | | -1. Launch or use an existing EC2 instance in the same VPC as gapConfig API. |
10 | | -2. Prepare the input file and script provided in this folder. The event.json file needs to be modified to run for your specified collection. |
11 | | -3. Run the script: './invoke_gap_config.sh' |
12 | | -4. Check the response: 'cat response.json' |
| 8 | +To run for a single collection: |
13 | 9 |
|
14 | | -lambda_bulk_invoker.py is used for larger lists of collections and will process them sequentially. |
| 10 | +- Launch or use an existing EC2 instance in the same VPC as gapConfig API. |
| 11 | +- Prepare the input file and script provided in the invoke_single_collection folder. The event.json file needs to be modified to run for your specified collection. |
| 12 | +Usage: './invoke_gap_config.sh' |
| 13 | +Check the response: 'cat response.json' |
15 | 14 |
|
16 | | -To run: |
17 | | -1. Create a collections.csv with first column collection ID and second column version. Third column for tolerance is optional |
18 | | -2. The lambda name for gapConfig and the csv file are specified as command line arguments. |
19 | | -2. EXAMPLE RUN: python3 lambda_bulk_invoker.py gapConfigLambdaName collections.csv |
| 15 | + |
| 16 | +lambda_bulk_invoker.py is used for larger lists of collections and will process them sequentially. |
| 17 | + |
| 18 | +To run a list of collections: |
| 19 | + |
| 20 | +- Create lambda_bulk_invoker.py on the EC2 instance. Paste the code from this repository into that file. |
| 21 | +- The EC2 Instance should have sqs:GetQueueUrl and sqs:GetQueueAttributes permissions for the gapDetectionIngestQueue |
| 22 | +- The lambda name for gapConfig, the csv file, and the queue name are specified as command line arguments. |
| 23 | +Usage: python3 lambda_queue_batch_processor.py <lambda_function_name> <csv_file> <queue_name> |
| 24 | +Example: python3 lambda_queue_batch_processor.py gesdisc-cumulus-uat-gapConfig collections.csv gesdisc-cumulus-uat-gapDetectionIngestQueue |
0 commit comments