I am a docent at the Torrey Pines State Natural Reserve and when I saw on our torreypines.org website bird data spanning 15 years sitting in PDF’s I knew I had to do something to visualize it and make it more useful.
How It Works
The data started off in PDFs: One PDF for every month since 2006. First I had to create a couple of scripts in NodeJS to read the PDFs and output the data into a useable format. The first script reads the text from the PDF files and places them into a txt file. The second script reads that data, cleans it, organizes it, and parses it into JSON.That JSON file is stored in an S3 bucket.
The app (app.js) uses an API call to retrieve that data, where it is then further organized (dates are formatted, birds are sorted). The data is then formatted to specifically fit the requirements for the Plotly graph I chose.