Skip to content

Latest commit

 

History

History
98 lines (70 loc) · 2.68 KB

README.md

File metadata and controls

98 lines (70 loc) · 2.68 KB

lambda-yfinance

I was super bothered about Google Sheets GOOGLEFINANCE() function being broken for quite a few tickers I want to track in my spreadsheet. This gave me a good reason to have my own AWS lambda powered stocks price API using the Python yfinance library :-).

As explained here, I have patched the yfinance lib to achieve better run times.

Testing Notes

When running on AWS lambda, the yfinance history download was quite slow if querying a big enough number for tickers (e.g. ~16). It's a combination of:

  • a bug in the multitasking setup done in the yfinance lib, limiting the number of parallel downloads.
  • No timezone cache without setting up some storage. This results in an extra HTTP request to query each ticker's TZ. The easiest way I can think to fix this is to set up an extra cache layer, containing the TZ of all tickers you care about.
  • Individual HTTP request taking a bit longer than expected.

There are patches in this repo to address that. I should eventually try to upstream them ...

Some benchmarks:

  • running the function on my machine (8 cores MPB M1) takes less than 1s.
  • running on a 4 core VM takes around 4s.
  • running on AWS lambda is around 8 to 12s.

Features

  • Support for both JSON and CSV output modes.
  • Can request multiple tickers in a single API invocation.

See the TLDR to grok the API syntax.

TLDR

Run locally

Run locally for test purposes.

Start server in one terminal:

$ ./main.py

Send request in another terminal:

$ curl -s "localhost:5000/price?tickers=MSFT,AMZN,GOOG&csv=1" | jq -r .body
MSFT,338.115,1.15
AMZN,142.585,3.15
GOOG,137.72,0.38
~ »

Setup on AWS

  1. Setup an AWS lambda with this code - both the function and the associated layer.

NOTES about the libraries layer:

  • I have included a make target to create the Python layer package on macOS using lima. Make sure the ARCH used by pip3 inside lima matches the architecture configured on AWS! If you are limited to only build on a specific architecture, you can use QEMU user emulation + podman to build other archs. See the Makefile and https://wiki.debian.org/QemuUserEmulation
  1. Create an API gateway as trigger.
  2. Test the API with:
curl "https://API-ID.execute-api.eu-central-1.amazonaws.com/default/LAMBDA-NAME?tickers=VWRA.L&csv=1"
VWRA.L,110.38,0.3

Using on gsheet

Get the price:

=index(importdata("https://API-ID.execute-api.eu-central-1.amazonaws.com/default/LAMBDA-NAME?tickers=VWRA.L&csv=1"), 1, 2)

Get the daily change percentage:

=index(importdata("https://API-ID.execute-api.eu-central-1.amazonaws.com/default/LAMBDA-NAME?tickers=VWRA.L&csv=1"), 1, 3)