How to upload File in FastAPI, then to Amazon S3 and finally process it?

As per FastAPI’s documentation, UploadFile uses Python’s SpooledTemporaryFile, a “file stored in memory up to a maximum size limit, and after passing this limit it will be stored in disk.”. It “operates exactly as TemporaryFile”, which “is destroyed as soon as it is closed (including an implicit close when the object is garbage collected)”. It seems that, once the contents of the file have been read by boto3, the file gets closed, which, in turn, causes the file to be deleted.

Option 1

Read the file contents—using contents = file.file.read(), as shown in this answer (or for async reading/writing see here)—and then upload these bytes to your server, instead of a file object (if that is supported by the server).

Update

I should also mention that there is an
internal “cursor” (or “file pointer”) denoting the position from
which the file contents will be read (or written). When calling
read() reads all the way to the end of the buffer, leaving zero
bytes beyond the cursor. Thus, one could also use the seek() method
to set the current position of the cursor to 0 (i.e., rewinding the
cursor to the start of the file); thus, allowing you to likely pass the file object (i.e., upload_fileobj(file.file) see this answer) after reading the file contents (i.e., contents = file.file.read()). As per FastAPI’s documentation:

seek(offset): Goes to the byte position offset (int) in the file.

  • E.g., await myfile.seek(0) would go to the start of the file.
  • This is especially useful if you run await myfile.read() once and then need to read the contents again.

Option 2

Copy the contents of the file into a NamedTemporaryFile, which, unlike TemporaryFile, “has a visible name in the file system” that “can be used to open the file” (that name can be retrieved from the name attribute ). Additionally, it can remain accesible after it is closed, by setting the delete argument to False; thus, allowing the file to reopen when needed. Once you are done with it, you can delete it using the os.remove() or os.unlink() method. Below is a working example (inspired by this answer):

import uvicorn
from fastapi import FastAPI, File, UploadFile
from tempfile import NamedTemporaryFile
import os

app = FastAPI()

@app.post("/upload")
def upload_file(file: UploadFile = File(...)):
    temp = NamedTemporaryFile(delete=False)
    try:
        try:
            contents = file.file.read()
            with temp as f:
                f.write(contents);
        except Exception:
            return {"message": "There was an error uploading the file"}
        finally:
            file.file.close()
            
        # Here, upload the file to your S3 service using `temp.name`
        
    except Exception:
        return {"message": "There was an error processing the file"}
    finally:
        #temp.close()  # the `with` statement above takes care of closing the file
        os.remove(temp.name)  # Delete temp file
    
    print(contents)  # Handle file contents as desired
    return {"filename": file.filename}

Option 3

You could even keep the bytes in an in-memory buffer BytesIO, use it to upload the contents to the S3 bucket, and finally close it (“The buffer is discarded when the close() method is called.”). Remember to call seek(0) method to reset the cursor back to the beginning of the file after you finish writing to the BytesIO stream.

contents = file.file.read()
temp_file = io.BytesIO()
temp_file.write(contents)
temp_file.seek(0)
s3_client.upload_fileobj(temp_file, "local", "myfile.txt")
temp_file.close()

Leave a Comment

tech