I have a very big zip file in my google drive which contain several subfloders. Now, I’d like to extract only a few subfolders (not all folder into colab). Is there any way for this?
For instance, suppose the zip file name is "MyBigFile.zip" which contain "folder1", "folder2", "folder3", "folder4", and "folder5". I only want to import and extract "folder1",and "folder4" into my google colab (and better import only 200 images from it only). How is it possible? any suggestion?
*if this is related: each folder 1-5 contains around 50000 .png files
After some searching I found something. You can use the
zipfile module in google collab too.
from zipfile import ZipFile from google.colab import drive drive.mount('/content/drive/') zipfile = ZipFile("quote.zip") def extract(folderName, numberOfFiles): files = list(filter(lambda x: x.startswith(folderName), zipfile.namelist()))[:numberOfFiles] for file in files: zipfile.extract(file, 'extractedFolder') extract("folder1/", 200) extract("folder4/", 100) zipfile.close()