jboothomasSQL query S3 objects with Apache drillIn this article I will cover how to setup and configure apache drill to run SQL queries against parquet files stored in an S3 bucket.May 10
jboothomasHive-metastore on K8S with S3 external tableIn this blog I will cover how to setup Hive metastore on K8S and then leverage external S3 datasets.Aug 10, 20231
jboothomasLakefs.io on kubernetes with PureStorage FlashBladeAs per their website LakeFS brings software engineering best practices and applies them to data engineering. Let’s go over how to get up…Mar 1Mar 1
Joshua RobinsonListing 67 Billion Objects in 1 BucketIn this post, I look at what it takes to list all keys in a single bucket with 67 billion objects and build a simple list benchmark program…Dec 8, 2020Dec 8, 2020
jboothomasClickhouse and Flashblade S3In this quick how to I will cover the steps to leverage Pure Storage Flashblade S3 storage with a Clickhouse installation.Aug 22, 2023Aug 22, 2023
jboothomasSQL query S3 objects with Apache drillIn this article I will cover how to setup and configure apache drill to run SQL queries against parquet files stored in an S3 bucket.May 10
jboothomasHive-metastore on K8S with S3 external tableIn this blog I will cover how to setup Hive metastore on K8S and then leverage external S3 datasets.Aug 10, 20231
jboothomasLakefs.io on kubernetes with PureStorage FlashBladeAs per their website LakeFS brings software engineering best practices and applies them to data engineering. Let’s go over how to get up…Mar 1
Joshua RobinsonListing 67 Billion Objects in 1 BucketIn this post, I look at what it takes to list all keys in a single bucket with 67 billion objects and build a simple list benchmark program…Dec 8, 2020
jboothomasClickhouse and Flashblade S3In this quick how to I will cover the steps to leverage Pure Storage Flashblade S3 storage with a Clickhouse installation.Aug 22, 2023
jboothomasTrino S3 via hive-metastore integrationIn this blog I will go over how to use S3 storage on a Pure Storage Flashblade with Trino the fast distributed SQL query engine for big…Aug 10, 2023
jboothomasDremio S3 and NFS integrationIn this blog I will go over how you can use fast NFS and S3 from Pure Storage to power your Dremio K8S deployments.Aug 10, 2023
Joshua RobinsonImproving Python S3 Client Performance with RustReplacing Boto3 for Fun and ProfitMar 31, 20221