Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
This repository collects scenario-based questions that I have encountered during interviews. The questions are designed to simulate real-world scenarios and test your problem-solving and technical skills. By exploring these scenarios, you can gain insights into common interview topics and prepare yourself for similar challenges.
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Repository intended for setting up a Spark environment for developing data pipelines.
A Scala kernel for Jupyter
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Spark-based applications for big data analytics
ORM for Apache Spark and DataFrames schema manager
Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!
🏆 Spark4You Design patterns
Introduction to Spark Batch processing.
Tutorials on Big Data essentials: Hadoop, MapReduce, Spark.
The Internals of Spark SQL
Apache Spark Connect Client for Rust
The goal of this work is to explore the capabilities of distributed database architectures for handling complex datasets, in particular the "Monthly Account Balance Report" ("Relatório de Saldo Mensal da Conta"), which lists all monthly account balances of customers between Jan/2020 and Dec/2020.
Analysis of a large dataset of IPL data (through 2017) using PySpark.