You are here: ministryofjustice.github.io

ducklake.select
DuckLake simplifies lakehouses by using a standard SQL database for all metadata, instead of complex file-based systems, while still storing data in open formats like Parquet. This makes it more reliable, faster, and easier to manage.

lakefs.io
Explore data pipeline automation and boost business growth through enhanced data quality, efficiency, and scalability. Learn how to streamline data management.

www.dquach.com
[AI summary] The article discusses advancements in data engineering and AI, focusing on agentic frameworks, Open Table Formats, and tools like AWS SageMaker Lakehouse and Apache Iceberg.

jack-vanlightly.com
In the world of open table formats (Apache Iceberg, Delta Lake, Apache Hudi, Apache Paimon, etc.), an emerging trend is to provide interoperability between table formats by cross-publishing metadata. It allows a table to be written in table format X but read in format Y or Z. Cross-publishing is the idea of a table having:
* A primary table format that you write to.
* Equivalent metadata files of one or more secondary formats that allow the table to be read as if it were of that secondary format.