Dataaaaa !

Une plateforme pour les réunir toutes

Today
Logo

cocoindex - Data transformation for AI

CocoIndex is a data transformation framework for AI, ultra performant with incremental processing and built-in data lineage.

Logo

Deepnote - Deepnote is a data notebook for the AI era

Deepnote is a drop-in replacement for Jupyter with an AI-first design, sleek UI, new blocks, and native data integrations.

Logo

Supercharging the Developer Workflow for AI with Snowflake's Integrated Dev Environment

Snowflake enhances the integrated development environment for AI with tools like Workspaces, Git, VS Code, and AI features like Cortex Code and AISQL.

Logo

Data Without Limits to Fuel Your Enterprise AI

Snowflake introduces new tools for data migration with AI, interoperability, and governance, including SnowConvert AI, pg_lake, and enhancements to Snowflake Horizon Catalog.

Logo

Snowflake Intelligence: All Your Knowledge. One Trusted AI.

Snowflake Intelligence, an enterprise intelligence agent, enables deep analysis, verified answers, and enhanced security for all users.

Logo

Snowflake Brings More Intelligent, Governed AI to Enterprises

Snowflake introduces new capabilities to accelerate data preparation and modernize developer workflows, simplifying the use and deployment of AI at scale.

Meet the founders of dltHub: Matthaus Krzykowski, Marcin Rudolf, Adrian Brudaru, and Anna Hoffmann

dltHub develops a Python-native data platform to accelerate data pipelines, combining simplicity with enterprise-grade governance.

Meet the founders of Columnar: Ian Cook, David Li, and Matt Topol

Columnar, founded by Apache Arrow contributors, introduces Arrow-native ADBC drivers to enhance data connectivity for platforms like Snowflake and DuckDB.

Why we're making nao Free

nao offers a free version of its data IDE integrated with AI features, allowing to connect data warehouses and execute SQL.

Logo

Snowflake Intelligence (General availability)

Snowflake announces the general availability of Snowflake Intelligence, a powerful tool for analyzing structured and unstructured data.

Logo

Snowflake Machine Learning Experiments (Preview)

Snowflake introduces machine learning experiments to track and evaluate models through Snowsight, enabling comparison of collected data to select the best model.

Logo

Snowflake-managed MCP server (General availability)

Snowflake announces the general availability of its managed MCP server, providing standardized integration and robust governance for AI agents.

Logo

Cortex Agents (General availability)

Snowflake announces the general availability of Cortex Agents, a tool for orchestrating structured and unstructured data using LLMs and key components like planning and analysis.

Yesterday
Logo

Real-Time Text-to-SQL Behind Snowflake Intelligence

Snowflake Intelligence leverages real-time text-to-SQL processing to enhance query efficiency.

Logo

Optimizing Query Execution in Cortex AISQL

Optimizing query execution in Cortex AISQL using advanced techniques to enhance performance.

Logo

Chat with your Snowflake Data from Microsoft Teams

Integration of Snowflake Cortex with Microsoft Teams to interact with data via a bot, using AI agents and semantic views.

Logo

BigQuery : The Data Engineering Agent is now in preview

BigQuery introduces a Data Engineering Agent in preview, automating complex tasks.

Apache Arrow's Final Frontier: Replacing Outdated Database Drivers

Apache Arrow aims to replace outdated database drivers, enhancing interoperability and performance of modern data systems.

Logo

LLM Client, Server API and UI

Lightweight tool to access multiple LLMs, with multi-provider support, OpenAI-compatible API, and UI interface. Features include cost analysis, configuration management, and Docker support.

Logo

Faster root cause for slow traces with ClickStack Event Deltas

ClickStack Event Deltas speeds up root cause analysis for slow traces by automatically comparing attributes of fast and slow traces, leveraging ClickHouse for high-performance observability.

Logo

Global weather data from flying airplanes

Using ClickHouse to analyze weather data from airplane telemetry, leveraging color space conversion functions and trigonometric calculations.

Logo

marimo: A reactive notebook for Python

marimo is a reactive Python notebook for running reproducible experiments, querying with SQL, executing as a script, deploying as an app, and versioning with git.

Logo

FlinkSketch: Democratizing the Benefits of Sketches for the Flink Community

FlinkSketch is a library of sketching algorithms for Flink, enabling various streaming analytics capabilities through efficient algorithms.

"You Don't Need Kafka, Just Use Postgres" Considered Harmful

Technical comparison between Kafka and Postgres for real-time data processing, highlighting Kafka's advantages for event streaming.

Sunday, November 2
Logo

shed: CLI to manage your SQL database schemas and migrations

CLI tool for database schema management using SQLModel and Alembic, with JSON-schema export for Pydantic.

Saturday, November 1

FastMCP 2.13: Storage, Security, and Scale

FastMCP 2.13 introduces persistent storage, robust authentication, and performance optimizations for production MCP servers.

Graph RAG vs SQL RAG

Comparison of RAG performance on graph and SQL databases using a Formula 1 results dataset.

Friday, October 31
Logo

Turn Data Into Intelligence In Your Everyday Workflows

Snowflake Cortex Agents simplifies AI-powered data interactions within Microsoft 365 Copilot and Teams, enabling analysis and insight generation from structured and unstructured data.

Logo

BigQuery : October 31, 2025

Increased row capacity for pivot tables in Connected Sheets from 100,000 to 200,000 rows.

Logo

Optimize Storage Costs and Simplify Compliance with Storage Lifecycle Policies, Now Generally Available

Snowflake announces general availability of storage lifecycle policies to optimize costs and simplify compliance.

Logo

Cool stuff Google Cloud customers built, Oct. edition: Research agents, agentic “teams,” decentralized contracts & more

Showcase of Google Cloud customer projects: AI research agents for Deutsche Bank, migration to CloudSQL for Rent the Runway, and AI assistants for Seattle Children's and FOX Sports.

Logo

Organization-level findings in the Trust Center

Snowflake announces security features in Trust Center to analyze violations at the organization level.

Thursday, October 30

Why You’ll Never Have a FAANG Data Infrastructure and That’s the Point | Part 1

Analysis of FAANG data infrastructures, highlighting their design philosophies rather than tools, and proposing a hybrid approach for non-FAANG organizations.

Machine-learning predictive autoscaling for Flink

Grab uses machine learning for predictive autoscaling of Flink applications, optimizing CPU usage and reducing costs.

Exploring how PostgreSQL 18 conquered time with temporal constraints

PostgreSQL 18 introduces temporal constraints with WITHOUT OVERLAPS and PERIOD to enhance temporal data integrity.

Logo

BigQuery : Apache Iceberg REST catalog in BigLake metastore now generally available

The Apache Iceberg REST catalog in BigLake metastore is now generally available with new features.

Logo

4 Senior Data Engineers Answer 10 Top Reddit Questions

Four senior data engineers address Reddit's top questions on fundamentals, data quality, and tech choices.

Logo

Dagster 1.12: Monster Mash

Dagster 1.12 enhances user experience with a streamlined UI, GA Components, simplified deployments, and orchestration improvements like FreshnessPolicies.

Logo

Improve logs compression with log clustering

This post demonstrates using log clustering with Drain3 and ClickHouse UDFs to automatically structure raw application logs, achieving nearly 50x compression.

Logo

Announcing Expanded Integration Between Oracle Database and the Snowflake AI Data Cloud

Snowflake announces expanded integration with Oracle Database to enhance data connectivity and analytics.

Logo

Getting started with an ELT pipeline

Explores ELT pipelines and their scalable design, highlighting dbt's role in simplifying collaboration and data transformation.

Logo

Data transformation in the data warehouse

This post explores the importance of data transformation in data warehouses and how dbt facilitates the creation of reliable, scalable data pipelines.

Logo

Snowflake Data Clean Rooms updates

Snowflake Data Clean Rooms updates with UI enhancements, API improvements, and better error messaging.

Wednesday, October 29
Logo

Apache Polaris 1.2 (incubating): Enhanced Governance & Connectivity

Apache Polaris 1.2 enhances governance and connectivity, integrating better with Snowflake.

dbt Labs Open Sources MetricFlow: An Independent Schema for Data Interoperability

dbt Labs open sources MetricFlow, an independent schema for data interoperability, enhancing consistency and collaboration in data pipelines.

The Case Against PGVector

Analysis of operational challenges and limitations of pgvector in production, highlighting indexing issues, real-time search problems, and filtering complexities.

Logo

BigQuery : Groupement de réservations pour la priorisation des slots inactifs

BigQuery now allows grouping reservations to prioritize idle slot sharing within a group, providing better control over slot allocation for high-priority workloads.

Logo

Snowflake Native Apps: Shareback

Snowflake Native Apps can now securely request permission from consumers to share data back with the provider.

Introducing dbc

Introduction to dbc, a command-line tool that manages connections and executes SQL queries.

Announcing Columnar

Launch of Columnar, a new open-source platform for data management.

Showing 1 to 50 of 1368 articles
...