One platform to bring them all together
CocoIndex is a high-performance data transformation framework for AI, with incremental processing and built-in data lineage.
Deepnote is a drop-in replacement for Jupyter with an AI-first design, sleek UI, new blocks, and native data integrations.
Snowflake enhances its integrated development environment for AI with Workspaces, Git and VS Code integrations, and AI features such as Cortex Code and AISQL.
Snowflake introduces new tools for AI-assisted data migration, interoperability, and governance, including SnowConvert AI, pg_lake, and enhancements to Snowflake Horizon Catalog.
Snowflake Intelligence, an enterprise intelligence agent, enables deep analysis, verified answers, and enhanced security for all users.
Snowflake introduces new capabilities to accelerate data preparation and modernize developer workflows, simplifying the use and deployment of AI at scale.
dltHub develops a Python-native data platform to accelerate data pipelines, combining simplicity with enterprise-grade governance.
Columnar, founded by Apache Arrow contributors, introduces Arrow-native ADBC drivers to enhance data connectivity for platforms like Snowflake and DuckDB.
nao offers a free version of its data IDE with integrated AI features, letting users connect data warehouses and run SQL.
Snowflake announces the general availability of Snowflake Intelligence, a powerful tool for analyzing structured and unstructured data.
Snowflake introduces machine learning experiments to track and evaluate models through Snowsight, enabling comparison of logged results to select the best model.
Snowflake announces the general availability of its managed MCP server, providing standardized integration and robust governance for AI agents.
Snowflake announces the general availability of Cortex Agents, which use LLMs to orchestrate work across structured and unstructured data, with components for planning and analysis.
Snowflake Intelligence leverages real-time text-to-SQL processing to enhance query efficiency.
Optimizing query execution in Cortex AISQL using advanced techniques to enhance performance.
Integration of Snowflake Cortex with Microsoft Teams to interact with data via a bot, using AI agents and semantic views.
BigQuery introduces a Data Engineering Agent in preview, automating complex tasks.
Apache Arrow aims to replace outdated database drivers, enhancing interoperability and performance of modern data systems.
Lightweight tool to access multiple LLMs, with multi-provider support, an OpenAI-compatible API, and a UI. Features include cost analysis, configuration management, and Docker support.
ClickStack Event Deltas speeds up root cause analysis for slow traces by automatically comparing attributes of fast and slow traces, leveraging ClickHouse for high-performance observability.
Using ClickHouse to analyze weather data from airplane telemetry, leveraging color space conversion functions and trigonometric calculations.
marimo is a reactive Python notebook for running reproducible experiments, querying with SQL, executing as a script, deploying as an app, and versioning with git.
FlinkSketch is a library of sketching algorithms for Flink that enables efficient streaming analytics.
Technical comparison between Kafka and Postgres for real-time data processing, highlighting Kafka's advantages for event streaming.
CLI tool for database schema management using SQLModel and Alembic, with JSON-schema export for Pydantic.
FastMCP 2.13 introduces persistent storage, robust authentication, and performance optimizations for production MCP servers.
Comparison of RAG performance on graph and SQL databases using a Formula 1 results dataset.
Snowflake Cortex Agents simplifies AI-powered data interactions within Microsoft 365 Copilot and Teams, enabling analysis and insight generation from structured and unstructured data.
Increased row capacity for pivot tables in Connected Sheets from 100,000 to 200,000 rows.
Snowflake announces general availability of storage lifecycle policies to optimize costs and simplify compliance.
Showcase of Google Cloud customer projects: AI research agents for Deutsche Bank, migration to CloudSQL for Rent the Runway, and AI assistants for Seattle Children's and FOX Sports.
Snowflake announces security features in Trust Center to analyze violations at the organization level.
Analysis of FAANG data infrastructures, highlighting their design philosophies rather than tools, and proposing a hybrid approach for non-FAANG organizations.
Grab uses machine learning for predictive autoscaling of Flink applications, optimizing CPU usage and reducing costs.
PostgreSQL 18 introduces temporal constraints with WITHOUT OVERLAPS and PERIOD to enhance temporal data integrity.
The Apache Iceberg REST catalog in BigLake metastore is now generally available with new features.
Four senior data engineers address Reddit's top questions on fundamentals, data quality, and tech choices.
Dagster 1.12 enhances user experience with a streamlined UI, GA Components, simplified deployments, and orchestration improvements like FreshnessPolicies.
This post demonstrates using log clustering with Drain3 and ClickHouse UDFs to automatically structure raw application logs, achieving nearly 50x compression.
Snowflake announces expanded integration with Oracle Database to enhance data connectivity and analytics.
Explores ELT pipelines and their scalable design, highlighting dbt's role in simplifying collaboration and data transformation.
This post explores the importance of data transformation in data warehouses and how dbt facilitates the creation of reliable, scalable data pipelines.
Snowflake updates Data Clean Rooms with UI enhancements, API improvements, and better error messaging.
Apache Polaris 1.2 enhances governance and connectivity, integrating better with Snowflake.
dbt Labs open-sources MetricFlow, an independent schema for data interoperability, enhancing consistency and collaboration in data pipelines.
Analysis of operational challenges and limitations of pgvector in production, highlighting indexing issues, real-time search problems, and filtering complexities.
BigQuery now allows grouping reservations to prioritize idle slot sharing within a group, providing better control over slot allocation for high-priority workloads.
Snowflake Native Apps can now securely request permission from consumers to share data back with the provider.
Introduction to dbc, a command-line tool for installing and managing ADBC drivers.
Launch of Columnar, a company founded by Apache Arrow contributors to improve data connectivity with Arrow-native ADBC drivers.