Skip to content

Data Analytics

Join our forum to discuss your ideas with Aiven community or check out our public roadmap.

Data Analytics

Categories

JUMP TO ANOTHER FORUM

37 results found

  1. As a business owner,
    I would like to have the ability to set up a replication of my Aiven for ClickHouse service, and all the object storage attached to it, to a secondary region,
    so that in the case of needing disaster recovery from the primary region going down, we could fail our workload traffic over to the secondary region.

    The replicated secondary region does not need to be writable until the primary is unavailable, but we would like to read from it to validate consistency. As a customer we'll handle the failover mechanism.

    7 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  2. As developer against ClickHouse,
    I want to utilise the S3Queue engine available in v23.8
    so that I can watch an S3 bucket, and auto consume new files as they appear, via a materialized view, as soon as they arrive.

    This will enable me to keep my analytics up to date with no need for additional engineering effort or streaming pipelines.

    6 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  3. As someone managing a Clickhouse instance that delivers my production workloads,
    I want to have better insight into what is causing slow queries against my data,
    so that I can get to the root cause and take corrective actions.
    In addition, it's important to have these metrics available for export to my monitoring system

    6 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  4. As a user of Aiven for ClickHouse on AWS, Microsoft Azure or Google Cloud:
    I would like to leverage my own object storage account with Tiered Storage for Aiven for ClickHouse (that I am already using BYOC on).

    4 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  5. As a developer using Kafka to stream messages to ClickHouse,
    I want to be sure that messages as delivered and ingested exactly once, as opposed to at-least-once, to ClickHouse to not have to deal with duplicated data that I have to deduplicate down the line. This removes both a cognitive and operational load from my data pipeline
    c.f. https://github.com/ClickHouse/clickhouse-kafka-connect

    The exactly-once semantics has been implemented in the newer version of the ClickHouse Kafka Connect Sink (https://github.com/ClickHouse/clickhouse-kafka-connect).

    4 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  6. As a customer of Aiven, I would like to pull data from many disparate sources into my ClickHouse data warehouse to provide my users best-in-class analytics and performance. Adding support for DeltaLake would open up new sources from which I can seamlessly consume from.

    4 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  7. I need to read and write Parquet format files to/from Azure Blob Storage.

    3 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  8. As a user sending Kafka messages to Aiven for ClickHouse,
    I want to be able to send those messages using the protobuf format

    3 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  9. As an Data Engineer
    I want to use 3rd party ETL tools to populate Clickhouse tables
    so that I can use clickhouse as my warehouse. Currently popular tools such as airbyte do not work when connected to Aiven clickhouse due to the fact that it creates internal state tables within clickhouse, and it fails given that tables are not allow to be created outside the context of the Aiven console. Customers are unable to use external tools that can self manage state and required the permission.

    2024-01-26 22:37:45 normalization > Code: 497. DB::Exception: avnadmin: Not enough privileges. To execute this…

    3 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  10. As a developer, as a DR requirement, I would like to be able to restore the service from its backup in cases where the service's hosting region is down.

    3 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  11. As an Aiven for ClickHouse user,
    I want to be able to use the query editor in the Aiven console in my ClickHouse services to run large queries without worrying of timeouts or bugs.

    3 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  12. As developer ina. company with a large amount of data stored in Iceberg,
    I want to have the ClickHouse Iceberg Table engine,
    so that I can effectively analyse the large amounts of data stored in iceberg, via Aiven for Clickhouse.

    3 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  13. As a developer with a current PostgreSQL instance,
    I would like to utilise the ClickHouse PostgreSQL Table engine,
    so that I can easily read from that external PostgreSQL instance and insert the data into my Aiven for ClickHouse instance.

    3 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  14. As a technologist within an organisation delivering a data platform via integration of multiple open source services,
    I want to have the ability to use wildcards in the kafkatopiclist for Aiven for ClickHouse,
    so that I can reduce the management overhead and reduce the time to value for new development.

    3 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  15. As someone running analytics with Clickhouse,
    I want to be able to create functions that read data from an external MongoDB instance, as well as create tables that span data in external MongoDB instances,
    so that I can enrich my analytics with data stored elsewhere, or easily migrate data from a MongoDB instance to my Aiven for Clickhouse instance.

    3 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  16. As a developer I want the Kafka - Clickhouse native integration to support Parquet format. This will simplify our set-up and allow us to remove a part of the architecture.

    2 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  17. As a user of both PostgreSQL (on-prem, on another cloud provider or at Aiven) and Aiven for ClickHouse,
    I want to be able to ingest my PostgreSQL data, not as a on time snapshot or a remote view of the data like the current integrations allow me to, but as tables created and stored in ClickHouse that pull updates from PostgreSQL regularly.

    2 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  18. As a user of Aiven for ClickHouse,
    I want to use the most optimized default settings possible. From the start I want to have the default JOIN set as the most performant possible and not the ClickHouse default one. This comes without me, the user, having to change any settings. However, I should be able to tweak those settings if needed in specific cases.
    See https://clickhouse.com/docs/en/engines/table-engines/special/join

    2 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  19. As as user of Aiven for ClickHouse,
    In some cases, I want to have an alternative to JOIN statements that is faster and less resource heavy. For this, I want to create a dictionary of key-attributes that can be used in queries.
    I also want to be able to work on my dictionary with all the related ClickHouse functions (c.f. https://clickhouse.com/docs/en/sql-reference/functions/ext-dict-functions)

    2 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  20. As a developer,
    I want to be able to extend the behavior of ClickHouse by creating lambda expressions that can utilize SQL constructs and functions that can be used like any in-built function in a query. I want to be able to do so with the command "CREATE FUNCTION"

    2 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
← Previous 1
  • Don't see your idea?