Skip to content

Data Analytics

Join our forum to discuss your ideas with Aiven community or check out our public roadmap.

Data Analytics

Categories

JUMP TO ANOTHER FORUM

39 results found

  1. As developer against ClickHouse,
    I want to utilise the S3Queue engine available in v23.11
    so that I can watch an S3 bucket, and auto consume new files as they appear, via a materialized view, as soon as they arrive.

    This will enable me to keep my analytics up to date with no need for additional engineering effort or streaming pipelines.

    7 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  2. As a business owner,
    I would like to have the ability to set up a replication of my Aiven for ClickHouse service, and all the object storage attached to it, to a secondary region,
    so that in the case of needing disaster recovery from the primary region going down, we could fail our workload traffic over to the secondary region.

    The replicated secondary region does not need to be writable until the primary is unavailable, but we would like to read from it to validate consistency. As a customer we'll handle the failover mechanism.

    7 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  3. As someone managing a Clickhouse instance that delivers my production workloads,
    I want to have better insight into what is causing slow queries against my data,
    so that I can get to the root cause and take corrective actions.
    In addition, it's important to have these metrics available for export to my monitoring system

    6 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  4. As a user sending Kafka messages to Aiven for ClickHouse,
    I want to be able to send those messages using the protobuf format

    5 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  5. As a developer using Kafka to stream messages to ClickHouse,
    I want to be sure that messages as delivered and ingested exactly once, as opposed to at-least-once, to ClickHouse to not have to deal with duplicated data that I have to deduplicate down the line. This removes both a cognitive and operational load from my data pipeline
    c.f. https://github.com/ClickHouse/clickhouse-kafka-connect

    The exactly-once semantics has been implemented in the newer version of the ClickHouse Kafka Connect Sink (https://github.com/ClickHouse/clickhouse-kafka-connect).

    5 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  6. As an Aiven user and Aiven for ClickHouse user,
    I want to be able to use Aiven for ClickHouse on Oracle Cloud with all the Aiven capabilities available on the other cloud providers, seamlessly.

    4 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  7. As a user of Aiven for ClickHouse on AWS, Microsoft Azure or Google Cloud:
    I would like to leverage my own object storage account with Tiered Storage for Aiven for ClickHouse (that I am already using BYOC on).

    4 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  8. As a developer, as a DR requirement, I would like to be able to restore the service from its backup in cases where the service's hosting region is down.

    4 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  9. As a customer of Aiven, I would like to pull data from many disparate sources into my ClickHouse data warehouse to provide my users best-in-class analytics and performance. Adding support for DeltaLake would open up new sources from which I can seamlessly consume from.

    4 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  10. I need to read and write Parquet format files to/from Azure Blob Storage.

    3 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  11. As an Data Engineer
    I want to use 3rd party ETL tools to populate Clickhouse tables
    so that I can use clickhouse as my warehouse. Currently popular tools such as airbyte do not work when connected to Aiven clickhouse due to the fact that it creates internal state tables within clickhouse, and it fails given that tables are not allow to be created outside the context of the Aiven console. Customers are unable to use external tools that can self manage state and required the permission.

    2024-01-26 22:37:45 normalization > Code: 497. DB::Exception: avnadmin: Not enough privileges. To execute this…

    3 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  12. As an Aiven for ClickHouse user,
    I want to be able to use the query editor in the Aiven console in my ClickHouse services to run large queries without worrying of timeouts or bugs.

    3 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  13. As developer ina. company with a large amount of data stored in Iceberg,
    I want to have the ClickHouse Iceberg Table engine,
    so that I can effectively analyse the large amounts of data stored in iceberg, via Aiven for Clickhouse.

    3 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  14. As a developer with a current PostgreSQL instance,
    I would like to utilise the ClickHouse PostgreSQL Table engine,
    so that I can easily read from that external PostgreSQL instance and insert the data into my Aiven for ClickHouse instance.

    3 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  15. As a technologist within an organisation delivering a data platform via integration of multiple open source services,
    I want to have the ability to use wildcards in the kafkatopiclist for Aiven for ClickHouse,
    so that I can reduce the management overhead and reduce the time to value for new development.

    3 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  16. As someone running analytics with Clickhouse,
    I want to be able to create functions that read data from an external MongoDB instance, as well as create tables that span data in external MongoDB instances,
    so that I can enrich my analytics with data stored elsewhere, or easily migrate data from a MongoDB instance to my Aiven for Clickhouse instance.

    3 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  17. As an Aiven for ClickHouse user,
    I want to be able to choose the backup to fork and restore from, and not only use the last backup available.

    2 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  18. As an Aiven for ClickHouse users,
    I want to decide what time (hours and minute) the Aiven backup service will run and save a backup of my data in order to schedule my workloads accordingly (ex. data ingestion, power off)

    2 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  19. As a user of both PostgreSQL (on-prem, on another cloud provider or at Aiven) and Aiven for ClickHouse,
    I want to be able to ingest my PostgreSQL data, not as a on time snapshot or a remote view of the data like the current integrations allow me to, but as tables created and stored in ClickHouse that pull updates from PostgreSQL regularly.

    2 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
  20. As a user of Aiven for ClickHouse,
    I want to use the most optimized default settings possible. From the start I want to have the default JOIN set as the most performant possible and not the ClickHouse default one. This comes without me, the user, having to change any settings. However, I should be able to tweak those settings if needed in specific cases.
    See https://clickhouse.com/docs/en/engines/table-engines/special/join

    2 votes

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)

    We’ll send you updates on this idea

    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
← Previous 1
  • Don't see your idea?