Data Analytics: New (91 ideas)

Manage ClickHouse integrations' credentials securely & easily

As a ClickHouse user, I want to be able to use table functions and engines that require credentials (e.g. a private remote S3 bucket or a remote Delta Lake table) without hassle or risk.
With named collections in Aiven for ClickHouse, you can set your integration credentials once and use them safely in all your remote queries.
Moreover, you can easily rotate credentials when needed: change them once in the Aiven console and the change applies to all your integrations.
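
As a rough illustration of the idea, upstream ClickHouse lets a named collection hold the credentials so that queries reference only the collection name. The sketch below only builds the SQL strings in Python. The collection name, bucket URL, and keys are hypothetical placeholders, and whether this DDL can be run directly on Aiven (rather than managed through the Aiven console) is an assumption.

```python
def create_named_collection_sql(name: str, params: dict[str, str]) -> str:
    """Build a CREATE NAMED COLLECTION statement (upstream ClickHouse syntax).

    Values are not escaped here; this is an illustration, not injection-safe code.
    """
    assignments = ", ".join(f"{key} = '{value}'" for key, value in params.items())
    return f"CREATE NAMED COLLECTION {name} AS {assignments}"


def s3_query_sql(collection: str, filename: str) -> str:
    """Query a file through the s3 table function using only the collection name."""
    return f"SELECT * FROM s3({collection}, filename = '{filename}')"


# Hypothetical placeholder values, for illustration only.
ddl = create_named_collection_sql(
    "my_s3_bucket",
    {
        "url": "https://example-bucket.s3.amazonaws.com/data/",
        "access_key_id": "EXAMPLE_KEY_ID",
        "secret_access_key": "EXAMPLE_SECRET",
    },
)
query = s3_query_sql("my_s3_bucket", "events.parquet")
print(ddl)
print(query)
```

The point of the design is that the secret appears once, in the collection definition, and never in the queries that use it.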

1 vote

Roadmapped · 0 comments · Aiven for ClickHouse®

External schema registry for Kafka

As a developer,
I want to be able to connect to an external Kafka cluster and use an external schema registry at the same time,
so that I can consume external messages encoded with an external schema (AvroConfluent, for example).

1 vote

Gathering Interest · 0 comments · Aiven for ClickHouse®

Restore from external snapshot

As a database administrator,
I want to restore from an external snapshot that isn't hosted on Aiven,
so that I can migrate data from certain OpenSearch and Elasticsearch clusters (Aiven and non-Aiven) to Aiven for OpenSearch.

3 votes

Roadmapped · 0 comments · Aiven for OpenSearch®

Support AzureBlobStorage engine for ClickHouse

I need to read and write Parquet format files to/from Azure Blob Storage.
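
For context, upstream ClickHouse exposes this through the azureBlobStorage table function, which is what the idea asks to have available. The sketch below builds a read query in Python. The account URL, container, path, and key are hypothetical placeholders, and the argument order follows the upstream form that takes a storage account URL with the account name and key passed separately.

```python
def azure_blob_select_sql(
    account_url: str,
    container: str,
    blob_path: str,
    account_name: str,
    account_key: str,
    fmt: str = "Parquet",
) -> str:
    """Build a SELECT over the upstream azureBlobStorage table function."""
    args = [account_url, container, blob_path, account_name, account_key, fmt]
    quoted = ", ".join(f"'{arg}'" for arg in args)
    return f"SELECT * FROM azureBlobStorage({quoted})"


# Hypothetical placeholder values, for illustration only.
sql = azure_blob_select_sql(
    "https://exampleaccount.blob.core.windows.net",
    "reports",
    "daily/*.parquet",
    "exampleaccount",
    "EXAMPLE_KEY",
)
print(sql)
```

Writing would use the same function as the target of an INSERT INTO TABLE FUNCTION statement.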

3 votes

Gathering Interest · 0 comments · Aiven for ClickHouse®

Make shard_indexing_pressure configurable

Dear Aiven Community,

As a DevOps Engineer,
I want to be able to configure the shard_indexing_pressure settings directly within the Aiven platform and through the Aiven Terraform provider,
so that I can better manage indexing loads on OpenSearch clusters, optimize performance during high data throughput, and prevent potential bottlenecks.

In addition, this configuration capability is essential for dynamically adjusting indexing pressure based on real-time data demands. Currently, trying to enable or modify shard_indexing_pressure settings results in a 403 error, indicating that the feature is not supported in Aiven's current OpenSearch offerings. Enabling this feature would allow users to set parameters like soft_limit, min_limit, and max_outstanding_requests to improve operational efficiency and resource utilization.
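
To make the request concrete, here is a minimal sketch of the cluster settings body involved, assuming the upstream OpenSearch setting names `shard_indexing_pressure.enabled` and `shard_indexing_pressure.enforced`. The helper name is hypothetical, and the finer-grained parameters mentioned above are left out; their full setting paths should be taken from the OpenSearch documentation.

```python
import json


def shard_indexing_pressure_settings(enabled: bool, enforced: bool) -> dict:
    """Build a body for PUT _cluster/settings (upstream OpenSearch setting names)."""
    return {
        "persistent": {
            "shard_indexing_pressure.enabled": enabled,
            # enforced=False is shadow mode: breaches are tracked, not rejected.
            "shard_indexing_pressure.enforced": enforced,
        }
    }


body = shard_indexing_pressure_settings(enabled=True, enforced=False)
print("PUT _cluster/settings")
print(json.dumps(body, indent=2))
```

On Aiven today this is the request that reportedly returns the 403 described above.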

This idea is important to me because it supports critical DevOps principles such as infrastructure as code and automation, which are essential for scalable, efficient deployment and management of large-scale search environments. By allowing users to configure these settings, Aiven can enhance its service offerings, making it a more attractive option for enterprises requiring robust, adaptable search infrastructure.

The benefits of enabling shard_indexing_pressure in Aiven for OpenSearch include:

Improved Performance Management: Users gain the ability to finely tune their cluster's response to varying indexing loads, enhancing overall system responsiveness and stability.
Greater Flexibility and Control: Configurability via both the Aiven console and Terraform promotes a more tailored approach to resource management, which is especially beneficial in environments with fluctuating data loads.
Enhanced Resource Efficiency: Proper management of shard indexing pressure prevents overutilization of shard resources, facilitating better distribution and utilization of resources across the cluster.

I urge the community to support this feature request, as it will significantly enhance the utility and manageability of Aiven's OpenSearch service for all users facing similar challenges.

Thank you for considering this enhancement.

Best regards,

- shard_indexing_pressure.txt 1 KB
9 votes

Roadmapped · 0 comments · Aiven for OpenSearch®

OpenSearch Large Shard Warning

As an operator of OpenSearch,
I want to be alerted when my shards fall outside the recommended best practice of 10-50 GB per shard,
so that I can avoid overly large shards causing performance problems for ingestion and queries.

In addition, please include instructions in the alert on how to split my shards if they do get too large.
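
The check being asked for can be sketched as follows. This is a minimal illustration, assuming shard rows shaped like the JSON output of `_cat/shards?format=json&bytes=b`; the function names and the sample data are hypothetical, and the 10-50 GB bounds come from the request above.

```python
GIB = 1024 ** 3
MIN_SHARD_BYTES = 10 * GIB
MAX_SHARD_BYTES = 50 * GIB


def classify_shard(size_bytes: int) -> str:
    """Classify a shard size against the 10-50 GB guideline."""
    if size_bytes > MAX_SHARD_BYTES:
        return "too_large"
    if size_bytes < MIN_SHARD_BYTES:
        return "too_small"
    return "ok"


def shards_needing_attention(shards: list[dict]) -> list[tuple[str, str, str]]:
    """Return (index, shard, status) for every shard outside the guideline."""
    flagged = []
    for shard in shards:
        store = shard.get("store")
        if store is None:  # unassigned shards report no store size
            continue
        status = classify_shard(int(store))
        if status != "ok":
            flagged.append((shard["index"], shard["shard"], status))
    return flagged


# Hypothetical sample rows, for illustration only.
sample = [
    {"index": "logs-2024.01", "shard": "0", "store": str(72 * GIB)},
    {"index": "logs-2024.02", "shard": "0", "store": str(30 * GIB)},
    {"index": "tiny", "shard": "0", "store": str(1 * GIB)},
    {"index": "new", "shard": "1", "store": None},
]
flagged = shards_needing_attention(sample)
print(flagged)
```

An alert built on this could point at the OpenSearch split index API for shards flagged as too large.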

1 vote

0 comments · Aiven for OpenSearch®

Roadmapped · Hoang Minh Vo responded

Thanks Jason. We can roadmap this. We will update the idea when we have a more concrete timeline.

Make index snapshots possible for faster data restoration

As an SRE or operations engineer,
I want to be able to create index snapshots manually or through a policy,
so that I can easily and quickly restore partial data after unintentional index corruption.
In addition, I understand this would require shared storage between cluster nodes, which means additional storage, so it would make sense to make it a paid feature and/or include it in the tiered storage project.
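
For reference, this is roughly what the workflow looks like with the upstream OpenSearch snapshot API. The sketch only builds the requests; the repository, snapshot, and index names are hypothetical, and it assumes a snapshot repository is already registered, which is the part that is not currently exposed to users.

```python
def create_snapshot_request(repository: str, snapshot: str, indices: list[str]):
    """Build the request that snapshots only the listed indices."""
    path = f"/_snapshot/{repository}/{snapshot}"
    body = {"indices": ",".join(indices), "include_global_state": False}
    return "PUT", path, body


def restore_snapshot_request(repository: str, snapshot: str, indices: list[str]):
    """Build the request that restores only the listed indices from a snapshot."""
    path = f"/_snapshot/{repository}/{snapshot}/_restore"
    body = {"indices": ",".join(indices), "include_global_state": False}
    return "POST", path, body


# Hypothetical names, for illustration only.
create = create_snapshot_request("my-repo", "before-reindex", ["orders", "customers"])
restore = restore_snapshot_request("my-repo", "before-reindex", ["orders"])
print(create)
print(restore)
```

Restoring a single index this way is what makes partial recovery faster than restoring a full service backup.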

5 votes

1 comment · Aiven for OpenSearch®

Gathering Interest · Hoang Minh Vo responded

Thanks for posting the idea; it is reasonable. I will update the status once we know more about our plans for it.

Support custom plugins in OpenSearch

As a developer
I want to use a custom plugin in OpenSearch
so that I can implement custom scoring or text analysis or use a community open source plugin.

3 votes

0 comments · Aiven for OpenSearch®

Shelved · Hoang Minh Vo responded

We need to validate each plugin installed in Aiven for OpenSearch to ensure that it won't cause any harm to our services.

Enable all cross cluster replication APIs

As an OpenSearch operator
I want to be able to pause, stop, and start cross cluster replication
so that I can use this feature to support failover in a disaster recovery scenario.

Typically in Elasticsearch or OpenSearch, CCR can be used to support a DR deployment by placing two clusters (a leader and a follower) in separate regions. When the leader cluster becomes unavailable, applications and clients can fail over to the follower cluster by stopping replication on the follower cluster, which turns the follower index into a regular index.

https://opensearch.org/docs/latest/tuning-your-cluster/replication-plugin/api/
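
The calls in question can be sketched as below, assuming the upstream replication plugin endpoints documented at the link above. The helper name and the index name are hypothetical. The start call also needs a request body naming the leader connection and index, which is omitted here.

```python
# HTTP method per replication action, per the upstream replication plugin API.
_ACTIONS = {"start": "PUT", "pause": "POST", "resume": "POST", "stop": "POST"}


def replication_request(action: str, follower_index: str) -> tuple[str, str]:
    """Return (method, path) for a replication action on a follower index."""
    if action not in _ACTIONS:
        raise ValueError(f"unsupported replication action: {action}")
    return _ACTIONS[action], f"/_plugins/_replication/{follower_index}/_{action}"


# Failover step from the description above: stop replication on the follower
# so that the follower index becomes a regular, writable index.
method, path = replication_request("stop", "follower-01")
print(method, path)
```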

1 vote

0 comments · Aiven for OpenSearch®

Gathering Interest · Hoang Minh Vo responded

Currently we control the start/pause process ourselves so that we can manage replication throughout the cluster's entire lifecycle (node recycling, upgrades, etc.) and ensure the stability of our services.

I have set this to Gathering Interest. I am also aware of the failover capability mentioned in the idea (we have a separate idea ticket for that). We can look into whether we need to expose all the APIs if the main use case is failover.

OpenSearch compute-optimized and storage-optimized clusters

As an architect
I want to create right-sized clusters for my use case
so that I can get the most value.

Currently, all OpenSearch clusters have a 1:4 CPU:RAM ratio. High-throughput application search use cases often have small data sets and can benefit from more CPU relative to RAM or disk (e.g. a 1:2 CPU:RAM ratio). Logging use cases with large volumes of data may benefit from storage-optimized instances with a 1:8 CPU:RAM ratio and more disk.
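
A small worked example of what these ratios mean in practice, assuming the ratio is read as vCPUs to GB of RAM (an assumption, since the unit is not stated). The profile labels and the 8-vCPU node size are hypothetical.

```python
def ram_gb_for(vcpus: int, ram_gb_per_vcpu: int) -> int:
    """RAM in GB for a node with the given vCPU count and 1:N CPU:RAM ratio."""
    return vcpus * ram_gb_per_vcpu


# Ratios taken from the description above; labels are illustrative only.
PROFILES = {
    "compute-optimized (1:2)": 2,
    "current (1:4)": 4,
    "storage-optimized (1:8)": 8,
}

for name, ratio in PROFILES.items():
    print(f"{name}: 8 vCPU -> {ram_gb_for(8, ratio)} GB RAM")
```

Read the other way round, a workload that needs 32 GB of RAM gets 16 vCPUs at 1:2 but only 4 vCPUs at 1:8.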

1 vote

Gathering Interest · 0 comments · Aiven for OpenSearch®

OpenSearch backpressure mechanisms

As an OpenSearch administrator, I want to be able to limit the impact of heavy requests on my cluster,
so that when some client applications make these requests, I can mitigate the impact on other client applications.

An example would be the ability to use the search backpressure mechanism:
https://opensearch.org/docs/latest/tuning-your-cluster/availability-and-recovery/search-backpressure/
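
As a minimal sketch of what enabling this involves, the upstream mechanism is switched through the `search_backpressure.mode` cluster setting, which accepts `monitor_only`, `enforced`, or `disabled`. The helper name below is hypothetical, and the thresholds that tune when tasks are cancelled are not shown.

```python
import json

VALID_MODES = ("monitor_only", "enforced", "disabled")


def search_backpressure_settings(mode: str) -> dict:
    """Build a body for PUT _cluster/settings that sets the backpressure mode."""
    if mode not in VALID_MODES:
        raise ValueError(f"mode must be one of {VALID_MODES}, got {mode!r}")
    return {"persistent": {"search_backpressure.mode": mode}}


body = search_backpressure_settings("enforced")
print("PUT _cluster/settings")
print(json.dumps(body, indent=2))
```

Starting in `monitor_only` and moving to `enforced` after reviewing the stats is a cautious way to roll this out.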

13 votes

Roadmapped · 0 comments · Aiven for OpenSearch®

Support two-zone OpenSearch clusters

As an OpenSearch architect
I want to provision the minimum amount of hardware I need to meet my requirements
so that I can optimize costs.

Currently, OpenSearch on Aiven only supports 3 AZ deployments for production-grade plans. OpenSearch clusters with data nodes deployed across 2 AZs can be considered production-grade as long as you have master nodes across 3 AZs.

1 vote

Shelved · 0 comments · Aiven for OpenSearch®

Support for searchable snapshots

As a user of OpenSearch,

I want to be able to store large amounts of immutable logs for lengthy periods of time

so that I can support compliance and other regulatory requirements placed on me.

In addition, I need this to be provided in a cost-efficient manner, leveraging technologies such as object storage. Given that queries against this data are infrequent, price far outweighs performance.

2 votes

Roadmapped · 0 comments · Aiven for OpenSearch®

OpenSearch ML Plugin cluster settings in UI

As a developer,
I want to be able to leverage the new ML capabilities in OpenSearch (https://opensearch.org/docs/latest/ml-commons-plugin/)
so that I can use new features such as semantic search, external models, etc.
Various cluster settings need to be exposed to end users to enable these features.

The most urgent is "only_run_on_ml_node", which is set to "true" in Aiven clusters. This needs to be set to "false" to allow ML workloads to be executed on any node (until we have the capability to assign dedicated ML nodes in the cluster).

More settings need to be exposed, but "only_run_on_ml_node" is a direct blocker for any ML use case.
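
For illustration, the change being requested corresponds to the following cluster settings body, assuming the upstream ML Commons setting path `plugins.ml_commons.only_run_on_ml_node`. The helper name is hypothetical.

```python
import json


def ml_node_settings(only_run_on_ml_node: bool) -> dict:
    """Build a body for PUT _cluster/settings for the ML Commons node restriction."""
    return {
        "persistent": {
            "plugins.ml_commons.only_run_on_ml_node": only_run_on_ml_node,
        }
    }


# False lets ML tasks run on any node, as requested above.
body = ml_node_settings(False)
print("PUT _cluster/settings")
print(json.dumps(body, indent=2))
```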

7 votes

Roadmapped · 1 comment · Aiven for OpenSearch®

Support for 3rd party ETL tools with Aiven for ClickHouse

As a data engineer,
I want to use third-party ETL tools to populate ClickHouse tables
so that I can use ClickHouse as my warehouse. Currently, popular tools such as Airbyte do not work when connected to Aiven for ClickHouse: they create internal state tables within ClickHouse, and this fails because tables cannot be created outside the context of the Aiven console. As a result, customers are unable to use external tools that manage their own state and require this permission.

2024-01-26 22:37:45 normalization > Code: 497. DB::Exception: avnadmin: Not enough privileges. To execute this query, it's necessary to have the grant CREATE DATABASE ON airbytedefault.*. (ACCESS_DENIED) (version 23.8.8.1)

3 votes

Gathering Interest · 1 comment · Aiven for ClickHouse®

Set BASE_URL for OpenSearch Dashboards

As an OpenSearch administrator,
I want to be able to set custom base URLs for my OpenSearch clusters
so that I can simplify usage for my customers when they have multiple OpenSearch services per group/client/customer.

1 vote

Gathering Interest · 0 comments · Aiven for OpenSearch®

PPL & Observability SIEM features in OpenSearch

As a Security Analyst,

I want to utilise the Observability plugin & PPL in an efficient manner. Features such as tooltips and autocomplete would help a lot, as well as bug fixes and regular updates.

The syntax is neither well nor widely understood, and there are lingering bugs, which are very hard for a user to duplicate across the microcosm of repositories that are bundled into the suite.

Observability & PPL feel like a very promising place for OpenSearch to become more useful to a security operations team, where currently the capabilities are extremely limiting.

While OpenSearch Dashboards, with tenancy and VisBuilder, have been used, they are complicated, and the search capabilities limit the ability to work in an effective, visual and fast manner.

Visual aspects are too separate from search; drill-downs are minimal or come with a massive time overhead; search is largely relegated to the very basics, and anything more capable has to be written as an API-style query.

Observability & PPL could be a great improvement if it grows, particularly with security team workflows in mind.

2 votes

Shelved · 1 comment · Aiven for OpenSearch®

Support JDBC for data ingestion to Aiven for ClickHouse

As a user of other services wanting to migrate my data to Aiven for ClickHouse,
I want to be able to use generic JDBC compatible tools to move my data out of my current service into Aiven for ClickHouse easily, and at an acceptable speed.
This can be done using the JDBC table function or table engine.
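
A rough sketch of what that could look like, using the upstream ClickHouse `jdbc(datasource, schema, table)` table function, which upstream requires a separately running clickhouse-jdbc-bridge. The connection string, schema, and table names are hypothetical placeholders, and the helper names are not part of any library.

```python
def jdbc_select_sql(datasource: str, schema: str, table: str) -> str:
    """Build a SELECT over the upstream jdbc table function."""
    return f"SELECT * FROM jdbc('{datasource}', '{schema}', '{table}')"


def jdbc_copy_sql(target_table: str, datasource: str, schema: str, table: str) -> str:
    """Build an INSERT ... SELECT that copies a remote table into ClickHouse."""
    return f"INSERT INTO {target_table} {jdbc_select_sql(datasource, schema, table)}"


# Hypothetical placeholder values, for illustration only.
copy_sql = jdbc_copy_sql(
    "analytics.orders",
    "jdbc:mysql://source-db.example.com:3306/?user=reader&password=EXAMPLE",
    "shop",
    "orders",
)
print(copy_sql)
```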

0 votes

Gathering Interest · 0 comments · Aiven for ClickHouse®

Make OpenSearch Dashboards session timeout configurable

As a developer
I would like to have the following configuration options exposed:

opensearch_security.cookie.ttl
opensearch_security.session.ttl
opensearch_security.session.keepalive

so that I can lengthen the dashboard session timeout for my users.
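
As a small illustration of the values involved, the sketch below derives the three options for a chosen session length. It assumes, as in upstream OpenSearch Dashboards, that the two ttl options are expressed in milliseconds and that keepalive is a boolean; the helper name and the 24-hour example are hypothetical.

```python
def session_settings(hours: int, keepalive: bool = True) -> dict:
    """Derive the three session options for a session length given in hours."""
    ttl_ms = hours * 60 * 60 * 1000  # assumed unit: milliseconds
    return {
        "opensearch_security.cookie.ttl": ttl_ms,
        "opensearch_security.session.ttl": ttl_ms,
        "opensearch_security.session.keepalive": keepalive,
    }


# Print the options in the key: value form used by the Dashboards config file.
settings = session_settings(24)
for key, value in settings.items():
    print(f"{key}: {str(value).lower()}")
```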

3 votes

Gathering Interest · 0 comments · Aiven for OpenSearch®

Connecting ClickHouse to a PostgreSQL read replica

At the moment, only primary PostgreSQL instances are available for a ClickHouse connection. Wouldn't it make sense to be able to connect ClickHouse to a read replica to prevent performance impacts on the primary?

1 vote

Gathering Interest · 0 comments · Aiven for ClickHouse®
