Data Analytics

Join our forum to discuss your ideas with Aiven community or check out our public roadmap.

93 results found

  1. As an Aiven user and Aiven for ClickHouse user,
    I want to be able to use Aiven for ClickHouse on OVH Cloud with all the Aiven capabilities available on the other cloud providers, seamlessly.

    1 vote

  2. As an Aiven user and Aiven for ClickHouse user,
    I want to be able to use Aiven for ClickHouse on Oracle Cloud with all the Aiven capabilities available on the other cloud providers, seamlessly.

    4 votes

  3. As a ClickHouse user, I want to be able to use table functions and engines that require credentials (e.g. a private remote S3 bucket, a remote Delta Lake table, etc.) without hassle or risk.
    With named collections in Aiven for ClickHouse, you can set your integration credentials once and use them safely in all your remote queries.
    Moreover, you can easily rotate credentials if needed: change them once in the Aiven console and the change applies to all your integrations.

    1 vote

  4. As a developer,
    I want to be able to connect to an external Kafka cluster and use an external schema registry at the same time,
    so that I can ingest external messages encoded with an external schema (AvroConfluent, for example).

    1 vote

  5. As a database administrator,
    I want to restore from an external snapshot that isn't hosted on Aiven,
    so that I can migrate data from certain OpenSearch and Elasticsearch clusters (Aiven and non-Aiven) to Aiven for OpenSearch.

    3 votes

  6. I need to read and write Parquet format files to/from Azure Blob Storage.

    3 votes

  7. Dear Aiven Community,

    As a DevOps Engineer,
    I want to be able to configure the shard_indexing_pressure settings directly within the Aiven platform and through the Aiven Terraform provider,
    so that I can better manage indexing loads on OpenSearch clusters, optimize performance during high data throughput, and prevent potential bottlenecks.

    In addition, this configuration capability is essential for dynamically adjusting indexing pressure based on real-time data demands. Currently, trying to enable or modify shard_indexing_pressure settings results in a 403 error, indicating that the feature is not supported in Aiven's current OpenSearch offerings. Enabling this feature would allow users to set parameters…

    8 votes

  8. As an operator of OpenSearch,
    I want to be alerted when my shards are outside the recommended best practice of 10-50 GB per shard,
    so that I can avoid overly large shards causing performance problems for ingestion and queries.

    In addition, please tell me within the alert how to split my shards if they do get too large.

    1 vote

    Roadmapped  ·  Hoang Minh Vo responded

    Thanks, Jason. We can roadmap this. We will update the idea when we have a more concrete timeline.
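
The check requested in this idea can be sketched as follows. This is only an illustration, assuming shard sizes have already been collected (for example from the `_cat/shards` API); the `ShardSize` type and the function names are hypothetical, and the 10-50 GB bounds are taken from the idea text, interpreted here as decimal gigabytes.

```python
from dataclasses import dataclass
from typing import List, Tuple

GB = 10 ** 9                 # assumption: decimal gigabytes
MIN_SHARD_BYTES = 10 * GB    # lower bound quoted in the idea
MAX_SHARD_BYTES = 50 * GB    # upper bound quoted in the idea


@dataclass(frozen=True)
class ShardSize:
    """Hypothetical record for one shard's on-disk size."""
    index: str
    shard: int
    size_bytes: int


def classify_shard(shard: ShardSize) -> str:
    """Return 'too_large', 'too_small', or 'ok' for a single shard."""
    if shard.size_bytes > MAX_SHARD_BYTES:
        return "too_large"
    if shard.size_bytes < MIN_SHARD_BYTES:
        return "too_small"
    return "ok"


def shards_needing_attention(shards: List[ShardSize]) -> List[Tuple[ShardSize, str]]:
    """Keep only shards outside the recommended range, with their verdict."""
    flagged = []
    for shard in shards:
        verdict = classify_shard(shard)
        if verdict != "ok":
            flagged.append((shard, verdict))
    return flagged
```

An alerting job could run this over every shard in the cluster and include the flagged index names in the alert message.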

  9. As an SRE or operations engineer,
    I want to be able to create index snapshots manually or with a policy,
    so that I can easily and quickly restore partial data after an unintentional index corruption.
    In addition, I understand this would require shared storage between cluster nodes, which means additional storage, so it would make sense to make it a paid feature and/or include it in the tiered storage project.

    5 votes

    Thanks for posting the idea; it is reasonable. I will update the status once we know more about our plans for it.

  10. As a developer
    I want to use a custom plugin in OpenSearch
    so that I can implement custom scoring or text analysis or use a community open source plugin.

    2 votes

    Shelved  ·  Hoang Minh Vo responded

    We need to validate each plugin installed in Aiven for OpenSearch to ensure that it won't cause any harm to our services.

  11. As an OpenSearch operator
    I want to be able to pause, stop, and start cross cluster replication
    so that I can use this feature to support failover in a disaster recovery scenario.

    Typically in Elasticsearch or OpenSearch, CCR can be used to support a DR deployment by placing two clusters (a leader and a follower) in separate regions. When the leader cluster becomes unavailable, applications and clients can fail over to the follower cluster by stopping replication on the follower cluster, which turns the follower index into a regular index.

    https://opensearch.org/docs/latest/tuning-your-cluster/replication-plugin/api/

    1 vote

    Currently, we control the start/pause/stop process ourselves so that we can manage replication throughout the entire cluster lifecycle (through node recycling, upgrades, etc.) and ensure the stability of our services.

    I have set this to Gather interest. I am also aware of the failover capability mentioned in other ideas (we have a separate idea ticket for that). We can look into whether we need to expose all the APIs if the main use case is failover.
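
For context, the replication calls discussed in this idea can be sketched as plain request descriptions. The paths below follow the replication plugin API linked in the idea as best recalled, so verify them against that page; the `ReplicationRequest` type and `replication_request` helper are hypothetical, and per the response above these calls are currently managed by Aiven rather than exposed to users.

```python
from typing import NamedTuple


class ReplicationRequest(NamedTuple):
    """Hypothetical description of one HTTP call to the replication plugin."""
    method: str
    path: str
    body: dict


# Assumption: action suffixes as documented for the replication plugin API.
_ACTION_SUFFIXES = {
    "pause": "_pause",
    "resume": "_resume",
    "stop": "_stop",
}


def replication_request(action: str, follower_index: str) -> ReplicationRequest:
    """Build the request for pausing, resuming, or stopping replication."""
    if action not in _ACTION_SUFFIXES:
        raise ValueError(f"unsupported replication action: {action!r}")
    if not follower_index:
        raise ValueError("follower_index must not be empty")
    path = f"/_plugins/_replication/{follower_index}/{_ACTION_SUFFIXES[action]}"
    # These endpoints are sent with an empty JSON object as the body.
    return ReplicationRequest(method="POST", path=path, body={})
```

In the failover scenario described above, a client would send the `stop` request to the follower cluster so that the follower index becomes a regular, writable index.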

  12. As an architect
    I want to create right-sized clusters for my use case
    so that I can get the most value.

    Currently, all the OpenSearch clusters have a 1:4 CPU:RAM ratio. High throughput application search use cases often have small data sets and can benefit from more relative CPU than RAM or disk (e.g. 1:2 CPU:RAM ratio). Logging use cases with large volumes of data may benefit from storage optimized instances with 1:8 CPU:RAM ratio with more disk.

    1 vote

  13. As an OpenSearch administrator, I want to be able to limit the impact of heavy requests on my cluster,
    so that when some client applications make these requests, I can mitigate the impact on other client applications.

    An example would be the ability to use the search backpressure mechanism:
    https://opensearch.org/docs/latest/tuning-your-cluster/availability-and-recovery/search-backpressure/

    13 votes
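
As an illustration of what this idea asks to expose, the sketch below builds a cluster settings body that switches the search backpressure mode. The `search_backpressure.mode` setting and its three values come from the OpenSearch documentation linked above as best recalled; the helper name is hypothetical, and whether Aiven accepts this setting is exactly what the idea requests, so treat it as an assumption.

```python
import json

# Modes described in the OpenSearch search backpressure documentation.
VALID_MODES = ("monitor_only", "enforced", "disabled")


def search_backpressure_settings(mode: str) -> dict:
    """Build a body for PUT /_cluster/settings that sets the backpressure mode."""
    if mode not in VALID_MODES:
        raise ValueError(f"mode must be one of {VALID_MODES}, got {mode!r}")
    return {"persistent": {"search_backpressure.mode": mode}}


def to_json(settings: dict) -> str:
    """Serialize the settings body for use as an HTTP request payload."""
    return json.dumps(settings, sort_keys=True)
```

Starting in `monitor_only` mode and moving to `enforced` once the observed cancellations look reasonable would be one cautious way to roll this out.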

  14. As an OpenSearch architect
    I want to provision the minimum amount of hardware I need to meet my requirements
    so that I can optimize costs.

    Currently, OpenSearch on Aiven only supports 3 AZ deployments for production-grade plans. OpenSearch clusters with data nodes deployed across 2 AZs can be considered production-grade as long as you have master nodes across 3 AZs.

    1 vote

  15. As a user of OpenSearch,

    I want to be able to store large amounts of immutable logs for lengthy periods of time,

    so that I can support compliance and other regulatory requirements placed on me.

    In addition, I need this to be provided in a cost-efficient manner, leveraging technologies such as object storage. Given that queries against this data are infrequent, price far outweighs performance.

    2 votes

  16. As a developer,
    I want to be able to leverage the new ML capabilities in OpenSearch (https://opensearch.org/docs/latest/ml-commons-plugin/),
    so that I can use new features like semantic search, external models, etc.
    There are various cluster settings that need to be exposed to end users to enable these features.

    The most urgent is "only_run_on_ml_node", which is set to "true" in Aiven clusters. This needs to be set to "false" to allow ML workloads to be executed on any node (until we have the capability to assign dedicated ML nodes in the cluster).

    There are more configs that…

    7 votes
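
The setting change requested here can be sketched as a cluster settings body. The full key `plugins.ml_commons.only_run_on_ml_node` is taken from the ML Commons documentation as best recalled and should be checked against the link above; the helper name is hypothetical, and per this idea Aiven does not currently let users change the value.

```python
import json


def ml_commons_settings(only_run_on_ml_node: bool) -> dict:
    """Build a body for PUT /_cluster/settings controlling ML node placement."""
    return {
        "persistent": {
            # False lets ML tasks run on any node, not only dedicated ML nodes.
            "plugins.ml_commons.only_run_on_ml_node": only_run_on_ml_node,
        }
    }


def to_json(settings: dict) -> str:
    """Serialize the settings body for use as an HTTP request payload."""
    return json.dumps(settings, sort_keys=True)
```

Calling `ml_commons_settings(False)` produces the body that corresponds to the change described in this idea.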

  17. As a data engineer,
    I want to use third-party ETL tools to populate ClickHouse tables,
    so that I can use ClickHouse as my warehouse. Currently, popular tools such as Airbyte do not work when connected to Aiven for ClickHouse because they create internal state tables within ClickHouse, and this fails because tables are not allowed to be created outside the context of the Aiven console. Customers are unable to use external tools that self-manage state and require this permission.

    2024-01-26 22:37:45 normalization > Code: 497. DB::Exception: avnadmin: Not enough privileges. To execute this…

    3 votes

  18. As an OpenSearch administrator,
    I want to be able to set custom Base_URLs for my OpenSearch clusters
    so that I can simplify usage for my customers when they have multiple OS services per group/client/customer.

    1 vote

  19. As a Security Analyst,

    I want to utilise the Observability plugin & PPL in an efficient manner. Features such as tooltips and autocomplete would help a lot, as would bug fixes and regular updates.

    The syntax is neither well nor widely understood, and there are lingering bugs, which are very hard for a user to duplicate across the microcosm of repositories that are bundled into the suite.

    Observability & PPL feels like a very promising area for OpenSearch to become more useful to a security operations team, where the capabilities are currently extremely limited.

    While OpenSearch Dashboards, with tenancy and…

    2 votes

  20. As a user of other services wanting to migrate my data to Aiven for ClickHouse,
    I want to be able to use generic JDBC-compatible tools to move my data out of my current service and into Aiven for ClickHouse easily and at an acceptable speed.
    This can be done using the JDBC table function or table engine.

    0 votes
