Skip to content

need help to Store fluentd logs in azure datalake as parquet.gz format #19

@wasifshareef

Description

@wasifshareef

Hi,

I will appreciate if you can help me to conform if fluentd-plugin-azurestorage-gen2 supports fluentd logs shipped to azure datalake as perquet.gz format.

I tried with below configuration for store but it doesnot store files in perquet.gz . I believe perquet.gz format is supported with fluentd-s3 plugin

@type azurestorage_gen2
time_as_integer true
##################################################################
azure_oauth_tenant_id xxxxxxxx
azure_oauth_app_id xxxxxxxxx
azure_oauth_secret xxxxxxxxx
azure_oauth_identity_authority https://login.microsoftonline.com
azure_storage_account xxxxx
azure_container xxxxx
auto_create_container true

store_as gzip

     store_as                         parquet
     compress                         parquet
     parquet_compression_codec        gzip
     format                           single_value
     azure_object_key_format %{path}%{time_slice}/%{uuid_flush}.%{file_extension}
     path xxxxxx
    ##################################################################
    check_object false
    <format>
            @type json
    </format>
    <buffer>
            @type file
            path /var/log/td-agent/azure-bro
            timekey 2m
            timekey_wait 1m
            timekey_use_utc true
            chunk_limit_size 10m
            flush_thread_count 8
            flush_interval 10s
            queue_limit_length 32
            flush_mode interval
            retry_max_interval 30
            retry_forever true
            total_limit_size 15GB
    </buffer>
    time_slice_format %Y/%m/%d/%H

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions