Releases: chdb-io/chdb

v3.6.0

08 Sep 07:39
504a439

What's Changed

New Contributors

Full Changelog: v3.5.0...v3.6.0

v3.5.0

28 Jul 04:39
2d4f024

What's Changed

# Example of Using PyArrow RecordBatchReader with `deltalake.write_deltalake`
import pyarrow as pa
from deltalake import write_deltalake
from chdb import session as chs

sess = chs.Session()
# Get streaming result in arrow format
stream_result = sess.send_query("SELECT * FROM numbers(100000)", "Arrow")

# Create RecordBatchReader with custom batch size (default rows_per_batch=1000000)
batch_reader = stream_result.record_batch(rows_per_batch=10000)

# Use RecordBatchReader with external libraries like Delta Lake
write_deltalake(
    table_or_uri="./my_delta_table",
    data=batch_reader,
    mode="overwrite"
)

stream_result.close()

sess.close()

Full Changelog: v3.4.1...v3.5.0

v3.4.1

09 Jul 07:56
a89a503

What's Changed

Full Changelog: v3.4.0...v3.4.1

v3.4.0

26 Jun 04:18
68644d9

What's Changed

Full Changelog: v3.3.0...v3.4.0

v3.3.0

05 Jun 12:50
3f9e059

What's Changed

  • Fix signal loss by removing the signal handler in the ClickHouse engine by @wudidapaopao in #327
  • Add README documentation for JSON type support and new interface by @wudidapaopao in #335
  • Add storage_rows_read and storage_bytes_read interfaces for retrieving storage metrics by @wudidapaopao in #334
  • Implement JSON type support by @wudidapaopao in #330

# Example of querying JSON-like columns through Python(df) and Python(dict_data)
import chdb
import pandas as pd

dict_data = {
    "a": [1, 2, 3, 4, 5, 6],
    "b": ["tom", "jerry", "auxten", "tom", "jerry", "auxten"],
    "dict_col": [
        {'id': 1, 'tags': ['urgent', 'important'],
         'metadata': {'created': '2024-01-01'}},
        {'id': 2, 'tags': ['normal'],
         'metadata': {'created': '2024-02-01'}},
        {'id': 3, 'name': 'tom'},
        {'id': 4, 'value': '100'},
        {'id': 5, 'value': 101},
        {'id': 6, 'value': 102},
    ],
}

df = pd.DataFrame(dict_data)

chdb.query("SELECT b, sum(a) FROM Python(df) GROUP BY b ORDER BY b").show()
# "auxten",9
# "jerry",7
# "tom",5

chdb.query(
    "SELECT dict_col.id FROM Python(dict_data) WHERE dict_col.value='100'"
).show()
# 4
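
The storage metrics interfaces added in #334 can be read from a query result. The sketch below is not taken from the release notes: it assumes `storage_rows_read()` and `storage_bytes_read()` are methods on the result object, alongside `rows_read()` and `bytes_read()`. The helper name `storage_metrics` is hypothetical, and the live demo is skipped when chdb is not installed.

```python
def storage_metrics(result):
    """Collect read counters from a chDB query result into a plain dict.

    Assumes the result object exposes the four methods called below.
    """
    return {
        "rows_read": result.rows_read(),
        "bytes_read": result.bytes_read(),
        "storage_rows_read": result.storage_rows_read(),
        "storage_bytes_read": result.storage_bytes_read(),
    }


try:
    import chdb
except ImportError:
    chdb = None  # chdb not installed; skip the live demo

if chdb is not None:
    res = chdb.query("SELECT count() FROM numbers(1000)", "CSV")
    print(storage_metrics(res))
```

Keeping the counters in a dict makes them easy to log or compare between queries.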

Full Changelog: v3.2.0...v3.3.0

v3.2.0

24 Apr 02:02
6cdf9d2

What's Changed

  • Add connection based API example by @auxten in #319
  • Fix the potential memory leak issue caused by query result memory not being properly released by @wudidapaopao in #320
  • Add support for streaming query in chDB by @wudidapaopao in #326

New Contributors

Streaming API
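
A minimal sketch of consuming a streaming query chunk by chunk, so the full result never has to sit in memory at once. It assumes that the object returned by `Session.send_query` is iterable, that each chunk exposes `rows_read()`, and that the stream has a `close()` method. The helper name `count_streamed_rows` is hypothetical, and the live demo is skipped when chdb is not installed.

```python
def count_streamed_rows(session, sql, fmt="CSV"):
    """Run `sql` as a streaming query and return the total rows seen across chunks."""
    total = 0
    stream_result = session.send_query(sql, fmt)
    try:
        # Each iteration yields one chunk of the result, not the full result set.
        for chunk in stream_result:
            total += chunk.rows_read()
    finally:
        # Release the stream even if iteration fails part-way through.
        stream_result.close()
    return total


try:
    from chdb import session as chs
except ImportError:
    chs = None  # chdb not installed; skip the live demo

if chs is not None:
    sess = chs.Session()
    print(count_streamed_rows(sess, "SELECT * FROM numbers(200000)"))
    sess.close()
```

Closing the stream in a `finally` block matters most when the loop exits early, since an unfinished stream may still hold query resources.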

Full Changelog: v3.1.2...v3.2.0

v3.1.2

28 Mar 05:24
218a3ed

What's Changed

  • Fix connection_wrapper destructor sometimes closing other connections by @meastham in #312
  • Disable AVX2 by default by @auxten in #314

New Contributors

Full Changelog: v3.1.1...v3.1.2

v3.1.1

10 Mar 11:28
ae670ba

What's Changed

  • Fix Null, UUID and df arrow table output format by @auxten in #308

Full Changelog: v3.1.0...v3.1.1

v3.1.0

03 Mar 04:29
85c6bda

What's Changed

Full Changelog: v3.0.1...v3.1.0

v3.0.1

24 Feb 12:40
1212d86

What's Changed

New Contributors

Full Changelog: v3.0.0...v3.0.1