Posting this in case anyone looks for it (including myself once I forget
for a "stored" DuckDB database:
Use connection string like:
Driver=DuckDB Driver;Database=C:\Dev\Testing\duckdb\myduck.db
This property must be set:
enhanced_column_info=false
Btw the docs mention that the DuckDB native driver is modeled after SQLite's driver. I wonder how hard it would be to add a native DuckDB driver to Zeoslib. Note that I don't know how popular DuckDB is, so that should be taken into account.
DuckDB ODBC connection
-
- Platinum Boarder
- Posts: 1984
- Joined: 17.01.2011, 14:17
Re: DuckDB ODBC connection
Hello Mark,
I have to admit, that I don't like SQLite very much because it isn't type safe. After a quick look at the data types of DuckDB, I saw that the varchar type isn't limited in size -> to be fully compatible, we would have to either map all varchar fields to memo columns or introduce a limit of our own.
I tried to play with the varchar type and checked the metadata information. The documentation lists 13 columns. When querieng the columns view in the information_schema, the duckdb tells me it has 45 columns...
So... writing a preliminary driver shouldn't be too hard but still requires time. The lack in type safety and the lack in documentation will require (hard) decisions that probably will limit the usability of the driver. Getting this under control will require more time later on.
If I have to do a suggestion for an embedded database: Use Firebird. It is much better documented and supported.
Egonhugeist once told me that he thinks writing a new driver probably would take about 4 weeks for him. The hardest part usually is doing metadata information queries for system tables and the like.
I have to admit, that I don't like SQLite very much because it isn't type safe. After a quick look at the data types of DuckDB, I saw that the varchar type isn't limited in size -> to be fully compatible, we would have to either map all varchar fields to memo columns or introduce a limit of our own.
I tried to play with the varchar type and checked the metadata information. The documentation lists 13 columns. When querieng the columns view in the information_schema, the duckdb tells me it has 45 columns...
So... writing a preliminary driver shouldn't be too hard but still requires time. The lack in type safety and the lack in documentation will require (hard) decisions that probably will limit the usability of the driver. Getting this under control will require more time later on.
If I have to do a suggestion for an embedded database: Use Firebird. It is much better documented and supported.
-
- Platinum Boarder
- Posts: 1984
- Joined: 17.01.2011, 14:17
Re: DuckDB ODBC connection
Doing a simplistic driver seems to be not too hard. I started on a header translation...
Re: DuckDB ODBC connection
Thanks for that info! I totally agree on Firebird. I use it for all my internal projects. The DuckDB thing is just because it was being used at a customer site. I've heard some buzz on it, but I don't know how popular it is at all. I'm happy to help with testing or developing btw.
-Mark
-Mark
-
- Platinum Boarder
- Posts: 1984
- Joined: 17.01.2011, 14:17
Re: DuckDB ODBC connection
Hello Mark,
the buzz seemes to be about DuckDB being an easy to use and implement column store, suited for use in OLAP scenarios where data analysis is the important part and not transaction processing.
I uploaded a first version of a DuckDB plain driver. Unfortunately it currently depends on the FPC ctypes unit. I am not sure, if we have a similar unit in Zeos. I didn't do a full translation of the header but functions for connecting, querying and using results are there.
The next steps are:
I am not sure where on this you want to step in? Or do you have other ideas on what to do? I also intend to base the development of the statement object on the TZAbstractBeginnerPreparedStatement. Although Egonhugeist dislikes it, it should make development faster but the result will be slower than other drivers.
You see - my approach is to get to results as fast as possible, but by sacrificing speed / performance for that.
Which is the next question: What kind of result sets do you expect to use? Just some rows or thousands of rows that need to be processes as fast as possible?
Please let me know what you think.
Best regards,
Jan
the buzz seemes to be about DuckDB being an easy to use and implement column store, suited for use in OLAP scenarios where data analysis is the important part and not transaction processing.
I uploaded a first version of a DuckDB plain driver. Unfortunately it currently depends on the FPC ctypes unit. I am not sure, if we have a similar unit in Zeos. I didn't do a full translation of the header but functions for connecting, querying and using results are there.
The next steps are:
- make the plain driver compile with Delphi too. -> This could be postponed if you can develop with FPC/Lazarus too
- create a dbc layer driver
- create a statement and implement parameter binding
- implement a result set.
- get the test suite running on DuckDB
- implement the metadata objects - maybe.... I really dislike this part, but think it is an essential capability of Zeos.
I am not sure where on this you want to step in? Or do you have other ideas on what to do? I also intend to base the development of the statement object on the TZAbstractBeginnerPreparedStatement. Although Egonhugeist dislikes it, it should make development faster but the result will be slower than other drivers.
You see - my approach is to get to results as fast as possible, but by sacrificing speed / performance for that.
Which is the next question: What kind of result sets do you expect to use? Just some rows or thousands of rows that need to be processes as fast as possible?
Please let me know what you think.
Best regards,
Jan
Re: DuckDB ODBC connection
To be completely honest this is the first time I hear about it :)
After RabbitMQ, DuckDuckGo and so on it seems having an animal in product names is trendy lately.
Edit: I'd be careful with a native driver approach, because
If it means what I think it is, I can see this opening a huge can of worms in the near future.DuckDB offers a flexible extension mechanism that allows defining new data types
Delphi 12.2, Zeos 8 from latest GIT snapshot
Using:
- MySQL server 8.0.18; libmysql.dll 8.0.40 x64 5.7.19 x68, libmariadb.dll 3.3.11
- Oracle server 11.2.0, 12.1.0, 19.0.0; oci.dll 21.15
- MSSQL 2012, 2019; sybdb.dll FreeTDS_3102
- SQLite 3.47
Using:
- MySQL server 8.0.18; libmysql.dll 8.0.40 x64 5.7.19 x68, libmariadb.dll 3.3.11
- Oracle server 11.2.0, 12.1.0, 19.0.0; oci.dll 21.15
- MSSQL 2012, 2019; sybdb.dll FreeTDS_3102
- SQLite 3.47
-
- Platinum Boarder
- Posts: 1984
- Joined: 17.01.2011, 14:17
Re: DuckDB ODBC connection
Sooo - I added a first implementation for a DBC layer connection. Currently it is possible, to connect using a TZConnection object and to use TZConnection.ExecuteDirect. The driver only uses UTF8 because I had the impression that DuckDB is UTF8 only - I couldn't find any documentation about that.
Creating statements, result sets or callable statements is not supported yet and so no usage of TZQuery or TZReadOnlyQuery or any other component is possible.
Also transaction support or changing the default schema is not supported and has to be implemented.
Currently if one wishes to test DuckDB, the ZPlainDuckDB and ZDbcDuckDB units will have to be added manually to the project since the packages don't include them yet.
And their varchar type cannot be restricted in length. So all their varchar fields will be mapped to TMemo for now.
Creating statements, result sets or callable statements is not supported yet and so no usage of TZQuery or TZReadOnlyQuery or any other component is possible.
Also transaction support or changing the default schema is not supported and has to be implemented.
Currently if one wishes to test DuckDB, the ZPlainDuckDB and ZDbcDuckDB units will have to be added manually to the project since the packages don't include them yet.
Yes - there are some pitfalls there. But we have the same situation for PostgreSQL. I simply don't plan to support any types that we cannot handle. They have a hugeint type which effectively is Int128 -> I don't know how to support this. It could be converted to a double though.
And their varchar type cannot be restricted in length. So all their varchar fields will be mapped to TMemo for now.
-
- Platinum Boarder
- Posts: 1984
- Joined: 17.01.2011, 14:17
Re: DuckDB ODBC connection
Today I added a first go at a statement. Since we don't have a result set, it's use is very limited. The DuckDB connection doesn't return this statement yet but still generates an error.
Also I added a Wiki page for the driver, so there will be a list of todo's and shortcomings as well as ideas for resolving these shortcomings: https://sourceforge.net/p/zeoslib/wiki/DuckDB/
Note: Development speed will slow down considerably from now on. Until today we had hoildays and I was able to spend a lot of time on this. Starting tomorrow I will have to work again
Also I added a Wiki page for the driver, so there will be a list of todo's and shortcomings as well as ideas for resolving these shortcomings: https://sourceforge.net/p/zeoslib/wiki/DuckDB/
Note: Development speed will slow down considerably from now on. Until today we had hoildays and I was able to spend a lot of time on this. Starting tomorrow I will have to work again