Query on date doesn't return what expected

stephancb · November 14, 2024, 5:52am

The expression has only one placeholder ‘?’, but two parameters ‘startdate’ and ‘enddate’, of which only ‘startdate’ will be used. So it should return rows with the start date if there are any. Try

SQLWhereExpression(“date >= ?1 AND date <= ?2”, startdate, enddate),

andreh · November 16, 2024, 12:50am

you need this model (Todo) git

with the function search (line 32) which is called from DashController

and you can load this data (CSV) to the “Todo” model output2.csv

Thank you!

drizk1 · November 16, 2024, 3:55pm

ok, so I just threw it in TidierDB since I was having some issues setting up the above method, but I was able to correctly filter by dates with sql.

DBInterface.execute(db, "select user_id, category, date from read_csv(
    'https://raw.githubusercontent.com/amhg/parentModule/refs/heads/main/output2.csv',
    ignore_errors=true) where date >= '2024-10-03' AND date <= '2024-10-27'") |>DataFrame 
#returns 13 rows shown below

Another option (with the code and sql output are below via duckdb) is to use STRPTIME to explicitly convert… ymd converts the sql to STRPTIME('2024-10-03', '%Y-%m-%d') which might be the solution to your output issue. The STRPTIME conversion is not necessary in TidierDB either, so i am not sure why the issue is arising for you. The CSV does appear to be a bit buggy to read as i had to use ignore_errors=true

using TidierDB
db = connect(duckdb())
data = db_table(db, "read_csv(
    'https://raw.githubusercontent.com/amhg/parentModule/refs/heads/main/output2.csv',
    ignore_errors=true
)")  

@chain t(data) begin
    @select(user_id, category, date)
    @filter(date >= ymd("2024-10-03"))
    @arrange(date)
    @aside @show_query _
    @collect
end

WITH cte_1 AS (
SELECT *
        FROM read_csv(
    'https://raw.githubusercontent.com/amhg/parentModule/refs/heads/main/output2.csv',
    delim=',',
    header=true,
    ignore_errors=true
)
        WHERE date >= STRPTIME('2024-10-03', '%Y-%m-%d'))  
SELECT user_id, category, date
        FROM cte_1  
        ORDER BY date ASC
20×3 DataFrame
 Row │ user_id  category    date       
     │ Int64    String      Date       
─────┼─────────────────────────────────
   1 │       3  personal    2024-10-05
   2 │       3  hobby       2024-10-06
   3 │       4  work        2024-10-06
   4 │       4  shopping    2024-10-06
   5 │       3  learning    2024-10-09
   6 │       3  work        2024-10-10
   7 │       3  learning    2024-10-16
   8 │       3  work        2024-10-16
   9 │       3  other       2024-10-17
  10 │       4  accounting  2024-10-18
  11 │       4  personal    2024-10-21
  12 │       3  accounting  2024-10-21
  13 │       3  family      2024-10-22
  14 │       3  work        2024-10-28
  15 │       3  other       2024-10-30
  16 │       4  learning    2024-10-30
  17 │       3  hobby       2024-10-31
  18 │       3  errands     2024-10-31
  19 │       3  personal    2024-11-02
  20 │       3  shopping    2024-11-06

@chain t(data) begin
    @select(user_id, category, date)
    @filter(date >= ymd("2024-10-03") && date <= ymd("2024-10-27"))
    @arrange(date)
    @aside @show_query _
    @collect
end
13×3 DataFrame
 Row │ user_id  category    date       
     │ Int64    String      Date       
─────┼─────────────────────────────────
   1 │       3  personal    2024-10-05
   2 │       3  hobby       2024-10-06
   3 │       4  work        2024-10-06
   4 │       4  shopping    2024-10-06
   5 │       3  learning    2024-10-09
   6 │       3  work        2024-10-10
   7 │       3  learning    2024-10-16
   8 │       3  work        2024-10-16
   9 │       3  other       2024-10-17
  10 │       4  accounting  2024-10-18
  11 │       4  personal    2024-10-21
  12 │       3  accounting  2024-10-21
  13 │       3  family      2024-10-22

andreh · November 16, 2024, 8:49pm

Thank you! let me check it out
Could you share your whole code in git?
the project I am following it uses SearchLight do you also use it?

How do you connect to an existing DB with DuckDB?
I looked at the documentation https://duckdb.org/docs/api/julia.html

It shows only how to create a:

# create a new in-memory database
con = DBInterface.connect(DuckDB.DB, ":memory:")

drizk1 · November 16, 2024, 9:16pm

The entirety of my code i used is copied above, but if you still need me to I can throw on my github if thats what u mean? i dont use git.

here is how you connect to an existing one stored locally.

db = DBInterface.connect(DuckDB.DB, "/path/to/you/database.db")
db2 = DBInterface.connect(DuckDB.DB, "/path/to/you/database.sqlite")

I have never used SearchLight before, no. Just briefly looked at their docs.

andreh · November 16, 2024, 10:25pm

I see, ok !

may I ask you how do you specify the SQL Driver when executing the command:

DBInterface.execute(db, "select user_id, category, date from read_csv(
    'https://raw.githubusercontent.com/amhg/parentModule/refs/heads/main/output2.csv',
    ignore_errors=true) where date >= '2024-10-03' AND date <= '2024-10-27'") |>DataFrame

I have MySQL installed and I would like to try

Topic		Replies	Views
Unable to successfully read sqlserver Date datatype [ODBC, DataFrames] New to Julia data , database	1	493	October 20, 2021
Dataframe encoding New to Julia question	6	1289	February 17, 2019
JuliaDB, handling and transforming dates New to Julia	5	552	January 14, 2020
Can't load sqlite table to dataframe New to Julia question	2	360	June 27, 2023
How load date on SQLite General Usage package	1	286	September 15, 2021

Query on date doesn't return what expected

Related topics