Commit Graph

602 Commits

Author SHA1 Message Date
TrisNol ac6ca3547b test: Add unit test for news api wrapper 2023-11-11 14:30:00 +01:00
TrisNol ae41cf61bc checkpoint: Resolve error in handelsblatt text fetch 2023-11-11 14:30:00 +01:00
TrisNol a428eb4432 checkpoint: Init news extraction components and main app 2023-11-11 14:30:00 +01:00
Philipp 905021af14 Experimental caching (#285)
Added some caching decoraterors to speedup page delivery.
2023-11-11 14:28:25 +01:00
Philipp 066800123d Created pipeline to run ner sentiment and sql ingest (#314)
Created a dataprocessing pipline that enhances the raw mined data with
Organsiation extractions and sentiment analysis prio to moving the data
to the sql db.
The transfer of matched data is done afterword.

---------

Co-authored-by: SeZett <zeleny.sebastian@fh-swf.de>
2023-11-11 13:28:12 +00:00
Tristan Nolde a6d486209a Introduce extended_financial_data code (#357)
Introducing the previously developed method to fetch the financial data
via table parsing (aka "data lake like solution") in a non-destructive
manner by defaulting to the current RegEx-based behaviour.
2023-11-11 14:10:20 +01:00
Philipp e5b61bc19c Added multi relation dropdowns to dashbord (#363)
This change allows for a more complete combination of relation
combinations to be filtered.
2023-11-11 13:47:46 +01:00
Philipp ad8f5d0fb1 Added github actions automerge for pre-commit updates. (#362) 2023-11-11 13:30:16 +01:00
TrisNol b0bcdc6fe1 refactor: PR feedback implemented 2023-11-11 11:18:23 +01:00
Tristan Nolde 834f93a26e Update src/aki_prj23_transparenzregister/utils/data_extraction/bundesanzeiger.py
Co-authored-by: Philipp Horstenkamp <philipp@horstenkamp.de>
2023-11-11 11:03:36 +01:00
TrisNol e1b8397f9e feat: Introduce switch for different financial extraction routines 2023-11-11 11:03:36 +01:00
TrisNol 9edf5b1dce test: Increase coverage for multi-column headers 2023-11-11 11:03:36 +01:00
TrisNol 3ba8c0abea refactor: Remove debugging statements 2023-11-11 11:03:36 +01:00
TrisNol 3b1f0425cf deps: Adding html5lib for table parsing via Pandas 2023-11-11 11:03:36 +01:00
TrisNol 801f945c59 temp: Print exception for test debuggin 2023-11-11 11:01:17 +01:00
Tristan Nolde c19697c7f8 Update src/aki_prj23_transparenzregister/utils/data_extraction/bundesanzeiger.py
Co-authored-by: Philipp Horstenkamp <philipp@horstenkamp.de>
2023-11-11 11:01:17 +01:00
Tristan Nolde fe7690620a Update src/aki_prj23_transparenzregister/utils/data_extraction/bundesanzeiger.py
Co-authored-by: Philipp Horstenkamp <philipp@horstenkamp.de>
2023-11-11 11:01:17 +01:00
TrisNol fecf42d75a test: Unit test new KPI extraction 2023-11-11 11:01:17 +01:00
TrisNol f8a0d58314 feat(data-extraction): Provide KPI table analysis in bundesanzeiger wrapper 2023-11-11 11:01:17 +01:00
TrisNol 815e08a8f1 checkpoint: Transform values to € and normalize column names 2023-11-11 11:01:17 +01:00
TrisNol ec11ae13aa checkpoint: Parse table into dict of financial data 2023-11-11 11:01:17 +01:00
TrisNol 972fcd155e checkpoint: Normalize HTML tables fetched from Bundesanzeiger 2023-11-11 11:01:17 +01:00
Tristan Nolde 8781d746e7 hotfix: Add missing networkx dependency (#361)
Depyloment on Jupiter is currently broken due to missing `networkx`
dependency:

![image](https://github.com/fhswf/aki_prj23_transparenzregister/assets/56017812/8601fb23-e16b-4986-a52a-36a02c6bea29)

Should be fixed by the changes included
2023-11-10 22:52:18 +01:00
TrisNol c333ad70c5 hotfix: Add missing networkx dependency 2023-11-10 21:47:34 +01:00
Tim Ronneburg 247719c76f Feature/visualize verflechtungen (#324) 2023-11-10 19:33:19 +01:00
TrisNol da72c3d0a8 lint: Format company_elements.py 2023-11-10 19:21:32 +01:00
TrisNol a1d8e942a9 test: Adapt home.py to run unit tests 2023-11-10 19:20:49 +01:00
Tim fdbb6b5fd4 Added Graph to Company page again 2023-11-10 18:57:11 +01:00
Tim e5769b3c25 Added Tests
Co-authored-by: Tristan Nolde <TrisNol@users.noreply.github.com>
2023-11-10 18:56:51 +01:00
Tim 410b690873 Added test 2023-11-10 18:56:51 +01:00
Tim 41af7e2d18 Added test behaviour 2023-11-10 18:56:51 +01:00
Tim 4d2ca3b3e7 Refactored Session handling for Network analysis 2023-11-10 18:56:51 +01:00
Tim ac46348cc8 Added Dash DAQ 2023-11-10 18:55:13 +01:00
Tim c38460c740 fixed mypy errors 2023-11-10 18:54:30 +01:00
Tim f38728450d now ruff confirm 2023-11-10 18:53:47 +01:00
Tim f2ac0eda91 Added Realtion_count MEthod 2023-11-10 18:53:47 +01:00
Tim 76af89ff32 updated poetry lock 2023-11-10 18:53:32 +01:00
Tim 5b7f82a983 Bug fixes v2 2023-11-10 18:52:13 +01:00
Tim 152743597e Bug fixes 2023-11-10 18:52:13 +01:00
Tim 31d7098d48 Checkpoint commit 2023-11-10 18:52:13 +01:00
Tim c5721362ac Test Bugs 2023-11-10 18:52:01 +01:00
Tim 4fe97dfd86 Bug fixes 2023-11-10 18:51:44 +01:00
Tim deee0cd09d Added Datatable 2023-11-10 18:51:44 +01:00
Tim b594add257 Added more customizing 2023-11-10 18:51:44 +01:00
Tim 30f9e4506f solved errors 2023-11-10 18:50:38 +01:00
Tim 7e8adfafd5 Test Version 2023-11-10 18:50:11 +01:00
Tim 891ed277b6 Added comments, descriptions and cleaned up. 2023-11-10 18:49:09 +01:00
Tim bcb6df8e5d Added 2d and 3d network to dash 2023-11-10 18:48:51 +01:00
Tim e45f3a3b98 Added 3D Network 2023-11-10 18:48:18 +01:00
Tim 077846d3be NetworkX experiments 2023-11-10 18:48:18 +01:00