314 Commits

Author SHA1 Message Date
TrisNol
4058824f15 fix(data-extraction): Resolve regex issue in detecting auditors 2023-10-17 17:56:26 +02:00
Sebastian
c680ac9759
Feature/ner (#103)
NER und Sentiment-Pipeline mit Services zur Datenextraktion.

---------

Co-authored-by: Philipp Horstenkamp <philipp@horstenkamp.de>
Co-authored-by: TrisNol <tristan.nolde@yahoo.de>
2023-10-16 19:54:24 +02:00
Tristan Nolde
99b61e7c2e
feat(data-extraction): Extract c/o relation from street (#222) 2023-10-15 13:46:10 +02:00
TrisNol
f1474feaf8 refactor: Adapt to extended unit tests 2023-10-15 13:21:41 +02:00
Tristan Nolde
fd47487367
Update tests/utils/data_extraction/unternehmensregister/transform_test.py
Co-authored-by: Philipp Horstenkamp <philipp@horstenkamp.de>
2023-10-15 13:07:34 +02:00
Tristan Nolde
f6f4c99147
Update src/aki_prj23_transparenzregister/models/company.py
Co-authored-by: Philipp Horstenkamp <philipp@horstenkamp.de>
2023-10-15 13:07:24 +02:00
TrisNol
8db04177be feat(data-extraction): Extract c/o relation from street in company relation 2023-10-15 13:06:32 +02:00
Tristan Nolde
7e54ab98c5
fix(data-extraction): Parse date from Gesellschaftsvertrag entry (#221) 2023-10-15 13:06:04 +02:00
TrisNol
eba5235dff refactor: Implement PR feedback 2023-10-15 12:05:25 +02:00
Tristan Nolde
15ace5382d
Update src/aki_prj23_transparenzregister/utils/data_extraction/unternehmensregister/transform.py
Co-authored-by: Philipp Horstenkamp <philipp@horstenkamp.de>
2023-10-15 11:56:06 +02:00
Tristan Nolde
d34a0ffeac
Update src/aki_prj23_transparenzregister/utils/data_extraction/unternehmensregister/transform.py
Co-authored-by: Philipp Horstenkamp <philipp@horstenkamp.de>
2023-10-15 11:55:58 +02:00
Tristan Nolde
39c13ac74a
Update tests/utils/data_extraction/unternehmensregister/transform_test.py
Co-authored-by: Philipp Horstenkamp <philipp@horstenkamp.de>
2023-10-15 11:51:11 +02:00
TrisNol
b972acee7a fix(data-extraction): Parse date from Gesellschaftsvertrag entry 2023-10-14 18:22:41 +02:00
KM-R
84772a5511
Small mypy fix (#219) 2023-10-14 18:12:01 +02:00
c0d42a22d7
General updates & linting (#218) 2023-10-14 17:43:45 +02:00
6365e252b9
Added location to person (#185) 2023-10-14 15:27:19 +00:00
411f1053dd
Set home title (#217) 2023-10-14 17:16:54 +02:00
f8c111d7e2
Resolve mismatch between staging and prod db data for financials (#211)
SQL Creation is now done dynamicly by the definition of the enumeration
type.
2023-10-14 17:16:14 +02:00
KM-R
9f7d714403
Visualize financials (#206)
Adds the financial graph to the company page. The graph is only
available for companies with existing financial data.
2023-10-14 17:08:34 +02:00
c8d3c7395b
Update docker-compose.yml (#210) 2023-10-12 18:09:52 +02:00
Tristan Nolde
f65a377d73
fix(data-extraction): Handle malformed date_of_birth fields (#204)
fix(data-extraction): Handle malformed date_of_birth fields
2023-10-09 19:15:37 +02:00
68346aff15
Removed error suppression from sql data transformation. (#205) 2023-10-09 17:58:20 +02:00
TrisNol
84d0139531 fix(data-extraction): Handle malformed date_of_birth fields 2023-10-07 17:01:34 +02:00
Tristan Nolde
7500895982
fix: Add script to fix malformed yearly_result entries (#202) 2023-10-07 12:35:29 +02:00
TrisNol
9cc58ba8be fix: Add script to fix malformed yearly_result entries 2023-10-07 09:11:43 +02:00
63325e7faa
Add constraints to the SQL entities (#186) 2023-10-06 18:48:58 +02:00
ea9c777217
Enabeling data processing in docker container. (#195) 2023-10-06 18:36:29 +02:00
b1ca268a62
SQL fixes after new mongo ingest (#199) 2023-10-06 18:22:19 +02:00
8bb27f5195
Set the DASH_URL_BASE_PATHNAME in the docker container to be used as a default (#192)
Set the DASH_URL_BASE_PATHNAME in the docker container to be used as a
prefix.

Added a Dash Title
2023-10-06 18:13:25 +02:00
Tristan Nolde
5137026fab
hotfix(jupiter): Missing port forwarding of mongo container to host (#198) 2023-10-06 18:04:21 +02:00
Tristan Nolde
5e737162a7
Update docker-compose.yml
Co-authored-by: Philipp Horstenkamp <philipp@horstenkamp.de>
2023-10-06 17:59:14 +02:00
TrisNol
54151c5041 hotfix(jupiter): Missing port forwarding of mongo container to host 2023-10-06 17:30:18 +02:00
ab96c5a01a
Switched two docker container off. 2023-10-05 22:16:45 +02:00
Christian Gawron
d57007e26b changed volumes for data bases 2023-10-05 20:41:47 +02:00
09c36960e3
Add an list of missing relation partners to be searched (#171)
- [x] Add a new table
- [x] Add a field to the table that can register if the company was
already queried
- [x] Add a field to the table that counts how many times a relation
partner was missing
- [x] Add a function that restets the counter

Also:
- Reworked the get_company function to use the location dict as kwargs
2023-10-05 19:57:30 +02:00
c6f2c7467c
Rework the transfer of company data to fit the new data in the mongodb (#188)
This adds the additional company data as proposed to the sql db.

- [x] @TrisNol Is everything included or did I miss a feature. Relations
are in another issue.
- [x] @KM-R New DB features for the Dashbord for your review.
2023-10-05 19:47:46 +02:00
KM-R
2152704dfc
175 create person page (#178)
Created person page and updated search bar in the header to search for persons
2023-10-05 18:00:31 +02:00
Tristan Nolde
bf7c072e87
Fix/company names with quotes (#187) 2023-10-04 20:07:51 +02:00
ea89a64a80
Linter update to ruff==0.0.292 (#191) 2023-10-04 20:04:59 +02:00
41f2c9f995
Executing black over all jupyter notebook (#190)
Reverting black for the jupyter notebooks gets old. Can we just run
black over all of them?
2023-10-04 20:03:47 +02:00
TrisNol
38d19a835d refactor: Replace last replace with call of dedicated function 2023-10-04 19:39:50 +02:00
030ad00c7d
Testing speedup with in memory SQLite (#189)
If no SQLite File is written and deleted testing is MUCH faster.
2023-10-04 19:36:57 +02:00
TrisNol
c2f1670032 refactor: Implement MyPy feedback 2023-10-03 16:43:23 +02:00
TrisNol
0dfe95652d refactor: Implement MyPy feedback 2023-10-03 16:42:09 +02:00
TrisNol
259259953e refactor: Move quote removal funtion to string utils, adapt to requirements 2023-10-03 16:37:54 +02:00
TrisNol
2a446a9937 checkpoint: Remove quotes from company names in relations 2023-10-03 14:33:46 +02:00
TrisNol
49498ad7c0 checkpoint: Remove quotes from company name 2023-10-03 14:33:45 +02:00
Tristan Nolde
aa6c0efc87
docs: Meeting notes for 2023-09-28 (#164)
Zum Review
2023-10-03 14:33:08 +02:00
Tristan Nolde
7e9cff046a
fix(data-extraction): Parse house-number from street field if possibl… (#179) 2023-10-03 14:26:21 +02:00
TrisNol
231513d45d docs: Adding Seminararbeit on Data Extraction 2023-10-03 12:38:17 +02:00