d0677287b6
Added a filter for financial reports. ( #372 )
...
Financial reports are now filtered before being added to the SQL
database so that only known keys are added.
Some matching is also done.
The most important missing reports are printed so they can be implemented
later on.
Rapidfuzz could be used.
2023-11-13 18:52:12 +01:00
af8a907cf9
Stop table reset for better table persistence. ( #373 )
2023-11-12 14:27:44 +01:00
170056bf58
test: Cover apps/fetch_news.py with unit tests
2023-11-11 14:30:00 +01:00
ac6ca3547b
test: Add unit test for news api wrapper
2023-11-11 14:30:00 +01:00
066800123d
Created pipeline to run NER, sentiment and SQL ingest ( #314 )
...
Created a data processing pipeline that enhances the raw mined data with
organisation extractions and sentiment analysis prior to moving the data
to the SQL DB.
The transfer of matched data is done afterwards.
---------
Co-authored-by: SeZett <zeleny.sebastian@fh-swf.de>
2023-11-11 13:28:12 +00:00
a6d486209a
Introduce extended_financial_data code ( #357 )
...
Introducing the previously developed method to fetch the financial data
via table parsing (aka "data lake like solution") in a non-destructive
manner by defaulting to the current RegEx-based behaviour.
2023-11-11 14:10:20 +01:00
e5b61bc19c
Added multi-relation dropdowns to dashboard ( #363 )
...
This change allows a more complete set of relation combinations to be
filtered.
2023-11-11 13:47:46 +01:00
9edf5b1dce
test: Increase coverage for multi-column headers
2023-11-11 11:03:36 +01:00
fecf42d75a
test: Unit test new KPI extraction
2023-11-11 11:01:17 +01:00
e5769b3c25
Added Tests
...
Co-authored-by: Tristan Nolde <TrisNol@users.noreply.github.com>
2023-11-10 18:56:51 +01:00
410b690873
Added test
2023-11-10 18:56:51 +01:00
41af7e2d18
Added test behaviour
2023-11-10 18:56:51 +01:00
f38728450d
Now ruff conformant
2023-11-10 18:53:47 +01:00
f2ac0eda91
Added Relation_count method
2023-11-10 18:53:47 +01:00
31d7098d48
Checkpoint commit
2023-11-10 18:52:13 +01:00
30f9e4506f
solved errors
2023-11-10 18:50:38 +01:00
7e8adfafd5
Test Version
2023-11-10 18:50:11 +01:00
6585a0ee11
On branch feature/visualize-verflechtungen
2023-11-10 18:45:25 +01:00
f9d3f0eb76
test: Cover apps/find_missing_companies.py
2023-11-05 13:47:06 +01:00
f7ec3eaf24
test: Increase test coverage and refactor v3
2023-11-05 12:55:47 +01:00
e8d1a37cff
test: Extend unit tests
2023-11-04 14:19:41 +01:00
61f94fa3b9
test: Unit tests
2023-11-04 11:24:36 +01:00
d6b07431e7
test: Adapt existing unit tests to refactored imports
2023-11-04 11:24:36 +01:00
ad36c68993
Moved the AI tests into the AI folder. ( #315 )
2023-11-03 13:45:24 +01:00
8d9981d967
Moved AI files in the AI module. ( #308 )
2023-11-02 20:30:04 +01:00
f72d606d18
Added base-path support in URL generating features ( #288 )
...
Add the Dash base-path URL to the path generation for dynamically
generated links.
2023-10-29 20:40:40 +01:00
b564b2627c
Update company stats after extraction of more master data (Stammdaten) ( #267 )
2023-10-26 19:15:39 +02:00
7953ba9291
Mixed typo fixes ( #270 )
2023-10-26 19:06:45 +02:00
896136dcee
Added an about page ( #251 )
...
This page was added since it is sometimes difficult to say which version
was deployed on a server. This should allow an easy lookup on the
server and make it comparable with what is expected.
2023-10-26 17:32:17 +02:00
1eb972b7ff
Adds the transfer of sentiments into the SQL DB ( #253 )
...
Transfers the sentiments from the MongoDB into the SQL DB.
2023-10-24 17:50:40 +02:00
36a0bab6ff
Add relations from financial reports to SQL ( #216 )
2023-10-19 19:21:33 +02:00
83d313150c
test: Update to new functions
2023-10-17 18:47:25 +02:00
600039207d
test(data-extraction): Adapt unit tests to new behaviour
2023-10-17 18:16:44 +02:00
c680ac9759
Feature/ner ( #103 )
...
NER and sentiment pipeline with services for data extraction.
---------
Co-authored-by: Philipp Horstenkamp <philipp@horstenkamp.de>
Co-authored-by: TrisNol <tristan.nolde@yahoo.de>
2023-10-16 19:54:24 +02:00
f1474feaf8
refactor: Adapt to extended unit tests
2023-10-15 13:21:41 +02:00
fd47487367
Update tests/utils/data_extraction/unternehmensregister/transform_test.py
...
Co-authored-by: Philipp Horstenkamp <philipp@horstenkamp.de>
2023-10-15 13:07:34 +02:00
8db04177be
feat(data-extraction): Extract c/o relation from street in company relation
2023-10-15 13:06:32 +02:00
7e54ab98c5
fix(data-extraction): Parse date from Gesellschaftsvertrag entry ( #221 )
2023-10-15 13:06:04 +02:00
eba5235dff
refactor: Implement PR feedback
2023-10-15 12:05:25 +02:00
39c13ac74a
Update tests/utils/data_extraction/unternehmensregister/transform_test.py
...
Co-authored-by: Philipp Horstenkamp <philipp@horstenkamp.de>
2023-10-15 11:51:11 +02:00
b972acee7a
fix(data-extraction): Parse date from Gesellschaftsvertrag entry
2023-10-14 18:22:41 +02:00
84772a5511
Small mypy fix ( #219 )
2023-10-14 18:12:01 +02:00
6365e252b9
Added location to person ( #185 )
2023-10-14 15:27:19 +00:00
f8c111d7e2
Resolve mismatch between staging and prod db data for financials ( #211 )
...
SQL creation is now done dynamically based on the definition of the
enumeration type.
2023-10-14 17:16:14 +02:00
9f7d714403
Visualize financials ( #206 )
...
Adds the financial graph to the company page. The graph is only
available for companies with existing financial data.
2023-10-14 17:08:34 +02:00
84d0139531
fix(data-extraction): Handle malformed date_of_birth fields
2023-10-07 17:01:34 +02:00
7500895982
fix: Add script to fix malformed yearly_result entries ( #202 )
2023-10-07 12:35:29 +02:00
9cc58ba8be
fix: Add script to fix malformed yearly_result entries
2023-10-07 09:11:43 +02:00
63325e7faa
Add constraints to the SQL entities ( #186 )
2023-10-06 18:48:58 +02:00
b1ca268a62
SQL fixes after new mongo ingest ( #199 )
2023-10-06 18:22:19 +02:00