Update README.md
README.md
CHANGED
@@ -12,4 +12,17 @@ tags:
 - osint
 - research
 - crawling
----
+---
+
+# Scooby
+
+Scooby is the first model created to detect profile pages while crawling.
+
+It is trained mainly on data scraped from the websites of Serbian universities; around 20%
+of the data comes from the websites of other organizations and companies.
+
+## Preprocessing
+
+For preprocessing, 2880x1620 images were downscaled to 360x480 (by mistake), which does not preserve the original 16:9 aspect ratio.
+
+The images have a single grayscale channel.
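
The preprocessing step described in the README text can be sketched roughly as follows. This is a minimal illustration, not the author's pipeline: the README does not say which library or resampling method was used, so the pure-Python nearest-neighbour resize, the BT.601 grayscale weights, and the function names `to_grayscale`, `resize_nearest`, and `preprocess` are all assumptions. It also assumes that "360x480" means width x height, in the same order as "2880x1620".

```python
def to_grayscale(pixel):
    """Convert an (R, G, B) tuple to one luma value (ITU-R BT.601 weights, assumed)."""
    r, g, b = pixel
    return round(0.299 * r + 0.587 * g + 0.114 * b)


def resize_nearest(image, new_width, new_height):
    """Nearest-neighbour resize of a row-major 2D list of pixels (assumed method)."""
    old_height = len(image)
    old_width = len(image[0])
    return [
        [
            image[y * old_height // new_height][x * old_width // new_width]
            for x in range(new_width)
        ]
        for y in range(new_height)
    ]


def preprocess(rgb_image, target_size=(360, 480)):
    """Downscale an RGB image to target_size (width, height) and keep one gray channel."""
    width, height = target_size
    small = resize_nearest(rgb_image, width, height)
    return [[to_grayscale(p) for p in row] for row in small]
```

Because the source is 16:9 and the target is 3:4, a resize like this stretches the page content vertically. A real pipeline would more likely use an image library with proper resampling filters; the hand-written loop here only keeps the sketch self-contained.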