The Prolegis crawler uses an article extraction tool to find metadata from source content. The documentation below describes how to format your HTML so that we can extract high quality metadata. We’ve attempted to support open standards for HTML metadata wherever possible, so following these formats may improve metadata in other crawler based sources.
The attributes below are optional. If the HTML does not contain the documented elements, our crawler will attempt to pick the best value. If you're unsure why the crawler is not behaving as expected, or if you generally have questions about the crawler, please contact us via the chat tool in the bottom right corner of this screen.
Field | Priority | Placement | Format | Example |
---|---|---|---|---|
Title | 1 | head |
|
|
2 | inline |
|
|
|
Authors | 1 | head |
|
|
2 | inline |
|
|
|
3 | inline |
|
|
|
Published Date | 1 | head |
|
|
2 | inline |
|
|
|
Image Url | 1 | head |
|
|