mirror of
https://github.com/LemmyNet/lemmy-docs.git
synced 2024-11-01 10:09:54 +00:00
add ranking algo docs (#199)
* add ranking algo docs * add ranking algo page to summary * add rank_algorithm.png * move img up a level * move docs to contributors and add img * move rank algo image * move rank algo to top level * link to document from voting page * add file * fix summary * ../ --------- Co-authored-by: Dessalines <dessalines@users.noreply.github.com>
This commit is contained in:
parent
700597871d
commit
f5af7828ac
4 changed files with 50 additions and 0 deletions
|
@ -35,6 +35,7 @@
|
||||||
- [API](contributors/04-api.md)
|
- [API](contributors/04-api.md)
|
||||||
- [Federation](contributors/05-federation.md)
|
- [Federation](contributors/05-federation.md)
|
||||||
- [Resources](contributors/06-resources.md)
|
- [Resources](contributors/06-resources.md)
|
||||||
|
- [Ranking Algorithm](contributors/07-ranking-algo.md)
|
||||||
|
|
||||||
# Code of Conduct
|
# Code of Conduct
|
||||||
|
|
||||||
|
|
47
src/en/contributors/07-ranking-algo.md
Normal file
47
src/en/contributors/07-ranking-algo.md
Normal file
|
@ -0,0 +1,47 @@
|
||||||
|
# Trending / Hot / Best Sorting algorithm
|
||||||
|
|
||||||
|
## Goals
|
||||||
|
|
||||||
|
- During the day, new posts and comments should be near the top, so they can be voted on.
|
||||||
|
- After a day or so, the time factor should go away.
|
||||||
|
- Use a log scale, since votes tend to snowball, and so the first 10 votes are just as important as the next hundred.
|
||||||
|
|
||||||
|
## Implementations
|
||||||
|
|
||||||
|
### Reddit
|
||||||
|
|
||||||
|
Does not take the lifetime of the thread into account, [giving early comments an overwhelming advantage over later ones,](https://minimaxir.com/2016/11/first-comment/) with the effect being even worse in small communities. New comments pool at the bottom of the thread, effectively killing off discussion and making each thread a race to comment early. This lowers the quality of conversation and rewards comments that are repetitive and spammy.
|
||||||
|
|
||||||
|
### Hacker News
|
||||||
|
|
||||||
|
While far superior to Reddit's implementation for its decay of scores over time, [Hacker News' ranking algorithm](https://medium.com/hacking-and-gonzo/how-hacker-news-ranking-algorithm-works-1d9b0cf2c08d) does not use a logarithmic scale for scores.
|
||||||
|
|
||||||
|
### Lemmy
|
||||||
|
|
||||||
|
Counterbalances the snowballing effect of votes over time with a logarithmic scale. Negates the inherent advantage of early comments while still ensuring that votes still matter in the long-term, not nuking older popular comments.
|
||||||
|
|
||||||
|
```
|
||||||
|
Rank = ScaleFactor * log(Max(1, 3 + Score)) / (Time + 2)^Gravity
|
||||||
|
|
||||||
|
Score = Upvotes - Downvotes
|
||||||
|
Time = time since submission (in hours)
|
||||||
|
Gravity = Decay gravity, 1.8 is default
|
||||||
|
```
|
||||||
|
|
||||||
|
- Lemmy uses the same `Rank` algorithm above, in two sorts: `Active`, and `Hot`.
|
||||||
|
- `Active` uses the post votes, and latest comment time (limited to two days).
|
||||||
|
- `Hot` uses the post votes, and the post published time.
|
||||||
|
- Use Max(1, score) to make sure all comments are affected by time decay.
|
||||||
|
- Add 3 to the score, so that everything that has less than 3 downvotes will seem new. Otherwise all new comments would stay at zero, near the bottom.
|
||||||
|
- The sign and abs of the score are necessary for dealing with the log of negative scores.
|
||||||
|
- A scale factor of 10k gets the rank in integer form.
|
||||||
|
|
||||||
|
A plot of rank over 24 hours, of scores of 1, 5, 10, 100, 1000, with a scale factor of 10k.
|
||||||
|
|
||||||
|
![rank_algorithm.png](rank_algorithm.png)
|
||||||
|
|
||||||
|
#### Active User counts
|
||||||
|
|
||||||
|
Lemmy also shows counts of _active users_ for your site, and its communities. These are counted within the last `day`, `week`, `month`, and `half year`, and are cached on starting up lemmy, and every hour.
|
||||||
|
|
||||||
|
An active user is someone who has posted or commented on our instance or community within the last given time frame. For site counts, only local users are counted. For community counts, federated users are included.
|
BIN
src/en/contributors/rank_algorithm.png
Normal file
BIN
src/en/contributors/rank_algorithm.png
Normal file
Binary file not shown.
After Width: | Height: | Size: 54 KiB |
|
@ -26,3 +26,5 @@ Comments can be sorted in the following ways. These all keep the indentation int
|
||||||
- **Old**: Shows oldest comments first
|
- **Old**: Shows oldest comments first
|
||||||
|
|
||||||
Additionally there is a sort option **Chat**. This eliminates the hierarchy, and puts all comments on the top level, with newest comments shown at the top. It is useful to see new replies at any point in the conversation, but makes it difficult to see the context.
|
Additionally there is a sort option **Chat**. This eliminates the hierarchy, and puts all comments on the top level, with newest comments shown at the top. It is useful to see new replies at any point in the conversation, but makes it difficult to see the context.
|
||||||
|
|
||||||
|
The ranking algorithm is described in detail [here](../contributors/07-ranking-algo.md).
|
||||||
|
|
Loading…
Reference in a new issue