Recently, GitHub has changed their terms of service to use all user data for AI training unless users explicitly opt out. This is probably the way Microsoft has obtained "appropriately licensed data".
this is almost certainly too recent to have been used for training data, no? Unless they optimistically included most repos somehow?
this is almost certainly too recent to have been used for training data, no? Unless they optimistically included most repos somehow?