Improved support for external File content: Difference between revisions
No edit summary |
No edit summary |
||
Line 100: | Line 100: | ||
=== File API changes === | === File API changes === | ||
* pluginfile.php and draftfile.php (it's actually send_stored_file()): when file isn't local file (`repositoryid` isn't zero), stored_file instance should ask repository plugin‘s | * pluginfile.php and draftfile.php (it's actually send_stored_file()): when file isn't local file (`repositoryid` isn't zero), stored_file instance should ask repository plugin‘s send the file contents, Repository API decide return the cached copy or fresh contents | ||
* When call file_prepare_draft_area(), it should keep linking files' repository information | * Preserve file reference information when call file_storage::create_file_from_storedfile | ||
* file_storage::get_area_files should retrieve file reference information, also file_storage::get_file_by_id, file_storage::get_file_by_hash | |||
* stored_file::delete() should detect repository files, and process it properly | |||
* When call file_prepare_draft_area(), it should keep linking files' repository information | |||
* file_save_draft_files should preserve file reference information | |||
* Adding callback to files API delete function, if the actual file get deleted, callback needs to check all linked files, and convert them to actually files | * Adding callback to files API delete function, if the actual file get deleted, callback needs to check all linked files, and convert them to actually files | ||
* new method: file_storage::create_file_from_reference | * new method: file_storage::create_file_from_reference | ||
Line 111: | Line 115: | ||
=== Repository API changes === | === Repository API changes === | ||
* | * Additions to FILE_INTERNAL and FILE_EXTERNAL, we need another type FILE_REFERENCE = 4 // 0100, repository plugin needs to declare what types of files are supported | ||
* When users ask to create a reference (instead of copying) in file picker, Repository API try to cache the file in filepool(special file area for repository), after file downloaded, repository API should ask Files API to create a virtual file in `mdl_files` table, the existing fields stay the same, extra file reference information is stored in `mdl_files_reference` table, link or other format that repository plugins know. File API return the stored_file object, repository API generate the moodle url for users just like other moodle files without revealing the internal reference | * When users ask to create a reference (instead of copying) in file picker, Repository API try to cache the file in filepool(special file area for repository), after file downloaded, repository API should ask Files API to create a virtual file in `mdl_files` table, the existing fields stay the same, extra file reference information is stored in `mdl_files_reference` table, link or other format that repository plugins know. File API return the stored_file object, repository API generate the moodle url for users just like other moodle files without revealing the internal reference | ||
Line 120: | Line 124: | ||
* Cron will use repository plugin callbacks to clean up cache files, repoistory::cron($repositoryid) | * Cron will use repository plugin callbacks to clean up cache files, repoistory::cron($repositoryid) | ||
* repository::send_file(stored_file $stored_file) | |||
* repository::get_file_reference($ref) | |||
=== Repository plugins changes === | === Repository plugins changes === | ||
Line 142: | Line 150: | ||
==== class repository_cache ===== | ==== class repository_cache ===== | ||
* | |||
* | Returned stored_file instance or file path | ||
* store($url, $string_to_be_hashed) | |||
* get($string_to_be_hashed) | |||
=== Filepicker Javascript API for customizing=== | === Filepicker Javascript API for customizing=== |
Revision as of 05:29, 13 February 2012
Note: This page is a work-in-progress. Feedback and suggested improvements are welcome. Please join the discussion on moodle.org or use the page comments.
File synching | |
---|---|
Project state | Planning |
Tracker issue | MDL-28666 |
Discussion | TODO |
Assignee | Martin Dougiamas |
Moodle 2.2
Problems with Files in 2.0
- It is not currently easy to use a file in multiple places throughout Moodle and update them all at once
- It is not currently easy to create a simple shared "course repository" for teachers to use
Example use cases that will become possible
A teacher wants to upload a file once and use it in multiple courses. When they update the file, it should be updated in all their courses automatically.
- The teacher uploads the file to their private file area (or other repository).
- In each place they want to add the file, they use the file picker to select the file from their private files area (or other repo) and select "link to latest version".
- Replacing the file will automatically mean the linked copies use it.
- Deleting the original file will force the linked copies to be static copies (the teacher will be informed of all the copies before proceeding with the delete).
Several teachers want to create a shared repository of files together
- One teacher adds a course repository block to the course
- Using the block, the teacher creates an instance of a "?filesystem? repository" inside the course (essentially a folder).
- The content of the repository can be edited via the "Course repositories" block.
- All of the teachers in that course can now see that appear in their filepickers and select files
- Files can be "linked" as above
A student wants to submit a linked file as an assignment, so they can continue updating it after the assignment due date.
- This will not be possible, because the assignment will not allow linking to files.
- The student is forced to upload a copy of a file and this is protected by the assignment module.
Solution summary
The basic idea is to allow the CONTENT of files to be stored outside of the filepool, while all the metadata and access is controlled by Moodle exactly the same as it is now.
- Extend Files API with a new "reference" concept, with all files now having a "repositoryid" (the repository instance that the file came from) and a "reference" (the address in that repository of the content of the file).
- All files copied into Moodle don't have record in `files_reference` table, but others can specify a UUID (usually a URL with file-specific tokens in it, created by the original user who placed the file) in reference and repostiroyid columns.
- Improve the filesystem repository plugin to make it easier to create folder repositories on the fly, via a block.
- Add support to the filepicker UI to add "linking" to more repositories that support it, including the server files and filesystem repository.
- Virtual files are served via pluginfile.php URLs just like normal files:
- pluginfile.php uses module callback to determine access (as now), and if it passes then
- pluginfile.php calls a file logging subsystem to log the fact that this file is being served (useful for copyright reporting, for example)
- If the file is not 'local', then pluginfile.php uses the relevant repository callback to get the content of the file and streams it to the user with appropriate mimetypes etc
- The repositories have a way to cache this content in the normal Moodle filepool, to avoid repeated downloads.
- If an external repository is down or not configured, then the repository plugin can choose to just serve the local cached version (useful for restored backups and disaster tolerance)
- When a file is linked to another another file in the local location, then it has a location of "filepool". We need to pay attention to these during:
- Garbage cleanups
- Deleting the original file should create real copies of that file where required (user will be informed).
Note that the original URL of files in external repositories are never revealed to users.
Details
File picker walk-through
- User clicks the "Insert image" button in TinyMCE, then launches our File Picker in the dialog
- User chooses a repository which supports UUID direct file references (eg Equella, Alfresco etc)
- A. File picker will ask repository plugin for customised repository UI if supported
- B. File Picker use repository UI and <object> type to display customised UI in file picker right pane
- User selects a file from the repository, if is customised UI, file picker Javascript API will be used to trigger file selection event (with all file related information), the repository plugin gives you two choices in the file picker interface http://tracker.moodle.org/secure/attachment/25823/2011-11-09+15.23.png http://tracker.moodle.org/secure/attachment/25825/File+picker+options.png
- C. Use the current version: this will copy the file to Moodle (as now). No need to reference external content.
- C. Use the latest version: this will use a file reference, so that the most recent version is always pulled from the repository
- D. If "Use the latest version" selected, Repository API will download the file as usual but store it for caching, we could locate this cached file when we need it, cron will take responsibility of invalidating/updating it
- E. Repository API ask File API to create a file with file parameters and reference. File API stores the reference and repository instance id in `mdl_files_reference` table and returns the file URL which looks the same as any other Moodle file URL.
- File picker gives the file URL to TinyMCE
- When TinyMCE displays the resource, it will cause the browser to call the file URL, which contains pluginfile.php.
- Pluginfile.php uses File API to send file contents to browsers, if File API detects the requested file is not ordinary moodle files which is located at external repository, File API will ask Repository API for file content, Repository API will firstly look for the cached file, if file is too old or not found (removed by cron checking), Repository API will fetch the resource (and cache it), then return to File API. Alternatively, Repository API could disable caching, asking for fresh content all the time.
File request walk-through
- A. User request a file
- B. File API detects if the request file is regular moodle files or located at external repository, if it's external files, File API will ask repository API to grab the file
- C. File API collects the file reference information from database, it could be stored in php serialised or JSON format
- D. File API passes raw file reference information to repository plugin
- E. Repository plugin will firstly check if file available locally
- F. If not repository plugin will use file reference information to grab the file
- G. Repository API returns file content to File API to serve the file
Database changes
We can create another table for left joining, this requires File API query this table when locating files:
- New file table `mdl_files_reference`
- `id` primary key
- `fileid` foreign key of `mdl_files`.`id`
- `repositoryid`
- `reference` - can be URL, UUID or other data format, repository plugin callbacks know the meaning of this field. File reference should be cached when adding to moodle, contenthash should be accurate.
- `mdl_files_log` table for files access log, File API should have a new function to insert records to this table
- `id` primary key
- `userid` (0 if it's guest),
- `timeaccess`,
- `fileid`
File API changes
- pluginfile.php and draftfile.php (it's actually send_stored_file()): when file isn't local file (`repositoryid` isn't zero), stored_file instance should ask repository plugin‘s send the file contents, Repository API decide return the cached copy or fresh contents
- Preserve file reference information when call file_storage::create_file_from_storedfile
- file_storage::get_area_files should retrieve file reference information, also file_storage::get_file_by_id, file_storage::get_file_by_hash
- stored_file::delete() should detect repository files, and process it properly
- When call file_prepare_draft_area(), it should keep linking files' repository information
- file_save_draft_files should preserve file reference information
- Adding callback to files API delete function, if the actual file get deleted, callback needs to check all linked files, and convert them to actually files
- new method: file_storage::create_file_from_reference
public function create_file_from_reference($file_record, $repoisitoryid, $reference, array $options = NULL)
- Cron: we probably need a Repository API function to cache/update external files
Repository API changes
- Additions to FILE_INTERNAL and FILE_EXTERNAL, we need another type FILE_REFERENCE = 4 // 0100, repository plugin needs to declare what types of files are supported
- When users ask to create a reference (instead of copying) in file picker, Repository API try to cache the file in filepool(special file area for repository), after file downloaded, repository API should ask Files API to create a virtual file in `mdl_files` table, the existing fields stay the same, extra file reference information is stored in `mdl_files_reference` table, link or other format that repository plugins know. File API return the stored_file object, repository API generate the moodle url for users just like other moodle files without revealing the internal reference
- Making repository plugin upgrade and versioning possible, repository plugin may need to update `reference` in `mdl_files_reference` table if reference info changed. (we already have db/upgrade.php, make sure all new plugins have one)
- When deleting repository instance, all files imported by this instance will have to be converted to actual files, this has to be done by a File API function
- Cron will use repository plugin callbacks to clean up cache files, repoistory::cron($repositoryid)
- repository::send_file(stored_file $stored_file)
- repository::get_file_reference($ref)
Repository plugins changes
- Server files: store file parameters in `reference` field
- Private files: same as server files plugin
- Alfresco: Store UUID in `reference` field, but alfresco will change UUID once tomcat restart, may need other information to locate files
- Flickr private: needs flickr secret and token and photo id to locate the files
- File system: store file path in `reference` field
- s3: store file path, s3 repository will use secret and token to fetch the file from s3 no matter files are public or not.
- EQUELLA
- Admin installs EQUELLA and setup parameters for single sign on
- Teacher clicks EQUELLA instance, EQUELLA will return an URL of repository UI (plugin code)
- Teacher pick a file from EQUELLA, EQUELLA repository UI will revoke file picker JavaScript API to notify moodle download this resource (plugin code)
- Repository API stores file UUID and SSO userid, creates file reference in moodle file pool
- EQUELLA plugin implements method to download contents using stored UUID and SSO userid
- EQUELLA plugin implements method to update/invalidate cached resources (by cron)
Content caching
- Create moodledata/repostory/cache directory
- Generate hash based on file URL and request parameters (provided by repository plugin), not content hash because we cannot send content hash to external repository for file information
- Cached files are stored using hash code as file name
class repository_cache =
Returned stored_file instance or file path
- store($url, $string_to_be_hashed)
- get($string_to_be_hashed)
Filepicker Javascript API for customizing
- File picker should be able to dislable file references by taking an option
- Support <object> tag in filepicker container
- Provide Javascript API to allow plugin communicate with filepicker
- Notify file picker to download file
- Notify file picker to pop up authentication page
File manager to handle virtual files
Not much trouble here, need to make sure draftfile.php can serve external resources, because all files managed by file manager is in draft area