BinaryWorks.it Official Forum
 IMDB
JDommi
Administrator

Germany
4676 Posts

Posted - 05 Jan 2025 :  16:50:29
The tt number, the page to be loaded, and the reference must be passed to the EXE together as a single parameter (without spaces). Example, MovieConnections for The Matrix: tt0133093/movieconnections/?ref_=tt_dyk_cnn
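
To make the parameter format concrete, here is a small sketch in Python (not MagicScript). The helper name build_imdb_parameter is made up for illustration. It only assumes what is stated above: tt number, page, and reference are joined into one string that must not contain spaces.

```python
def build_imdb_parameter(tt_number: str, page: str, ref: str) -> str:
    """Join tt number, page and reference into the single parameter string.

    Hypothetical helper for illustration only; the layout
    <tt number>/<page>/?ref_=<reference> is taken from the example above.
    """
    parameter = f"{tt_number}/{page}/?ref_={ref}"
    # The parameter has to be passed as ONE argument, so reject whitespace.
    if any(ch.isspace() for ch in parameter):
        raise ValueError("the parameter must not contain spaces")
    return parameter


example = build_imdb_parameter("tt0133093", "movieconnections", "tt_dyk_cnn")
print(example)
```

For The Matrix this reproduces the example string from the post. A reference with an accidental space in it would be rejected instead of being split into two parameters.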

// Get the tt-Number
#LOADWEBSLOTS#
    #STARTREADWEB# // Main Page
    #FINDLINE#=<link rel="canonical"#15##ONERROR#=STOP
    #FIND#=/tt#11# #ONERROR#=STOP
    #COMPUTECURSOR#=+2
    #PUTCURSOR#=#11#
    #FIND#=/#12# #ONERROR#=STOP
    #TAKEWORD#=#47#,#11#,#12#
#ENDLOADWEBSLOTS#
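
For readers who do not know MagicScript, the idea of the block above can be sketched roughly in Python. This is not a translation of the individual commands. It assumes the main page contains a canonical link of the form <link rel="canonical" href="https://www.imdb.com/title/tt0133093/", and the function name extract_tt_number is made up. The sketch returns the id including the tt prefix.

```python
import re
from typing import Optional


def extract_tt_number(html: str) -> Optional[str]:
    """Return the tt number found in the canonical link, or None.

    Assumption: the canonical link holds the title URL, e.g.
    https://www.imdb.com/title/tt0133093/
    """
    start = html.find('<link rel="canonical"')
    if start == -1:
        return None  # roughly what #ONERROR#=STOP guards against
    # Look for "/tt<digits>/" only after the canonical link starts.
    match = re.search(r"/(tt\d+)/", html[start:])
    return match.group(1) if match else None


sample = '<link rel="canonical" href="https://www.imdb.com/title/tt0133093/"/>'
print(extract_tt_number(sample))
```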


Put the following code at the beginning of the respective block for every page that has "see more" buttons (Connections, Awards, etc.).

// Block to take MovieConnections
    #PUT#=\Exporter\IMDB.exe#11#
    // The EXE to call
    #STRING#=#12#=#47#/movieconnections/?ref_=tt_dyk_cnn#12#
	// releaseinfo/?ref_=tt_ov_at_dt_rdat 
	// keywords/?ref_=tttg_at_pl_kw 
	// companycredits/?ref_=ttkw_at_dt_cmpy
	// locations/?ref_=ttco_at_dt_flmg
	// goofs/?ref_=ttpg_at_dyk_gf
	// crazycredits/?ref_=ttgf_at_dyk_cc
	// awards/?ref_=ttalt_at_op_awd
	#EXECUTEFILEPARAMETER#=#12# // Pass Parameter
	#EXECUTEFILEFROMPLUGINS#=#11# // The Initial path is the EXE Folder\Plugins

    #STRING#=#11#=.\tempdir\SavedWebHtml.txt
    // This instruction opens a file from a fixed location, like c:\temp...
    #OPENFILE#=#11# 
    #STARTREADWEB#

    // Parse the saved page to get ALL movie connections
    // the code goes here
    ...
// End of block to take MovieConnections


Input to the script: tt0133093/movieconnections/?ref_=tt_dyk_cnn
Output of the script: .\tempdir\SavedWebHtml.txt

Unfortunately, the only script available so far is imdb_plus, which was based on the IMDB Scraper by tarzibou. That script parsed all the info and saved it in JSON format.
Mine is intended to keep most of the work in the MagicScript engine, as there are more scripters than Delphi programmers.
Therefore the parsing has to be done completely from scratch.

Download:
https://www.mediafire.com/file/nmbsok8s28t9cuq/IMDB.rar/file

In order to achieve what is possible, you have to try the impossible over and over again.
Hermann Hesse

JDommi
Administrator

Germany
4676 Posts

Posted - 16 Jan 2025 :  23:16:39
Well, Connections are now working. But there is a limit on the length: 65,000 characters. For some movies, like The Matrix, that's not enough.
Does anyone have an idea of how to limit the list?

I have only added the following code to the end of the standard IMDB (Complete) script:
// Here
    #PUT#=\Exporter\IMDB.exe#11#
    // The EXE to call
    #STRING#=#12#=tt#47#/movieconnections/?ref_=tt_dyk_cnn#12#
	// releaseinfo/?ref_=tt_ov_at_dt_rdat 
	// keywords/?ref_=tttg_at_pl_kw 
	// companycredits/?ref_=ttkw_at_dt_cmpy
	// locations/?ref_=ttco_at_dt_flmg
	// goofs/?ref_=ttpg_at_dyk_gf
	// crazycredits/?ref_=ttgf_at_dyk_cc
	// awards/?ref_=ttalt_at_op_awd        
	#EXECUTEFILEPARAMETER#=#12# // Pass Parameter
	#EXECUTEFILEFROMPLUGINS#=#11# // The Initial path is the EXE Folder\Plugins

    #STRING#=#11#=.\tempdir\SavedWebhtml.txt
    // This instruction opens a file from a fixed location, like c:\temp...
    #OPENFILE#=#11# 
    #STARTREADWEB#
	
#PUT#=#13#
#FINDLINE#=textPrimary"><a href="#11#
#IF# #11#==#00#
	#GOTO#=NOCONNECTIONS
#ENDIF#

:LOOPCAT 
#FIND#=a href="#11#
#IF# #11#==#00#
	#GOTO#=NOMORECONNECTIONS
#ENDIF#
#FIND#=<span id="#11#
#FIND#=>#11#
#FIND#=<#12#
#TAKEWORD#=#14#,#11#,#12#	// Category
#STRING#=#13#=#13##14##NEWLINE#___________#NEWLINE#
#PUTCURSOR#=#08#
#FIND#=textPrimary"><a href="#09#
#LOADCURSOR#=#08#

:LOOPINCAT 
#FIND#=><a href="/title/#11#
#IF# #11#==#00#
     #GOTO#=NOMORECONNECTIONS
#ENDIF#
#FIND#=</div></div></ul><div class="ipc-html-content#12#
#TAKEWORD#=#14#,#11#,#12#
#CHECKVARIABLE#=#14#
#STARTDELETETAGS#
#STARTSUBSTITUTETAGS#
#CLEARTAGS#
#TRIM#
#STRING#=#13#=#13##14##NEWLINE##NEWLINE#
#FIND#=<p><a href="/title/#11#
#IF# #11#<<#09#
	#GOTO#=LOOPINCAT
#ENDIF#
#IF# #09#>>#00#
     #LOADCURSOR#=#09#
     #GOTO#=LOOPCAT
#ENDIF#

:NOMORECONNECTIONS 
#TAKEMOVIECONNECTION#=#13#

:NOCONNECTIONS 
#ENDTRAILER#


Mawu
Senior Member

Germany
1035 Posts

Posted - 19 Jan 2025 :  13:56:30
quote:
Well, Connections are now working. But there is a limit on the length: 65,000 characters. For some movies, like The Matrix, that's not enough. Does anyone have an idea of how to limit the list?


The 65,000-character limit is the limit of the text field in Access, so there is nothing you can do about it. The only way would be to cut one of the "connections" subcategories. The first candidate would be "Featured in".
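
The idea of cutting subcategories can be sketched like this in Python (not MagicScript, just to show the logic). The names render, fit_to_limit and drop_order are made up. The layout (category name, a line of underscores, then the entries separated by blank lines) follows the script posted above, and 65000 is the limit quoted in this thread. Line breaks are counted as one character here, which may differ from how the program stores them.

```python
from typing import Dict, List, Sequence

TEXT_FIELD_LIMIT = 65000  # limit quoted in this thread
SEPARATOR = "___________"


def render(categories: Dict[str, List[str]]) -> str:
    """Build the connections text: category, separator line, then entries."""
    parts = []
    for name, entries in categories.items():
        parts.append(f"{name}\n{SEPARATOR}\n")
        parts.extend(f"{entry}\n\n" for entry in entries)
    return "".join(parts)


def fit_to_limit(
    categories: Dict[str, List[str]],
    drop_order: Sequence[str] = ("Featured in",),
    limit: int = TEXT_FIELD_LIMIT,
) -> str:
    """Drop whole subcategories, in drop_order, until the text fits."""
    remaining = dict(categories)
    text = render(remaining)
    for name in drop_order:
        if len(text) <= limit:
            break
        remaining.pop(name, None)  # ignore categories that are not present
        text = render(remaining)
    # Last resort if it is still too long: hard cut at the limit.
    return text[:limit]


demo = {"Follows": ["Some Movie (1999)"], "Featured in": ["x" * 100]}
print(fit_to_limit(demo, limit=60))
```

With the default drop_order only "Featured in" is removed. Further subcategories could be appended to the tuple in whatever order seems least painful to lose.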

www.schlemmerkino.de - The No. 1 for cinematic treats